精品自在线观看影片天天看_久久人人爽人人爽人人片69AV_日本韩国欧美一区二区三区_欧美日韩国产中文字幕_亚洲欧洲精品无码AV一区_日本日本乱码伦视频在线播放_亚洲无码国产精品

Meiyue Song,Zhihua Yu,Jiaxin Wang,Jiarui Wang,Yuting Lu,Baicun Li,Xiaoxu Wang,Qinghua Huang,Zhijun Li,Nikolaos I. Kanellakis,Jiangfeng Liu,Jing Wang,Binglu Wang,Juntao Yang

from arxiv, submitted to Medical Image Analysis

The conventional pretraining-and-finetuning paradigm, while effective for common diseases with ample data, faces challenges in diagnosing data-scarce occupational diseases like pneumoconiosis. Recently, large language models (LLMs) have exhibits unprecedented ability when conducting multiple tasks in dialogue, bringing opportunities to diagnosis. A common strategy might involve using adapter layers for vision-language alignment and diagnosis in a dialogic manner. Yet, this approach often requires optimization of extensive learnable parameters in the text branch and the dialogue head, potentially diminishing the LLMs' efficacy, especially with limited training data. In our work, we innovate by eliminating the text branch and substituting the dialogue head with a classification head. This approach presents a more effective method for harnessing LLMs in diagnosis with fewer learnable parameters. Furthermore, to balance the retention of detailed image information with progression towards accurate diagnosis, we introduce the contextual multi-token engine. This engine is specialized in adaptively generating diagnostic tokens. Additionally, we propose the information emitter module, which unidirectionally emits information from image tokens to diagnosis tokens. Comprehensive experiments validate the superiority of our methods and the effectiveness of proposed modules. Our codes can be found at //github.com/CodeMonsterPHD/PneumoLLM/tree/main.

相關內容

大語言(yan)模(mo)型

關注 56

大語言(yan)(yan)(yan)模(mo)(mo)(mo)型是(shi)基于(yu)海量(liang)文本(ben)數據訓練(lian)的(de)(de)(de)(de)深(shen)度學習模(mo)(mo)(mo)型。它(ta)不僅能(neng)夠(gou)生(sheng)成自然(ran)語言(yan)(yan)(yan)文本(ben)，還能(neng)夠(gou)深(shen)入理解(jie)文本(ben)含(han)義，處理各種自然(ran)語言(yan)(yan)(yan)任務(wu)，如文本(ben)摘要、問(wen)答(da)、翻譯等(deng)(deng)。2023年，大語言(yan)(yan)(yan)模(mo)(mo)(mo)型及(ji)其在人(ren)(ren)(ren)工智能(neng)領域(yu)的(de)(de)(de)(de)應(ying)用已成為(wei)全球科(ke)技研究的(de)(de)(de)(de)熱點，其在規模(mo)(mo)(mo)上的(de)(de)(de)(de)增長尤為(wei)引人(ren)(ren)(ren)注目，參(can)數量(liang)已從最初的(de)(de)(de)(de)十幾(ji)億躍(yue)升(sheng)到(dao)如今的(de)(de)(de)(de)一萬億。參(can)數量(liang)的(de)(de)(de)(de)提升(sheng)使得模(mo)(mo)(mo)型能(neng)夠(gou)更加(jia)精(jing)細地(di)捕捉(zhuo)人(ren)(ren)(ren)類(lei)(lei)語言(yan)(yan)(yan)微妙(miao)之處，更加(jia)深(shen)入地(di)理解(jie)人(ren)(ren)(ren)類(lei)(lei)語言(yan)(yan)(yan)的(de)(de)(de)(de)復雜(za)性(xing)。在過去(qu)的(de)(de)(de)(de)一年里(li)，大語言(yan)(yan)(yan)模(mo)(mo)(mo)型在吸納新知識、分解(jie)復雜(za)任務(wu)以及(ji)圖(tu)文對齊等(deng)(deng)多方(fang)面都有顯著(zhu)提升(sheng)。隨(sui)著(zhu)技術的(de)(de)(de)(de)不斷(duan)成熟，它(ta)將不斷(duan)拓展其應(ying)用范圍(wei)，為(wei)人(ren)(ren)(ren)類(lei)(lei)提供更加(jia)智能(neng)化(hua)和個性(xing)化(hua)的(de)(de)(de)(de)服務(wu)，進(jin)一步(bu)改善(shan)人(ren)(ren)(ren)們的(de)(de)(de)(de)生(sheng)活和生(sheng)產方(fang)式。

語言模型化 · 知識 (knowledge) · 大語言模型 · MoDELS · CC ·

2024 年 1 月 26 日

Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora

Zhaoye Fei,Yunfan Shao,Linyang Li,Zhiyuan Zeng,Hang Yan,Xipeng Qiu,Dahua Lin

Large language models have demonstrated remarkable potential in various tasks, however, there remains a significant scarcity of open-source models and data for specific domains. Previous works have primarily focused on manually specifying resources and collecting high-quality data on specific domains, which significantly consume time and effort. To address this limitation, we propose an efficient data collection method~\textit{Query of CC} based on large language models. This method bootstraps seed information through a large language model and retrieves related data from public corpora. It not only collects knowledge-related data for specific domains but unearths the data with potential reasoning procedures. Through the application of this method, we have curated a high-quality dataset called~\textsc{Knowledge Pile}, encompassing four major domains, including stem and humanities sciences, among others. Experimental results demonstrate that~\textsc{Knowledge Pile} significantly improves the performance of large language models in mathematical and knowledge-related reasoning ability tests. To facilitate academic sharing, we open-source our dataset and code, providing valuable support to the academic community.

INTERACT · Agent · 流形 · Learning · MoDELS ·

2024 年 1 月 25 日

STEMFold: Stochastic Temporal Manifold for Multi-Agent Interactions in the Presence of Hidden Agents

Hemant Kumawat,Biswadeep Chakraborty,Saibal Mukhopadhyay

from arxiv, Under review as a conference paper at $6^{th}$ Annual Learning for Dynamics & Control Conference 2024

Learning accurate, data-driven predictive models for multiple interacting agents following unknown dynamics is crucial in many real-world physical and social systems. In many scenarios, dynamics prediction must be performed under incomplete observations, i.e., only a subset of agents are known and observable from a larger topological system while the behaviors of the unobserved agents and their interactions with the observed agents are not known. When only incomplete observations of a dynamical system are available, so that some states remain hidden, it is generally not possible to learn a closed-form model in these variables using either analytic or data-driven techniques. In this work, we propose STEMFold, a spatiotemporal attention-based generative model, to learn a stochastic manifold to predict the underlying unmeasured dynamics of the multi-agent system from observations of only visible agents. Our analytical results motivate STEMFold design using a spatiotemporal graph with time anchors to effectively map the observations of visible agents to a stochastic manifold with no prior information about interaction graph topology. We empirically evaluated our method on two simulations and two real-world datasets, where it outperformed existing networks in predicting complex multiagent interactions, even with many unobserved agents.

磁流變材料 · 圖像分割 · 示例 · MoDELS · 數據集 ·

2024 年 1 月 25 日

SymTC: A Symbiotic Transformer-CNN Net for Instance Segmentation of Lumbar Spine MRI

Jiasong Chen,Linchen Qian,Linhai Ma,Timur Urakov,Weiyong Gu,Liang Liang

Intervertebral disc disease, a prevalent ailment, frequently leads to intermittent or persistent low back pain, and diagnosing and assessing of this disease rely on accurate measurement of vertebral bone and intervertebral disc geometries from lumbar MR images. Deep neural network (DNN) models may assist clinicians with more efficient image segmentation of individual instances (disks and vertebrae) of the lumbar spine in an automated way, which is termed as instance image segmentation. In this work, we proposed SymTC, an innovative lumbar spine MR image segmentation model that combines the strengths of Transformer and Convolutional Neural Network (CNN). Specifically, we designed a parallel dual-path architecture to merge CNN layers and Transformer layers, and we integrated a novel position embedding into the self-attention module of Transformer, enhancing the utilization of positional information for more accurate segmentation. To further improves model performance, we introduced a new data augmentation technique to create synthetic yet realistic MR image dataset, named SSMSpine, which is made publicly available. We evaluated our SymTC and the other 15 existing image segmentation models on our private in-house dataset and the public SSMSpine dataset, using two metrics, Dice Similarity Coefficient and 95% Hausdorff Distance. The results show that our SymTC has the best performance for segmenting vertebral bones and intervertebral discs in lumbar spine MR images. The SymTC code and SSMSpine dataset are available at //github.com/jiasongchen/SymTC.

MoDELS · 表示 · state-of-the-art · 分離的 · ACM Multimedia ·

2024 年 1 月 25 日

HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks

Filip Szatkowski,Karol J. Piczak,Przemys?aw Spurek,Jacek Tabor,Tomasz Trzciński

from arxiv, NeurIPS 2022 MetaLearn workshop

Implicit neural representations (INRs) are a rapidly growing research field, which provides alternative ways to represent multimedia signals. Recent applications of INRs include image super-resolution, compression of high-dimensional signals, or 3D rendering. However, these solutions usually focus on visual data, and adapting them to the audio domain is not trivial. Moreover, it requires a separately trained model for every data sample. To address this limitation, we propose HyperSound, a meta-learning method leveraging hypernetworks to produce INRs for audio signals unseen at training time. We show that our approach can reconstruct sound waves with quality comparable to other state-of-the-art models.

估計/估計量 · 優化器 · 樣本 · Performer · ONCE ·

2024 年 1 月 24 日

Early Detection of Treatments Side Effect: A Sequential Approach

Jiayue Wang,Ben Boukai

from arxiv, There are 21 pages, 8 pictures and 4 tables

With the emergence and spread of infectious diseases with pandemic potential, such as COVID- 19, the urgency for vaccine development have led to unprecedented compressed and accelerated schedules that shortened the standard development timeline. In a relatively short time, the leading pharmaceutical companies1, received an Emergency Use Authorization (EUA) for vaccine\prime s en-mass deployment To monitor the potential side effect(s) of the vaccine during the (initial) vaccination campaign, we developed an optimal sequential test that allows for the early detection of potential side effect(s). This test employs a rule to stop the vaccination process once the observed number of side effect incidents exceeds a certain (pre-determined) threshold. The optimality of the proposed sequential test is justified when compared with the ({\alpha}, {\beta}) optimality of the non-randomized fixed-sample Uniformly Most Powerful (UMP) test. In the case of a single side effect, we study the properties of the sequential test and derive the exact expressions of the Average Sample Number (ASN) curve of the stopping time (and its variance) via the regularized incomplete beta function. Additionally, we derive the asymptotic distribution of the relative savings in ASN as compared to maximal sample size. Moreover, we construct the post-test parameter estimate and studied its sampling properties, including its asymptotic behavior under local-type alternatives. These limiting behavior results are the consistency and asymptotic normality of the post-test parameter estimator. We conclude the paper with a small simulation study illustrating the asymptotic performance of the point and interval estimation and provide a detailed example, based on COVID-19 side effect data (see Beatty et al. (2021)) of our suggested testing procedure.

Principle · AI · 設計 · AIM · Responsible AI ·

2024 年 1 月 24 日

AI Ethics Principles in Practice: Perspectives of Designers and Developers

Conrad Sanderson,David Douglas,Qinghua Lu,Emma Schleiger,Jon Whittle,Justine Lacey,Glenn Newnham,Stefan Hajkowicz,Cathy Robinson,David Hansen

from arxiv, submitted to IEEE Transactions on Technology & Society

As consensus across the various published AI ethics principles is approached, a gap remains between high-level principles and practical techniques that can be readily adopted to design and develop responsible AI systems. We examine the practices and experiences of researchers and engineers from Australia's national scientific research agency (CSIRO), who are involved in designing and developing AI systems for many application areas. Semi-structured interviews were used to examine how the practices of the participants relate to and align with a set of high-level AI ethics principles proposed by the Australian Government. The principles comprise: (1) privacy protection and security, (2) reliability and safety, (3) transparency and explainability, (4) fairness, (5) contestability, (6) accountability, (7) human-centred values, (8) human, social and environmental wellbeing. Discussions on the gained insights from the interviews include various tensions and trade-offs between the principles, and provide suggestions for implementing each high-level principle. We also present suggestions aiming to enhance associated support mechanisms.

大語言模型 · Prompt · 情景 · MoDELS · state-of-the-art ·

2024 年 1 月 23 日

The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

Lingfeng Shen,Weiting Tan,Sihao Chen,Yunmo Chen,Jingyu Zhang,Haoran Xu,Boyuan Zheng,Philipp Koehn,Daniel Khashabi

As the influence of large language models (LLMs) spans across global communities, their safety challenges in multilingual settings become paramount for alignment research. This paper examines the variations in safety challenges faced by LLMs across different languages and discusses approaches to alleviating such concerns. By comparing how state-of-the-art LLMs respond to the same set of malicious prompts written in higher- vs. lower-resource languages, we observe that (1) LLMs tend to generate unsafe responses much more often when a malicious prompt is written in a lower-resource language, and (2) LLMs tend to generate more irrelevant responses to malicious prompts in lower-resource languages. To understand where the discrepancy can be attributed, we study the effect of instruction tuning with reinforcement learning from human feedback (RLHF) or supervised finetuning (SFT) on the HH-RLHF dataset. Surprisingly, while training with high-resource languages improves model alignment, training in lower-resource languages yields minimal improvement. This suggests that the bottleneck of cross-lingual alignment is rooted in the pretraining stage. Our findings highlight the challenges in cross-lingual LLM safety, and we hope they inform future research in this direction.

INTERACT · Agent · 語言模型化 · 回合 · Next ·

2023 年 8 月 6 日

Generative Agents: Interactive Simulacra of Human Behavior

Joon Sung Park,Joseph C. O'Brien,Carrie J. Cai,Meredith Ringel Morris,Percy Liang,Michael S. Bernstein

Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools. In this paper, we introduce generative agents--computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; they form opinions, notice each other, and initiate conversations; they remember and reflect on days past as they plan the next day. To enable generative agents, we describe an architecture that extends a large language model to store a complete record of the agent's experiences using natural language, synthesize those memories over time into higher-level reflections, and retrieve them dynamically to plan behavior. We instantiate generative agents to populate an interactive sandbox environment inspired by The Sims, where end users can interact with a small town of twenty five agents using natural language. In an evaluation, these generative agents produce believable individual and emergent social behaviors: for example, starting with only a single user-specified notion that one agent wants to throw a Valentine's Day party, the agents autonomously spread invitations to the party over the next two days, make new acquaintances, ask each other out on dates to the party, and coordinate to show up for the party together at the right time. We demonstrate through ablation that the components of our agent architecture--observation, planning, and reflection--each contribute critically to the believability of agent behavior. By fusing large language models with computational, interactive agents, this work introduces architectural and interaction patterns for enabling believable simulations of human behavior.

Neural Networks · 圖 · Networks · 圖神經網絡 · 可辨認的 ·

2022 年 2 月 28 日

Hyperbolic Graph Neural Networks: A Review of Methods and Applications

Menglin Yang,Min Zhou,Zhihao Li,Jiahong Liu,Lujia Pan,Hui Xiong,Irwin King

Graph neural networks generalize conventional neural networks to graph-structured data and have received widespread attention due to their impressive representation ability. In spite of the remarkable achievements, the performance of Euclidean models in graph-related learning is still bounded and limited by the representation ability of Euclidean geometry, especially for datasets with highly non-Euclidean latent anatomy. Recently, hyperbolic space has gained increasing popularity in processing graph data with tree-like structure and power-law distribution, owing to its exponential growth property. In this survey, we comprehensively revisit the technical details of the current hyperbolic graph neural networks, unifying them into a general framework and summarizing the variants of each component. More importantly, we present various HGNN-related applications. Last, we also identify several challenges, which potentially serve as guidelines for further flourishing the achievements of graph learning in hyperbolic spaces.

Continuity · 學成 · Vision · 計算機視覺 · 批量學習 ·

2021 年 9 月 23 日

Recent Advances of Continual Learning in Computer Vision: An Overview

Haoxuan Qu,Hossein Rahmani,Li Xu,Bryan Williams,Jun Liu

from arxiv, 21 pages, 5 figures

In contrast to batch learning where all training data is available at once, continual learning represents a family of methods that accumulate knowledge and learn continuously with data available in sequential order. Similar to the human learning process with the ability of learning, fusing, and accumulating new knowledge coming at different time steps, continual learning is considered to have high practical significance. Hence, continual learning has been studied in various artificial intelligence tasks. In this paper, we present a comprehensive review of the recent progress of continual learning in computer vision. In particular, the works are grouped by their representative techniques, including regularization, knowledge distillation, memory, generative replay, parameter isolation, and a combination of the above techniques. For each category of these techniques, both its characteristics and applications in computer vision are presented. At the end of this overview, several subareas, where continuous knowledge accumulation is potentially helpful while continual learning has not been well studied, are discussed.