Agent-based modeling and simulation has evolved as a powerful tool for modeling complex systems, offering insights into emergent behaviors and interactions among diverse agents. Integrating large language models into agent-based modeling and simulation presents a promising avenue for enhancing simulation capabilities. This paper surveys the landscape of utilizing large language models in agent-based modeling and simulation, examining their challenges and promising future directions. Because the field is interdisciplinary, we first introduce the background of agent-based modeling and simulation and of large language model-empowered agents. We then discuss the motivation for applying large language models to agent-based simulation and systematically analyze the challenges in environment perception, human alignment, action generation, and evaluation. Most importantly, we provide a comprehensive overview of recent work on large language model-empowered agent-based modeling and simulation in multiple scenarios, which can be divided into four domains: cyber, physical, social, and hybrid, covering simulation of both real-world and virtual environments. Finally, since this area is new and quickly evolving, we discuss the open problems and promising future directions.

Related content

Large language models

A large language model is a deep learning model trained on massive amounts of text data. It can not only generate natural-language text but also understand the meaning of text in depth and handle a variety of natural language tasks, such as text summarization, question answering, and translation. In 2023, large language models and their applications in artificial intelligence became a focus of technology research worldwide. Their growth in scale has been especially striking, with parameter counts jumping from just over a billion at the outset to one trillion today. The increase in parameters lets a model capture the subtleties of human language more finely and understand its complexity more deeply. Over the past year, large language models have improved markedly in absorbing new knowledge, decomposing complex tasks, and aligning images with text. As the technology matures, its range of applications will continue to expand, providing more intelligent and personalized services and further improving how people live and work.

Large language models · Representation · surge · Extensibility · state-of-the-art

February 7, 2024

RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation

Xiaohan Yu,Li Zhang,Xin Zhao,Yue Wang,Zhongrui Ma

from arxiv, 10 pages

Large language models (LLMs) have recently emerged as a powerful tool for a variety of natural language processing tasks, prompting a new surge of work that combines LLMs with recommendation systems, termed LLM-based RS. Current approaches generally fall into two main paradigms, the ID direct usage paradigm and the ID translation paradigm, whose core weakness stems from a lack of recommendation knowledge and uniqueness. To address this limitation, we propose a new paradigm, ID representation, which incorporates pre-trained ID embeddings into LLMs in a complementary manner. In this work, we present RA-Rec, an efficient ID representation alignment framework for LLM-based recommendation, which is compatible with multiple ID-based methods and LLM architectures. Specifically, we treat ID embeddings as soft prompts and design an innovative alignment module and an efficient tuning method with tailored data construction for alignment. Extensive experiments demonstrate that RA-Rec substantially outperforms current state-of-the-art methods, achieving up to 3.0% absolute HitRate@100 improvements while utilizing less than 10x training data.
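
The abstract reports results in HitRate@100 without defining the metric. As a rough illustration, the sketch below computes HitRate@K under one common convention: each user has a single held-out item, and a hit is counted when that item appears in the user's top-K ranked list. The function name, the dictionary layout, and the toy data are assumptions made for this example and are not taken from RA-Rec.

```python
from typing import Dict, Sequence


def hit_rate_at_k(ranked: Dict[str, Sequence[str]],
                  held_out: Dict[str, str],
                  k: int) -> float:
    """Fraction of users whose held-out item appears in their top-k list.

    Assumed convention: one held-out item per user (hypothetical setup).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    # Only score users for whom we have both a ranking and a held-out item.
    users = [u for u in held_out if u in ranked]
    if not users:
        return 0.0
    hits = sum(1 for u in users if held_out[u] in ranked[u][:k])
    return hits / len(users)


# Toy data: four users, three ranked items each.
ranked = {
    "u1": ["a", "b", "c"],
    "u2": ["d", "e", "f"],
    "u3": ["g", "h", "i"],
    "u4": ["j", "k", "l"],
}
held_out = {"u1": "b", "u2": "f", "u3": "z", "u4": "j"}

rate_at_2 = hit_rate_at_k(ranked, held_out, k=2)  # u1 and u4 hit
rate_at_3 = hit_rate_at_k(ranked, held_out, k=3)  # u1, u2 and u4 hit
```

Under this convention, an "absolute" improvement of 3.0% means the hit rate itself rises by 0.03, for example from 0.50 to 0.53.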

Functional · Transformation · Composite data · MoDELS · Reviewer

February 5, 2024

FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion

Xing Han,Huy Nguyen,Carl Harris,Nhat Ho,Suchi Saria

from arxiv, 35 pages, 8 tables, 5 figures

As machine learning models in critical fields increasingly grapple with multimodal data, they face the dual challenges of handling a wide array of modalities, often incomplete due to missing elements, and the temporal irregularity and sparsity of collected samples. Successfully leveraging this complex data, while overcoming the scarcity of high-quality training samples, is key to improving these models' predictive performance. We introduce "FuseMoE", a mixture-of-experts framework with an innovative gating function. Designed to integrate a diverse number of modalities, FuseMoE is effective in managing scenarios with missing modalities and irregularly sampled data trajectories. Theoretically, our unique gating function contributes to enhanced convergence rates, leading to better performance in multiple downstream tasks. The practical utility of FuseMoE in the real world is validated by a challenging set of clinical risk prediction tasks.
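
The abstract does not specify FuseMoE's gating function, so the sketch below is not that method. It only illustrates the general idea of a gated mixture that tolerates missing modalities: a plain softmax gate whose weights are renormalized over the experts that actually received input. The one-expert-per-modality layout, the scalar expert outputs, and all names are assumptions made for this example.

```python
import math
from typing import List, Optional


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a non-empty list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(expert_outputs: List[Optional[float]],
         gate_scores: List[float]) -> float:
    """Gate-weighted sum over the experts whose modality is present.

    A missing modality is marked with None; its expert is dropped and the
    softmax is taken over the remaining gate scores only.
    """
    present = [i for i, out in enumerate(expert_outputs) if out is not None]
    if not present:
        raise ValueError("at least one modality must be present")
    weights = softmax([gate_scores[i] for i in present])
    return sum(w * expert_outputs[i] for w, i in zip(weights, present))


# All modalities present, equal gate scores: a simple average.
full = fuse([1.0, 3.0], [0.0, 0.0])

# Second modality missing: its large gate score is ignored entirely.
partial = fuse([1.0, None, 3.0], [0.0, 5.0, 0.0])
```

Renormalizing over the present experts keeps the weights summing to one, so a missing modality changes which experts contribute rather than shrinking the output.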

3D · MoDELS · Denoising · Optimizer · Graph

February 5, 2024

Geometry-Complete Diffusion for 3D Molecule Generation and Optimization

Alex Morehead,Jianlin Cheng

from arxiv, 27 pages, 5 figures, 5 tables. Under review. Also presented at ICLR 2023's MLDD workshop. Code available at //github.com/BioinfoMachineLearning/Bio-Diffusion

Denoising diffusion probabilistic models (DDPMs) have recently taken the field of generative modeling by storm, pioneering new state-of-the-art results in disciplines such as computer vision and computational biology for diverse tasks ranging from text-guided image generation to structure-guided protein design. Along this latter line of research, methods have recently been proposed for generating 3D molecules using equivariant graph neural networks (GNNs) within a DDPM framework. However, such methods are unable to learn important geometric and physical properties of 3D molecules during molecular graph generation, as they adopt molecule-agnostic and non-geometric GNNs as their 3D graph denoising networks, which negatively impacts their ability to effectively scale to datasets of large 3D molecules. In this work, we address these gaps by introducing the Geometry-Complete Diffusion Model (GCDM) for 3D molecule generation, which outperforms existing 3D molecular diffusion models by significant margins across conditional and unconditional settings for the QM9 dataset as well as for the larger GEOM-Drugs dataset. Importantly, we demonstrate that the geometry-complete denoising process GCDM learns for 3D molecule generation allows the model to generate realistic and stable large molecules at the scale of GEOM-Drugs, whereas previous methods fail to do so with the features they learn. Additionally, we show that extensions of GCDM can not only effectively design 3D molecules for specific protein pockets but also that GCDM's geometric features can effectively be repurposed to directly optimize the geometry and chemical composition of existing 3D molecules for specific molecular properties, demonstrating new, real-world versatility of molecular diffusion models. Our source code and data are freely available at //github.com/BioinfoMachineLearning/Bio-Diffusion.

Large language models · Visual recognition system · MoDELS · state-of-the-art · Language modeling

February 3, 2024

Vi(E)va LLM! A Conceptual Stack for Evaluating and Interpreting Generative AI-based Visualizations

Luca Podo,Muhammad Ishmal,Marco Angelini

The automatic generation of visualizations is a long-standing task that has attracted growing interest from the research and practitioner communities over the years. Recently, large language models (LLMs) have become an interesting option for supporting generative tasks related to visualization, demonstrating promising initial results. At the same time, several pitfalls, like the multiple ways of instructing an LLM to generate the desired result, the different perspectives leading the generation (code-based, image-based, grammar-based), and the presence of hallucinations even for the visualization generation task, make their usage less affordable than expected. Following similar initiatives for benchmarking LLMs, this paper addresses the problem of modeling the evaluation of a visualization generated by an LLM. We propose a theoretical evaluation stack, EvaLLM, that decomposes the evaluation effort into its atomic components, characterizes their nature, and provides an overview of how to implement and interpret them. We also designed and implemented an evaluation platform that provides a benchmarking resource for the visualization generation task. The platform supports automatic and manual scoring conducted by multiple assessors to support a fine-grained and semantic evaluation based on the EvaLLM stack. Two case studies on GPT3.5-turbo with Code Interpreter and Llama2-70-b models show the benefits of EvaLLM and illustrate interesting results on the current state-of-the-art LLM-generated visualizations.

Guidance · MoDELS · Scaling · ForCES · DDPM

February 2, 2024

Characteristic Guidance: Non-linear Correction for Diffusion Model at Large Guidance Scale

Candi Zheng,Yuan Lan

from arxiv, 8 pages, 7 figures

Popular guidance for denoising diffusion probabilistic models (DDPMs) linearly combines distinct conditional models to provide enhanced control over samples. However, this approach overlooks nonlinear effects that become significant when the guidance scale is large. To address this issue, we propose characteristic guidance, a guidance method that provides a first-principles non-linear correction for classifier-free guidance. Such correction forces the guided DDPMs to respect the Fokker-Planck (FP) equation of the diffusion process, in a way that is training-free and compatible with existing sampling methods. Experiments show that characteristic guidance enhances semantic characteristics of prompts and mitigates irregularities in image generation, proving effective in diverse applications ranging from simulating magnet phase transitions to latent space sampling.
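
For readers unfamiliar with the linear combination that the abstract refers to, the sketch below shows standard classifier-free guidance in one common parameterization, where a scale of 1 recovers the conditional prediction and larger scales extrapolate away from the unconditional one. It does not implement the paper's characteristic guidance correction. Plain Python lists stand in for noise-prediction tensors, and the function name is an assumption made for this example.

```python
from typing import List


def cfg_combine(eps_uncond: List[float],
                eps_cond: List[float],
                scale: float) -> List[float]:
    """Linear classifier-free guidance: eps_u + scale * (eps_c - eps_u).

    Assumed convention: scale = 0 gives the unconditional prediction,
    scale = 1 gives the conditional one, scale > 1 extrapolates.
    """
    if len(eps_uncond) != len(eps_cond):
        raise ValueError("predictions must have the same length")
    return [u + scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]


eps_u = [0.0, 1.0]  # toy unconditional noise prediction
eps_c = [1.0, 3.0]  # toy conditional noise prediction

unguided = cfg_combine(eps_u, eps_c, scale=0.0)     # equals eps_u
conditional = cfg_combine(eps_u, eps_c, scale=1.0)  # equals eps_c
amplified = cfg_combine(eps_u, eps_c, scale=2.0)    # extrapolates past eps_c
```

The output grows linearly with the scale, which is the regime where, according to the abstract, nonlinear effects become significant.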

3D · Elevate · surge · Taxonomy · Lecture notes

October 24, 2023

Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation

Yinjie Lei,Zixuan Wang,Feng Chen,Guoqing Wang,Peng Wang,Yang Yang

Multi-modal 3D scene understanding has gained considerable attention due to its wide applications in many areas, such as autonomous driving and human-computer interaction. Compared to conventional single-modal 3D understanding, introducing an additional modality not only elevates the richness and precision of scene interpretation but also ensures a more robust and resilient understanding. This becomes especially crucial in varied and challenging environments where solely relying on 3D data might be inadequate. While there has been a surge in the development of multi-modal 3D methods over the past three years, especially those integrating multi-camera images (3D+2D) and textual descriptions (3D+language), a comprehensive and in-depth review is notably absent. In this article, we present a systematic survey of recent progress to bridge this gap. We begin by briefly introducing a background that formally defines various 3D multi-modal tasks and summarizes their inherent challenges. After that, we present a novel taxonomy that delivers a thorough categorization of existing methods according to modalities and tasks, exploring their respective strengths and limitations. Furthermore, comparative results of recent approaches on several benchmark datasets, together with insightful analysis, are offered. Finally, we discuss the unresolved issues and provide several potential avenues for future research.

Graph · Anomaly detection · Networking · Neural Networks · Learning

September 29, 2022

Graph Anomaly Detection with Graph Neural Networks: Current Status and Challenges

Hwan Kim,Byung Suk Lee,Won-Yong Shin,Sungsu Lim

from arxiv, 9 pages, 2 figures, 1 table; to appear in the IEEE Access (Please cite our journal version.)

Graphs are used widely to model complex systems, and detecting anomalies in a graph is an important task in the analysis of complex systems. Graph anomalies are patterns in a graph that do not conform to normal patterns expected of the attributes and/or structures of the graph. In recent years, graph neural networks (GNNs) have been studied extensively and have successfully performed difficult machine learning tasks in node classification, link prediction, and graph classification thanks to the highly expressive capability via message passing in effectively learning graph representations. To solve the graph anomaly detection problem, GNN-based methods leverage information about the graph attributes (or features) and/or structures to learn to score anomalies appropriately. In this survey, we review the recent advances made in detecting graph anomalies using GNN models. Specifically, we summarize GNN-based methods according to the graph type (i.e., static and dynamic), the anomaly type (i.e., node, edge, subgraph, and whole graph), and the network architecture (e.g., graph autoencoder, graph convolutional network). To the best of our knowledge, this survey is the first comprehensive review of graph anomaly detection methods based on GNNs.

Learning · Graph · Extensibility · motivation · Lecture notes

June 27, 2022

FederatedScope-GNN: Towards a Unified, Comprehensive and Efficient Package for Federated Graph Learning

Zhen Wang,Weirui Kuang,Yuexiang Xie,Liuyi Yao,Yaliang Li,Bolin Ding,Jingren Zhou

from arxiv, Accepted by KDD'2022; We have released FederatedScope for users on //github.com/alibaba/FederatedScope

The incredible development of federated learning (FL) has benefited various tasks in the domains of computer vision and natural language processing, and existing frameworks such as TFF and FATE have made deployment easy in real-world applications. However, federated graph learning (FGL), even though graph data are prevalent, has not been well supported due to its unique characteristics and requirements. The lack of an FGL-related framework increases the effort needed to accomplish reproducible research and to deploy in real-world applications. Motivated by such strong demand, in this paper, we first discuss the challenges in creating an easy-to-use FGL package and accordingly present our implemented package FederatedScope-GNN (FS-G), which provides (1) a unified view for modularizing and expressing FGL algorithms; (2) comprehensive DataZoo and ModelZoo for out-of-the-box FGL capability; (3) an efficient model auto-tuning component; and (4) off-the-shelf privacy attack and defense abilities. We validate the effectiveness of FS-G by conducting extensive experiments, which also yield many valuable insights about FGL for the community. Moreover, we employ FS-G to serve the FGL application in real-world E-commerce scenarios, where the attained improvements indicate great potential business benefits. We publicly release FS-G, as submodules of FederatedScope, at //github.com/alibaba/FederatedScope to promote FGL's research and enable broad applications that would otherwise be infeasible due to the lack of a dedicated package.
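
The abstract does not say which aggregation rule FS-G uses. As one familiar illustration of the server-side step in federated learning, the sketch below shows FedAvg-style weighted averaging, where each client's parameters are weighted by its number of local training examples. This is not FederatedScope's API; the function name and the flat-list parameter representation are assumptions made for this example.

```python
from typing import List


def federated_average(client_weights: List[List[float]],
                      client_sizes: List[int]) -> List[float]:
    """Average client parameter vectors, weighted by local dataset size."""
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one size per client and at least one client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total number of examples must be positive")
    dim = len(client_weights[0])
    if any(len(w) != dim for w in client_weights):
        raise ValueError("all clients must share the same parameter shape")
    return [
        sum(w[j] * n for w, n in zip(client_weights, client_sizes)) / total
        for j in range(dim)
    ]


# Two clients; the second holds three times as much data as the first.
global_weights = federated_average(
    client_weights=[[1.0, 2.0], [3.0, 4.0]],
    client_sizes=[1, 3],
)
```

Only parameter vectors and example counts reach the server in this sketch; the raw client data, graph data in the FGL setting, stays local.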

Learned · Generalization theory · AIM · state-of-the-art · Reinforcement learning

October 24, 2019

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Tianhe Yu,Deirdre Quillen,Zhanpeng He,Ryan Julian,Karol Hausman,Chelsea Finn,Sergey Levine

from arxiv, CoRL 2019. Videos are here: meta-world.github.io and open-sourced codes are available at: //github.com/rlworkgroup/metaworld

Meta-reinforcement learning algorithms can enable robots to acquire new skills much more quickly, by leveraging prior experience to learn how to learn. However, much of the current research on meta-reinforcement learning focuses on task distributions that are very narrow. For example, a commonly used meta-reinforcement learning benchmark uses different running velocities for a simulated robot as different tasks. When policies are meta-trained on such narrow task distributions, they cannot possibly generalize to more quickly acquire entirely new tasks. Therefore, if the aim of these methods is to enable faster acquisition of entirely new behaviors, we must evaluate them on task distributions that are sufficiently broad to enable generalization to new behaviors. In this paper, we propose an open-source simulated benchmark for meta-reinforcement learning and multi-task learning consisting of 50 distinct robotic manipulation tasks. Our aim is to make it possible to develop algorithms that generalize to accelerate the acquisition of entirely new, held-out tasks. We evaluate 6 state-of-the-art meta-reinforcement learning and multi-task learning algorithms on these tasks. Surprisingly, while each task and its variations (e.g., with different object positions) can be learned with reasonable success, these algorithms struggle to learn with multiple tasks at the same time, even with as few as ten distinct training tasks. Our analysis and open-source environments pave the way for future research in multi-task learning and meta-learning that can enable meaningful generalization, thereby unlocking the full potential of these methods.

INTERACT · Sentiment classification · Factorized · INFORMS · Performer

January 18, 2018

Contextual and Position-Aware Factorization Machines for Sentiment Classification

Shuai Wang,Mianwei Zhou,Geli Fei,Yi Chang,Bing Liu

While existing machine learning models have achieved great success for sentiment classification, they typically do not explicitly capture sentiment-oriented word interaction, which can lead to poor results for fine-grained analysis at the snippet level (a phrase or sentence). Factorization Machines provide a possible approach to learning element-wise interaction for recommender systems, but they are not directly applicable to our task due to their inability to model contexts and word sequences. In this work, we develop two Position-aware Factorization Machines which consider word interaction, context and position information. Such information is jointly encoded in a set of sentiment-oriented word interaction (SWI) vectors. Compared to traditional word embeddings, SWI vectors explicitly capture sentiment-oriented word interaction and simplify the parameter learning. Experimental results show that while they have comparable performance with state-of-the-art methods for document-level classification, they benefit the snippet/sentence-level sentiment analysis.
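
To make the "element-wise interaction" of a Factorization Machine concrete, the sketch below implements the standard second-order FM prediction: a bias, a linear term, and pairwise terms weighted by the inner products of per-feature latent vectors. It covers only the plain FM, not the position-aware variants proposed in the paper. The function names and toy parameters are assumptions made for this example.

```python
from typing import List


def fm_predict(x: List[float], w0: float, w: List[float],
               v: List[List[float]]) -> float:
    """Second-order FM score using the linear-time pairwise identity.

    sum_{i<j} <v_i, v_j> x_i x_j
        = 0.5 * sum_f [ (sum_i v[i][f] x_i)^2 - sum_i (v[i][f] x_i)^2 ]
    """
    n, k = len(x), len(v[0])
    linear = sum(wi * xi for wi, xi in zip(w, x))
    pairwise = 0.0
    for f in range(k):
        s = sum(v[i][f] * x[i] for i in range(n))
        s_sq = sum((v[i][f] * x[i]) ** 2 for i in range(n))
        pairwise += 0.5 * (s * s - s_sq)
    return w0 + linear + pairwise


def fm_predict_naive(x: List[float], w0: float, w: List[float],
                     v: List[List[float]]) -> float:
    """Same score computed by looping over every feature pair i < j."""
    n = len(x)
    score = w0 + sum(wi * xi for wi, xi in zip(w, x))
    for i in range(n):
        for j in range(i + 1, n):
            dot = sum(a * b for a, b in zip(v[i], v[j]))
            score += dot * x[i] * x[j]
    return score


# Toy example: three features, two latent dimensions, third feature inactive.
x = [1.0, 1.0, 0.0]
w0, w = 0.5, [1.0, 2.0, 3.0]
v = [[1.0, 0.0], [2.0, 1.0], [5.0, 5.0]]

fast = fm_predict(x, w0, w, v)
slow = fm_predict_naive(x, w0, w, v)
```

Both functions sum over unordered feature pairs only, so nothing in this score depends on the order or position of the features, which is the limitation the abstract raises about modeling word sequences.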