
Large Language Models (LLMs) have the potential to fundamentally change the way people engage in computer programming. Agent-based modeling (ABM) has become ubiquitous in the natural and social sciences and in education, yet no prior studies have explored the potential of LLMs to support it. We designed NetLogo Chat to support the learning and practice of NetLogo, a programming language for ABM. To understand how users perceive, use, and need LLM-based interfaces, we interviewed 30 participants from global academia, industry, and graduate schools. Experts reported more perceived benefits than novices and were more inclined to adopt LLMs in their workflow. We found significant differences between experts and novices in their perceptions, behaviors, and needs for human-AI collaboration. We surfaced a knowledge gap between experts and novices as a possible reason for the benefit gap. We identified guidance, personalization, and integration as major needs for LLM-based interfaces to support the programming of ABM.

Related Content

Large language models are deep learning models trained on massive amounts of text data. They can not only generate natural-language text but also understand its meaning in depth, handling a wide range of natural language tasks such as summarization, question answering, and translation. In 2023, large language models and their applications in artificial intelligence became a global research focus, and their growth in scale has been especially striking: parameter counts have leapt from the initial billions to today's trillion. Larger parameter counts let models capture the subtleties of human language more finely and understand its complexity more deeply. Over the past year, large language models have improved markedly at absorbing new knowledge, decomposing complex tasks, and aligning text with images. As the technology continues to mature, it will keep expanding its range of applications, providing people with more intelligent and personalized services and further improving how we live and work.
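
As a concrete illustration of one of the tasks listed above, the sketch below runs a pretrained model on a summarization task through the Hugging Face pipeline API; the model checkpoint is an illustrative choice, not one prescribed by this text.

```python
# A minimal sketch of applying a pretrained language model to one of the
# natural language tasks mentioned above (summarization). The checkpoint
# name is an illustrative assumption.
from transformers import pipeline

summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

text = (
    "Large language models are deep learning models trained on massive text "
    "corpora. They can generate natural language, answer questions, and "
    "translate between languages."
)
summary = summarizer(text, max_length=40, min_length=10, do_sample=False)
print(summary[0]["summary_text"])
```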

Automated Program Repair (APR) has evolved significantly with the advent of Large Language Models (LLMs). Fine-tuning LLMs for program repair is a recent avenue of research, with many dimensions still unexplored. Existing work mostly fine-tunes LLMs with naive code representations and is fundamentally limited in its ability to fine-tune larger LLMs. To address this problem, we propose RepairLLaMA, a novel program repair approach that combines 1) code representations tailored for APR and 2) LoRA, a state-of-the-art parameter-efficient LLM fine-tuning technique. This results in RepairLLaMA producing a highly effective "program repair adapter" for fixing bugs with language models. Our experiments demonstrate the validity of both concepts. First, fine-tuning adapters with repair-specific code representations enables the model to use meaningful repair signals. Second, parameter-efficient fine-tuning helps the fine-tuning converge and contributes to the adapter's effectiveness in fixing data points outside the fine-tuning data distribution. Overall, RepairLLaMA correctly fixes 125 Defects4J v2 bugs and 82 HumanEval-Java bugs, outperforming all baselines.
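
For readers unfamiliar with parameter-efficient fine-tuning, the sketch below shows how a LoRA adapter is attached to a code LLM with the `peft` library. The base model and hyperparameters are illustrative assumptions, not the exact RepairLLaMA configuration.

```python
# A minimal sketch of parameter-efficient fine-tuning with LoRA via the
# `peft` library. Model name and hyperparameters are illustrative, not the
# RepairLLaMA setup.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained(
    "codellama/CodeLlama-7b-hf", torch_dtype=torch.float16
)
config = LoraConfig(
    r=8,                                  # low-rank dimension of the adapter
    lora_alpha=16,                        # scaling factor
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```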

Large Language Models (LLMs) like GPT-4 and LLaMA have shown incredible proficiency at natural language processing tasks and have even begun to excel at tasks across other modalities such as vision and audio. Despite their success, LLMs often struggle to perform well on low-resource languages because so little training data is available. This shortcoming is especially prevalent in open-source models. In this work, we explore training LLaMA-2 to speak Amharic, a language spoken by over 50 million people worldwide but with orders of magnitude less available data than languages like English. We employ methods previously used for training LLMs on other data-scarce languages, and use open-source translation models to perform data augmentation and grow our dataset from millions of tokens to billions. We further enhance the capabilities of our model by connecting an image encoder and training on a translated visual instruction tuning dataset in the same manner as LLaVA, resulting in a multimodal Amharic LLM that can understand images along with text. We introduce an Amharic version of a popular benchmarking dataset to evaluate our work. Our models and dataset are open-sourced and available on GitHub.
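
The sketch below illustrates the translation-based augmentation idea: an open-source translation model converts abundant English text into Amharic training examples. The NLLB checkpoint and language codes are assumptions for illustration, not necessarily the models used in the paper.

```python
# A hedged sketch of translation-based data augmentation: an open-source
# translation model turns English text into Amharic training data. The
# checkpoint and language codes are illustrative assumptions.
from transformers import pipeline

translator = pipeline(
    "translation",
    model="facebook/nllb-200-distilled-600M",
    src_lang="eng_Latn",
    tgt_lang="amh_Ethi",
)

english_corpus = [
    "The weather is nice today.",
    "Language models need large amounts of training data.",
]
# Each translated sentence becomes a new Amharic training example.
amharic_corpus = [t["translation_text"] for t in translator(english_corpus)]
print(amharic_corpus)
```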

In the rapidly evolving landscape of AI-mediated communication (AIMC), tools powered by Large Language Models (LLMs) are becoming integral to interpersonal communication. Employing a mixed-methods approach, we conducted a one-week diary and interview study to explore users' perceptions of these tools' ability to: 1) support interpersonal communication in the short term, and 2) lead to potential long-term effects. Our findings indicate that participants view AIMC support favorably, citing benefits such as increased communication confidence, help finding precise language to express their thoughts, and support in navigating linguistic and cultural barriers. However, the study also uncovers current limitations of AIMC tools, including verbosity, unnatural responses, and excessive emotional intensity. These shortcomings are further exacerbated by user concerns about inauthenticity and potential overreliance on the technology. Furthermore, we identified four key communication spaces, delineated by communication stakes (high or low) and relationship dynamics (formal or informal), that differentially predict users' attitudes toward AIMC tools. Specifically, participants found the tools more suitable for communication in formal relationships than in informal ones, and more beneficial in high-stakes than in low-stakes communication.

Hyper-redundant Robotic Manipulators (HRMs) offer great dexterity and operational flexibility, but solving their Inverse Kinematics (IK) is challenging. In this work, we introduce VO-FABRIK, an algorithm that combines Forward and Backward Reaching Inverse Kinematics (FABRIK) for repeatable, deterministic IK computation with an approach inspired by velocity obstacles to perform path planning under collision and joint-limit constraints. We show preliminary results on an industrial HRM with 19 actuated joints. Our algorithm achieves good performance where a state-of-the-art IK solver fails.
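
For context, the sketch below implements the classic FABRIK forward-and-backward reaching pass in 2D, without the velocity-obstacle extension or the collision and joint-limit handling described above.

```python
# A minimal 2D FABRIK sketch: alternate backward and forward reaching passes
# until the end effector is within tolerance of the target.
import numpy as np

def fabrik(joints, target, tol=1e-4, max_iter=100):
    """joints: (n, 2) array of joint positions; target: (2,) goal."""
    joints = joints.astype(float).copy()
    lengths = np.linalg.norm(np.diff(joints, axis=0), axis=1)
    base = joints[0].copy()
    if np.linalg.norm(target - base) > lengths.sum():
        return joints  # target unreachable: leave the chain unchanged
    for _ in range(max_iter):
        # Backward pass: pin the end effector to the target, work to the base.
        joints[-1] = target
        for i in range(len(joints) - 2, -1, -1):
            d = joints[i] - joints[i + 1]
            joints[i] = joints[i + 1] + d / np.linalg.norm(d) * lengths[i]
        # Forward pass: re-anchor the base, work back to the end effector.
        joints[0] = base
        for i in range(len(joints) - 1):
            d = joints[i + 1] - joints[i]
            joints[i + 1] = joints[i] + d / np.linalg.norm(d) * lengths[i]
        if np.linalg.norm(joints[-1] - target) < tol:
            break
    return joints

chain = np.array([[0, 0], [1, 0], [2, 0], [3, 0]])
print(fabrik(chain, np.array([1.5, 1.5])))
```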

AI-Generated Content (AIGC) is gaining great popularity, with many emerging commercial services and applications. These services leverage advanced generative models, such as latent diffusion models and large language models, to generate creative content (e.g., realistic images and fluent sentences) for users. The usage of such generated content needs to be highly regulated, as the service providers need to ensure the users do not violate the usage policies (e.g., abuse for commercialization, generating and distributing unsafe content). A promising solution to achieve this goal is watermarking, which adds unique and imperceptible watermarks to the content for service verification and attribution. Numerous watermarking approaches have been proposed recently. However, in this paper, we show that an adversary can easily break these watermarking mechanisms. Specifically, we consider two possible attacks. (1) Watermark removal: the adversary can easily erase the embedded watermark from the generated content and then use it freely, bypassing the service provider's regulation. (2) Watermark forging: the adversary can create illegal content with forged watermarks from another user, causing the service provider to make incorrect attributions. We propose Warfare, a unified methodology to achieve both attacks in a holistic way. The key idea is to leverage a pre-trained diffusion model for content processing and a generative adversarial network for watermark removal or forging. We evaluate Warfare on different datasets and embedding setups. The results prove that it can achieve high success rates while maintaining the quality of the generated content. Compared to existing diffusion model-based attacks, Warfare is 5,050~11,000x faster.
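
To make the setting concrete, the toy sketch below embeds and verifies a per-user bit string in an image using naive least-significant-bit coding. Real AIGC watermarks are learned and far more robust; this is only meant to illustrate what removal and forging attacks target.

```python
# A toy illustration of the watermarking setting these attacks target: the
# service hides a per-user bit string in generated images and later extracts
# it for attribution. This naive LSB scheme is far weaker than the learned
# watermarks studied in the paper.
import numpy as np

def embed(image, bits):
    """Hide a bit string in the least-significant bits of pixel values."""
    flat = image.flatten()
    flat[: len(bits)] = (flat[: len(bits)] & 0xFE) | bits  # overwrite LSBs
    return flat.reshape(image.shape)

def extract(image, n_bits):
    """Read the hidden bit string back out of the pixel LSBs."""
    return image.flatten()[:n_bits] & 1

rng = np.random.default_rng(0)
img = rng.integers(0, 256, size=(64, 64), dtype=np.uint8)
user_id = rng.integers(0, 2, size=32, dtype=np.uint8)  # per-user watermark

marked = embed(img, user_id)
assert np.array_equal(extract(marked, 32), user_id)    # attribution succeeds
```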

Transformer-based pretrained language models (T-PTLMs) have achieved great success in almost every NLP task. The evolution of these models started with GPT and BERT. These models are built on top of transformers, self-supervised learning, and transfer learning. T-PTLMs learn universal language representations from large volumes of text data using self-supervised learning and transfer this knowledge to downstream tasks. These models provide good background knowledge for downstream tasks, which avoids training downstream models from scratch. In this comprehensive survey paper, we initially give a brief overview of self-supervised learning. Next, we explain various core concepts like pretraining, pretraining methods, pretraining tasks, embeddings, and downstream adaptation methods. Next, we present a new taxonomy of T-PTLMs and then give a brief overview of various benchmarks, both intrinsic and extrinsic. We present a summary of various useful libraries for working with T-PTLMs. Finally, we highlight some of the future research directions which will further improve these models. We strongly believe that this comprehensive survey paper will serve as a good reference to learn the core concepts as well as to stay updated with the recent developments in T-PTLMs.
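
The pretrain-then-adapt pattern the survey describes can be made concrete with a short sketch: a pretrained encoder is loaded and given a fresh classification head, so the downstream model is not trained from scratch. The checkpoint name is an illustrative assumption.

```python
# A minimal sketch of downstream adaptation of a pretrained transformer: the
# encoder weights are reused and only a new classification head is added.
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # fresh head on top of the encoder
)

inputs = tokenizer("This survey is a useful reference.", return_tensors="pt")
logits = model(**inputs).logits  # fine-tune on labeled task data from here
print(logits.shape)              # torch.Size([1, 2])
```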

Deployment of Internet of Things (IoT) devices and data fusion techniques has gained popularity in public and government domains. This usually requires capturing and consolidating data from multiple sources. Because datasets do not necessarily originate from identical sensors, fused data typically poses a complex data problem. Since the military is investigating how heterogeneous IoT devices can aid its processes and tasks, we investigate a multi-sensor approach. We propose a signal-to-image encoding approach that fuses data from IoT wearable devices into an image that is invertible and easier to visualize, supporting decision making. Furthermore, we address the challenge of enabling intelligent identification and detection, and demonstrate the feasibility of the proposed deep learning and anomaly detection models in supporting future applications that use hand-gesture data from wearable devices.
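
A minimal sketch of the invertibility requirement follows, under assumed details (min-max normalization and a reshape; the paper's actual encoding may differ).

```python
# A hedged sketch of an invertible signal-to-image encoding: a 1D wearable
# sensor signal is normalized and reshaped into a 2D grid for visualization,
# then exactly recovered. Illustrates the invertibility requirement only.
import numpy as np

def signal_to_image(signal, shape):
    lo, hi = signal.min(), signal.max()
    norm = (signal - lo) / (hi - lo)      # map to [0, 1] for display
    return norm.reshape(shape), (lo, hi)  # keep parameters for inversion

def image_to_signal(image, params):
    lo, hi = params
    return image.flatten() * (hi - lo) + lo  # undo normalization and reshape

t = np.linspace(0, 4 * np.pi, 1024)
signal = np.sin(t) + 0.1 * np.random.default_rng(1).standard_normal(1024)

image, params = signal_to_image(signal, (32, 32))
recovered = image_to_signal(image, params)
assert np.allclose(recovered, signal)  # the encoding is invertible
```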

Graph Neural Networks (GNNs) have been studied from the lens of expressive power and generalization. However, their optimization properties are less well understood. We take the first step towards analyzing GNN training by studying the gradient dynamics of GNNs. First, we analyze linearized GNNs and prove that despite the non-convexity of training, convergence to a global minimum at a linear rate is guaranteed under mild assumptions that we validate on real-world graphs. Second, we study what may affect the GNNs' training speed. Our results show that the training of GNNs is implicitly accelerated by skip connections, more depth, and/or a good label distribution. Empirical results confirm that our theoretical results for linearized GNNs align with the training behavior of nonlinear GNNs. Our results provide the first theoretical support for the success of GNNs with skip connections in terms of optimization, and suggest that deep GNNs with skip connections would be promising in practice.
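
As a concrete illustration of the skip connections the analysis credits, the sketch below shows a minimal GNN layer with an identity skip in plain PyTorch; the normalized adjacency matrix and dimensions are toy assumptions.

```python
# A minimal sketch of a GNN layer with a skip connection, the architectural
# ingredient credited above with faster training. a_hat is a normalized
# adjacency matrix supplied by the caller.
import torch
import torch.nn as nn

class GCNLayerWithSkip(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.linear = nn.Linear(dim, dim)

    def forward(self, x, a_hat):
        # Neighborhood aggregation plus identity skip: ReLU(A_hat X W) + X
        return torch.relu(a_hat @ self.linear(x)) + x

n, dim = 5, 16
a = torch.eye(n) + torch.rand(n, n).round()          # toy graph, self-loops
deg = a.sum(1)
a_hat = a / torch.sqrt(deg[:, None] * deg[None, :])  # symmetric normalization

x = torch.randn(n, dim)
layer = GCNLayerWithSkip(dim)
print(layer(x, a_hat).shape)  # torch.Size([5, 16])
```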

Deep neural networks (DNNs) are successful in many computer vision tasks. However, the most accurate DNNs require millions of parameters and operations, making them energy-, computation-, and memory-intensive. This impedes the deployment of large DNNs on low-power devices with limited compute resources. Recent research improves DNN models by reducing memory requirements, energy consumption, and the number of operations without significantly decreasing accuracy. This paper surveys the progress of low-power deep learning and computer vision, specifically with regard to inference, and discusses methods for compacting and accelerating DNN models. The techniques can be divided into four major categories: (1) parameter quantization and pruning, (2) compressed convolutional filters and matrix factorization, (3) network architecture search, and (4) knowledge distillation. We analyze the accuracy, advantages, disadvantages, and potential solutions to the problems with the techniques in each category. We also discuss new evaluation metrics as a guideline for future research.
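
Category (1) can be illustrated with PyTorch's built-in pruning utilities: the sketch below zeroes the smallest-magnitude half of a layer's weights. This is a generic example, not a method from a specific surveyed paper.

```python
# A minimal sketch of parameter pruning: the smallest-magnitude 50% of a
# layer's weights are zeroed, shrinking the effective model.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

layer = nn.Linear(256, 128)
prune.l1_unstructured(layer, name="weight", amount=0.5)  # zero smallest 50%

sparsity = (layer.weight == 0).float().mean().item()
print(f"weight sparsity: {sparsity:.0%}")  # ~50%

# Make the pruning permanent by removing the reparameterization mask.
prune.remove(layer, "weight")
```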

In Multi-Label Text Classification (MLTC), one sample can belong to more than one class. It is observed that in most MLTC tasks there are dependencies or correlations among labels, yet existing methods tend to ignore these relationships. In this paper, a graph attention network-based model is proposed to capture the attentive dependency structure among the labels. The graph attention network uses a feature matrix and a correlation matrix to capture and explore crucial dependencies between the labels and to generate classifiers for the task. The generated classifiers are applied to sentence feature vectors obtained from the text feature extraction network (BiLSTM) to enable end-to-end training. Attention allows the system to assign different weights to neighbor nodes per label, allowing it to learn the dependencies among labels implicitly. The results of the proposed model are validated on five real-world MLTC datasets. The proposed model achieves similar or better performance compared to previous state-of-the-art models.
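
A hedged sketch of the core mechanism follows: a graph attention layer over label embeddings in which a correlation matrix masks which labels may attend to one another. Dimensions, the mask, and the scoring function are toy assumptions, not the paper's exact architecture.

```python
# A toy graph attention layer over label embeddings: a label correlation
# matrix masks attention so that only correlated labels influence each other.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LabelGraphAttention(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.attn = nn.Linear(2 * dim, 1)

    def forward(self, labels, corr):
        n = labels.size(0)
        # Score every ordered label pair from concatenated embeddings.
        pairs = torch.cat(
            [labels.repeat_interleave(n, 0), labels.repeat(n, 1)], dim=1
        ).view(n, n, -1)
        scores = torch.relu(self.attn(pairs)).squeeze(-1)
        scores = scores.masked_fill(corr == 0, float("-inf"))  # mask uncorrelated
        weights = F.softmax(scores, dim=1)  # per-label attention over neighbors
        return weights @ labels             # correlation-aware label features

n_labels, dim = 4, 8
labels = torch.randn(n_labels, dim)                    # label feature matrix
corr = (torch.rand(n_labels, n_labels) > 0.4).float()  # toy correlation mask
corr.fill_diagonal_(1.0)                               # labels attend to themselves
print(LabelGraphAttention(dim)(labels, corr).shape)    # torch.Size([4, 8])
```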
