
Interpretability tools that offer explanations in the form of a dialogue have demonstrated their efficacy in enhancing users' understanding, as one-off explanations may occasionally fall short in providing sufficient information to the user. Current solutions for dialogue-based explanations, however, require many dependencies and are not easily transferable to tasks they were not designed for. With LLMCheckup, we present an easily accessible tool that allows users to chat with any state-of-the-art large language model (LLM) about its behavior. We enable LLMs to generate all explanations by themselves and take care of intent recognition without fine-tuning, by connecting them with a broad spectrum of Explainable AI (XAI) tools, e.g. feature attributions, embedding-based similarity, and prompting strategies for counterfactual and rationale generation. LLM (self-)explanations are presented as an interactive dialogue that supports follow-up questions and generates suggestions. LLMCheckup provides tutorials for the operations available in the system, catering to individuals with varying levels of expertise in XAI, and supports multiple input modalities. We introduce a new parsing strategy called multi-prompt parsing, which substantially enhances the parsing accuracy of LLMs. Finally, we showcase the tasks of fact checking and commonsense question answering.

Related content

大語言(yan)模型

關注 56

大(da)語(yu)(yu)言(yan)(yan)(yan)模(mo)(mo)型(xing)(xing)(xing)(xing)是(shi)基于(yu)海量(liang)(liang)(liang)文(wen)本(ben)(ben)數據訓練(lian)的(de)(de)(de)深度(du)學(xue)習模(mo)(mo)型(xing)(xing)(xing)(xing)。它(ta)不僅能(neng)夠生成自然語(yu)(yu)言(yan)(yan)(yan)文(wen)本(ben)(ben)，還能(neng)夠深入理(li)(li)解(jie)文(wen)本(ben)(ben)含義，處理(li)(li)各種自然語(yu)(yu)言(yan)(yan)(yan)任務(wu)，如(ru)(ru)文(wen)本(ben)(ben)摘要、問答、翻(fan)譯等。2023年，大(da)語(yu)(yu)言(yan)(yan)(yan)模(mo)(mo)型(xing)(xing)(xing)(xing)及(ji)其在(zai)人(ren)工智能(neng)領域(yu)的(de)(de)(de)應用(yong)已成為全球(qiu)科技研究(jiu)的(de)(de)(de)熱點，其在(zai)規模(mo)(mo)上的(de)(de)(de)增長(chang)尤(you)為引人(ren)注目，參數量(liang)(liang)(liang)已從(cong)最初的(de)(de)(de)十幾(ji)億躍升到如(ru)(ru)今的(de)(de)(de)一萬億。參數量(liang)(liang)(liang)的(de)(de)(de)提升使得(de)模(mo)(mo)型(xing)(xing)(xing)(xing)能(neng)夠更(geng)加精細地捕捉人(ren)類(lei)語(yu)(yu)言(yan)(yan)(yan)微(wei)妙(miao)之處，更(geng)加深入地理(li)(li)解(jie)人(ren)類(lei)語(yu)(yu)言(yan)(yan)(yan)的(de)(de)(de)復雜(za)性(xing)。在(zai)過去的(de)(de)(de)一年里，大(da)語(yu)(yu)言(yan)(yan)(yan)模(mo)(mo)型(xing)(xing)(xing)(xing)在(zai)吸納新(xin)知識、分解(jie)復雜(za)任務(wu)以及(ji)圖文(wen)對齊等多方面都有顯著(zhu)提升。隨著(zhu)技術的(de)(de)(de)不斷成熟，它(ta)將不斷拓(tuo)展其應用(yong)范圍，為人(ren)類(lei)提供更(geng)加智能(neng)化(hua)和(he)個性(xing)化(hua)的(de)(de)(de)服(fu)務(wu)，進一步(bu)改善(shan)人(ren)們(men)的(de)(de)(de)生活和(he)生產方式。

MoDELS · INTERACT · Dataset · INFORMS · Performer

March 6, 2024

KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions

Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden

Large language models (LLMs) adapted to follow user instructions are now widely deployed as conversational agents. In this work, we examine one increasingly common instruction-following task: providing writing assistance to compose a long-form answer. To evaluate the capabilities of current LLMs on this task, we construct KIWI, a dataset of knowledge-intensive writing instructions in the scientific domain. Given a research question, an initial model-generated answer and a set of relevant papers, an expert annotator iteratively issues instructions for the model to revise and improve its answer. We collect 1,260 interaction turns from 234 interaction sessions with three state-of-the-art LLMs. Each turn includes a user instruction, a model response, and a human evaluation of the model response. Through a detailed analysis of the collected responses, we find that all models struggle to incorporate new information into an existing answer, and to perform precise and unambiguous edits. Further, we find that models struggle to judge whether their outputs successfully followed user instructions, with accuracy at least 10 points short of human agreement. Our findings indicate that KIWI will be a valuable resource to measure progress and improve LLMs' instruction-following capabilities for knowledge intensive writing tasks.

TAP · Identifiable · Model evaluation · Design · EASE

March 6, 2024

Tappy: Predicting Tap Accuracy of User-Interface Elements by Reverse-Engineering Webpage Structures

Hiroki Usuba, Junichi Sato, Naomi Sasaya, Shota Yamanaka, Fumiya Yamashita

Selecting a UI element is a fundamental operation on webpages, and the ease of tapping a target object has a significant impact on usability. It is thus important to analyze existing UIs in order to design better ones. However, tools proposed in previous studies cannot identify whether an element is tappable on modern webpages. In this study, we developed Tappy, a tool that identifies tappable UI elements on webpages and estimates the tap-success rate based on the element size. Our interviews with professional designers and engineers showed that Tappy's quantitative metric helped ground discussions of UI design. Furthermore, we have made the tool freely available to external users, so readers can access Tappy by visiting the website (//tappy.yahoo.co.jp).

SUMO · V2X · Optimization · Robustness · Robot

March 6, 2024

ICAT: An Indoor Connected and Autonomous Testbed for Vehicle Computing

Zhaofeng Tian, William He, Boyang Tian, Ren Zhong, Erfan Foorginejad, Weisong Shi

Indoor autonomous driving testbeds have emerged to complement expensive outdoor testbeds and virtual simulations, offering scalable and cost-effective solutions for research in navigation, traffic optimization, and swarm intelligence. However, they often lack the robust sensing and computing infrastructure for advanced research. Addressing these limitations, we introduce the Indoor Connected Autonomous Testbed (ICAT), a platform that not only tackles the unique challenges of indoor autonomous driving but also innovates vehicle computing and V2X communication. Moreover, ICAT leverages digital twins through CARLA and SUMO simulations, facilitating both centralized and decentralized autonomy deployments.

Linear · Singular value decomposition · Identical · massive MIMO · MIMO

March 5, 2024

Low-Complexity Linear Decoupling of Users for Uplink Massive MU-MIMO Detection

S. Sowmya, Gokularam Muthukrishnan, K. Giridhar

Multi-user massive MIMO is a promising candidate for future wireless communication systems. It enables users with different requirements to be connected to the same base station (BS) on the same set of resources. In uplink massive MU-MIMO, when users with different requirements are served, decoupled signal detection makes it possible to use a user-specific detection scheme for every user. In this paper, we propose a low-complexity linear decoupling scheme called Sequential Decoupler (SD), which aids in the parallel detection of each user's data streams. The proposed algorithm shows significant complexity reduction, particularly as the number of users in the system increases. In numerical simulations, the complexity of the proposed scheme is only 0.15% of that of conventional Singular Value Decomposition (SVD) based decoupling and 47% of that of pseudo-inverse based decoupling when 80 users with two antennas each are served by the BS.
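As a point of reference for the pseudo-inverse based baseline mentioned in the abstract, the sketch below shows generic zero-forcing detection with a Moore-Penrose pseudo-inverse on a noise-free uplink model y = Hx. It is not the paper's Sequential Decoupler, and the antenna counts, QPSK symbols, and the function name `zero_forcing_detect` are illustrative assumptions rather than details taken from the paper.

```python
import numpy as np


def zero_forcing_detect(H, y):
    """Estimate x from y = H @ x (+ noise) by applying the pseudo-inverse of H."""
    return np.linalg.pinv(H) @ y


# Illustrative (assumed) sizes: 16 BS antennas, 3 users with 2 antennas each.
rng = np.random.default_rng(0)
n_rx, n_users, n_ant = 16, 3, 2
n_streams = n_users * n_ant

# Random complex Gaussian channel matrix; columns correspond to user streams.
H = (rng.standard_normal((n_rx, n_streams))
     + 1j * rng.standard_normal((n_rx, n_streams))) / np.sqrt(2)

# QPSK-like symbols, one per stream.
x = rng.choice([-1.0, 1.0], size=n_streams) + 1j * rng.choice([-1.0, 1.0], size=n_streams)

# Noise-free received signal, so the estimate should match x up to rounding.
y = H @ x
x_hat = zero_forcing_detect(H, y)

# After decoupling, each row holds one user's streams and can be handled separately.
per_user = x_hat.reshape(n_users, n_ant)
```

With noise added to `y`, the same call still separates the streams, but the pseudo-inverse amplifies noise when `H` is poorly conditioned, which is one reason user-specific detectors after decoupling can be attractive.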

MoDELS · contrastive · Learning · Model evaluation · Federated learning

March 5, 2024

FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

from arxiv, Accepted by 28th European Symposium on Research in Computer Security (ESORICS 2023)

Federated Learning (FL) excels at training a global model with numerous clients that share only the parameters of their local models, which are trained on their private datasets. The clients can therefore obtain a high-performance deep learning (DL) model without revealing their private data. However, recent research has proposed poisoning attacks that cause a catastrophic loss in the accuracy of the global model when adversaries, posing as benign clients, are present in a group of clients. In response, recent studies have suggested byzantine-robust FL methods that allow the server to train an accurate global model even with adversaries present in the system. However, many existing methods require knowledge of the number of malicious clients or an auxiliary (clean) dataset, or their effectiveness reportedly drops sharply when the private datasets are not independent and identically distributed (non-IID). In this work, we propose FLGuard, a novel byzantine-robust FL method that detects malicious clients and discards malicious local updates by utilizing contrastive learning, a technique that has shown tremendous improvement as a self-supervised learning method. With contrastive models, we design FLGuard as an ensemble scheme to maximize the defensive capability. We evaluate FLGuard extensively under various poisoning attacks and compare the accuracy of the global model with existing byzantine-robust FL methods. FLGuard outperforms the state-of-the-art defense methods in most cases and shows drastic improvement, especially in non-IID settings. //github.com/201younghanlee/FLGuard

Code · anchor · Extensibility · Reviewer · Comprehensibility

March 4, 2024

Ivie: Lightweight Anchored Explanations of Just-Generated Code

Litao Yan, Alyssa Hwang, Zhiyuan Wu, Andrew Head

from arxiv, 15 pages, 10 figures, to be published in the CHI Conference on Human Factors in Computing Systems (CHI 24)

Programming assistants have reshaped the experience of programming into one where programmers spend less time writing and more time critically examining code. In this paper, we explore how programming assistants can be extended to accelerate the inspection of generated code. We introduce an extension to the programming assistant called Ivie, or instantly visible in-situ explanations. When using Ivie, a programmer's generated code is instantly accompanied by explanations positioned just adjacent to the code. Our design was optimized for extremely low-cost invocation and dismissal. Explanations are compact and informative. They describe meaningful expressions, from individual variables to entire blocks of code. We present an implementation of Ivie that forks VS Code, applying a modern LLM for timely segmentation and explanation of generated code. In a lab study, we compared Ivie to a contemporary baseline tool for code understanding. Ivie improved understanding of generated code, and was received by programmers as a highly useful, low distraction, desirable complement to the programming assistant.

Multimodal · Taxonomy · MoDELS · Comprehensibility · Directed

February 9, 2023

A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions

Hongyu Zhou, Xin Zhou, Zhiwei Zeng, Lingzi Zhang, Zhiqi Shen

from arxiv, 33 pages, 4 figures

Recommendation systems have become popular and effective tools that help users discover items of interest by modeling user preferences and item properties based on implicit interactions (e.g., purchasing and clicking). Humans perceive the world by processing modality signals (e.g., audio, text and image), which has inspired researchers to build recommender systems that can understand and interpret data from different modalities. Such models can capture the hidden relations between different modalities and possibly recover complementary information that cannot be captured by a uni-modal approach or by implicit interactions alone. The goal of this survey is to provide a comprehensive review of recent research efforts on multimodal recommendation. Specifically, it presents a clear pipeline with the techniques commonly used in each step and classifies the models by the methods they use. Additionally, a code framework has been designed that helps researchers new to this area understand the principles and techniques and easily run the SOTA models. Our framework is located at: //github.com/enoche/MMRec

Knowledge · Processing (programming language) · Graph · NLP · Knowledge graph

September 30, 2022

A Decade of Knowledge Graphs in Natural Language Processing: A Survey

Phillip Schneider, Tim Schopf, Juraj Vladika, Mikhail Galkin, Elena Simperl, Florian Matthes

from arxiv, Accepted to AACL-IJCNLP 2022

In pace with developments in the research field of artificial intelligence, knowledge graphs (KGs) have attracted a surge of interest from both academia and industry. As a representation of semantic relations between entities, KGs have proven to be particularly relevant for natural language processing (NLP), experiencing a rapid spread and wide adoption within recent years. Given the increasing amount of research work in this area, several KG-related approaches have been surveyed in the NLP research community. However, a comprehensive study that categorizes established topics and reviews the maturity of individual research streams remains absent to this day. Contributing to closing this gap, we systematically analyzed 507 papers from the literature on KGs in NLP. Our survey encompasses a multifaceted review of tasks, research types, and contributions. As a result, we present a structured overview of the research landscape, provide a taxonomy of tasks, summarize our findings, and highlight directions for future work.

Next · Integration · Directed · Controller · Continuity

March 5, 2022

AI for Next Generation Computing: Emerging Trends and Future Directions

Sukhpal Singh Gill, Minxian Xu, Carlo Ottaviani, Panos Patros, Rami Bahsoon, Arash Shaghaghi, Muhammed Golec, Vlado Stankovski, Huaming Wu, Ajith Abraham, Manmeet Singh, Harshit Mehta, Soumya K. Ghosh, Thar Baker, Ajith Kumar Parlikad, Hanan Lutfiyya, Salil S. Kanhere, Rizos Sakellariou, Schahram Dustdar, Omer Rana, Ivona Brandic, Steve Uhlig

from arxiv, Accepted for Publication in Elsevier IoT Journal, 2022

Autonomic computing investigates how systems can achieve (user) specified control outcomes on their own, without the intervention of a human operator. Autonomic computing fundamentals have been substantially influenced by those of control theory for closed and open-loop systems. In practice, complex systems may exhibit a number of concurrent and inter-dependent control loops. Despite research into autonomic models for managing computer resources, ranging from individual resources (e.g., web servers) to a resource ensemble (e.g., multiple resources within a data center), research into integrating Artificial Intelligence (AI) and Machine Learning (ML) to improve resource autonomy and performance at scale continues to be a fundamental challenge. The integration of AI/ML to achieve such autonomic and self-management of systems can be achieved at different levels of granularity, from full to human-in-the-loop automation. In this article, leading academics, researchers, practitioners, engineers, and scientists in the fields of cloud computing, AI/ML, and quantum computing join to discuss current research and potential future directions for these fields. Further, we discuss challenges and opportunities for leveraging AI and ML in next generation computing for emerging computing paradigms, including cloud, fog, edge, serverless and quantum computing environments.

Sentiment analysis · MoDELS · Recurrent neural network · entity · Neural Networks

June 8, 2018

Multilingual Sentiment Analysis: An RNN-Based Framework for Limited Data

Ethem F. Can, Aysu Ezen-Can, Fazli Can

from arxiv, ACM SIGIR 2018 Workshop on Learning from Limited or Noisy Data (LND4IR'18)

Sentiment analysis is a widely studied NLP task where the goal is to determine opinions, emotions, and evaluations of users towards a product, an entity or a service that they are reviewing. One of the biggest challenges for sentiment analysis is that it is highly language dependent. Word embeddings, sentiment lexicons, and even annotated data are language specific. Further, optimizing models for each language is very time consuming and labor intensive, especially for recurrent neural network models. From a resource perspective, it is very challenging to collect data for different languages. In this paper, we look for an answer to the following research question: can a sentiment analysis model trained on one language be reused for sentiment analysis in other languages (Russian, Spanish, Turkish, and Dutch) where the data is more limited? Our goal is to build a single model in the language with the largest dataset available for the task, and reuse it for languages that have limited resources. For this purpose, we train a sentiment analysis model using recurrent neural networks with reviews in English. We then translate reviews in other languages and reuse this model to evaluate the sentiments. Experimental results show that our robust approach of a single model trained on English reviews statistically significantly outperforms the baselines in several different languages.