
Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized natural language understanding and generation. They possess deep language comprehension, human-like text generation capabilities, contextual awareness, and robust problem-solving skills, making them invaluable in various domains (e.g., search engines, customer support, translation). At the same time, LLMs have gained traction in the security community, revealing security vulnerabilities and showcasing their potential in security-related tasks. This paper explores the intersection of LLMs with security and privacy. Specifically, we investigate how LLMs positively impact security and privacy, the potential risks and threats associated with their use, and the inherent vulnerabilities within LLMs. Through a comprehensive literature review, the paper categorizes prior work into "The Good" (beneficial LLM applications), "The Bad" (offensive applications), and "The Ugly" (vulnerabilities of LLMs and their defenses). Among our findings: LLMs have proven to enhance code security (e.g., code vulnerability detection) and data privacy (e.g., data confidentiality protection), outperforming traditional methods; however, their human-like reasoning abilities also make them exploitable for various attacks, particularly user-level attacks. We also identify areas that require further research. For example, research on model and parameter extraction attacks remains limited and largely theoretical, hindered by the scale and confidentiality of LLM parameters, and safe instruction tuning, a recent development, requires further exploration. We hope this work sheds light on LLMs' potential to both bolster and jeopardize cybersecurity.
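As a concrete illustration of the "Good" side surveyed above, the sketch below shows how an LLM might be prompted to review code for vulnerabilities. It is a minimal sketch, not the survey's protocol: the prompt wording is an assumption, and query_llm is a hypothetical stand-in for whatever chat-completion client is available.

# Minimal sketch of LLM-assisted code vulnerability detection.
# query_llm(prompt) -> str is a hypothetical stand-in for an actual LLM client;
# the prompt and output format are illustrative assumptions.

VULN_PROMPT = """You are a security reviewer. Inspect the following {language} code and
list any vulnerabilities (e.g., SQL injection, buffer overflow) with a short
justification. Reply "NO ISSUES FOUND" if the code looks safe.

{code}
"""

def review_code(code: str, language: str, query_llm) -> str:
    """Ask an LLM to review `code` for security issues."""
    prompt = VULN_PROMPT.format(language=language, code=code)
    return query_llm(prompt)

if __name__ == "__main__":
    snippet = 'cursor.execute("SELECT * FROM users WHERE name = \'" + name + "\'")'
    # Replace the fake client below with a real LLM call.
    fake_llm = lambda prompt: "Possible SQL injection: user input is concatenated into the query."
    print(review_code(snippet, "python", fake_llm))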

Related Content

Large language models are deep learning models trained on massive amounts of text data. They can not only generate natural language text but also understand its meaning in depth, handling a wide range of natural language tasks such as summarization, question answering, and translation. In 2023, large language models and their applications in artificial intelligence became a global research focus, and their growth in scale has been especially striking: parameter counts have jumped from the initial billions to as many as a trillion today. This increase in parameters allows models to capture the subtleties of human language more precisely and to understand its complexity more deeply. Over the past year, large language models have improved markedly at absorbing new knowledge, decomposing complex tasks, and aligning text with images. As the technology matures, it will continue to expand its range of applications, offering people more intelligent and personalized services and further improving how we live and work.

In the rapidly advancing field of artificial intelligence, the concept of Red-Teaming or Jailbreaking large language models (LLMs) has emerged as a crucial area of study. This approach is especially significant for assessing and enhancing the safety and robustness of these models. This paper investigates the intricate consequences of modifying LLMs through model editing, uncovering a complex relationship between enhancing model accuracy and preserving its ethical integrity. Our in-depth analysis reveals a striking paradox: while injecting accurate information is crucial for model reliability, it can paradoxically destabilize the model's foundational framework, resulting in unpredictable and potentially unsafe behaviors. Additionally, we propose a benchmark dataset, NicheHazardQA, to investigate this unsafe behavior both within the same topical domain and across domains. This aspect of our research sheds light on how the edits impact the model's safety metrics and guardrails. Our findings show that model editing serves as a cost-effective tool for topical red-teaming by methodically applying targeted edits and evaluating the resultant model behavior.
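The evaluation idea in the abstract can be pictured with the following hedged sketch: apply a targeted edit, then re-ask safety-sensitive questions (for instance, drawn from NicheHazardQA) and compare how often answers are flagged unsafe before and after the edit. apply_edit, generate, and is_unsafe are hypothetical stand-ins for a model-editing method, the model's generation API, and a safety classifier; this is not the paper's exact pipeline.

# Hedged sketch: measure the unsafe-answer rate before and after targeted edits.
# apply_edit(), generate(), and is_unsafe() are hypothetical stand-ins.

def unsafe_rate(model, questions, generate, is_unsafe) -> float:
    answers = [generate(model, q) for q in questions]
    return sum(is_unsafe(a) for a in answers) / max(len(answers), 1)

def red_team_with_edits(model, edits, questions, apply_edit, generate, is_unsafe):
    """Compare safety behavior of the base model and each edited variant."""
    baseline = unsafe_rate(model, questions, generate, is_unsafe)
    results = []
    for edit in edits:
        edited = apply_edit(model, edit)              # inject one factual edit
        rate = unsafe_rate(edited, questions, generate, is_unsafe)
        results.append({"edit": edit, "baseline": baseline, "after_edit": rate})
    return results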

Deciphering the intricacies of the human brain has captivated curiosity for centuries. Recent strides in Brain-Computer Interface (BCI) technology, particularly using motor imagery, have restored motor functions such as reaching, grasping, and walking in paralyzed individuals. However, unraveling natural language from brain signals remains a formidable challenge. Electroencephalography (EEG) is a non-invasive technique used to record electrical activity in the brain by placing electrodes on the scalp. Previous studies of EEG-to-text decoding have achieved high accuracy on small closed vocabularies but still fall short on large open vocabularies. We propose a novel method, EEG2TEXT, to improve the accuracy of open vocabulary EEG-to-text decoding. Specifically, EEG2TEXT leverages EEG pre-training to enhance the learning of semantics from EEG signals and proposes a multi-view transformer to model the EEG signal processing performed by different spatial regions of the brain. Experiments show that EEG2TEXT has superior performance, outperforming the state-of-the-art baseline methods by a large margin of up to 5% in absolute BLEU and ROUGE scores. EEG2TEXT shows great potential for a high-performance open-vocabulary brain-to-text system to facilitate communication.
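The multi-view idea can be sketched roughly as follows: EEG channels are grouped by assumed spatial regions, each group gets its own transformer over time, and the region-level summaries are fused by a second transformer. The region assignments, dimensions, and the missing text decoder are illustrative assumptions, not the EEG2TEXT architecture.

# Rough PyTorch sketch of a multi-view EEG encoder; region groupings and
# hyperparameters are assumptions, and no text decoder is included.

import torch
import torch.nn as nn

class MultiViewEEGEncoder(nn.Module):
    def __init__(self, region_channels, d_model=128, nhead=8, num_layers=2):
        super().__init__()
        self.region_channels = region_channels          # list of channel-index lists
        self.proj = nn.ModuleList(
            [nn.Linear(len(ch), d_model) for ch in region_channels])
        self.region_encoders = nn.ModuleList(
            [nn.TransformerEncoder(
                nn.TransformerEncoderLayer(d_model, nhead, batch_first=True),
                num_layers) for _ in region_channels])
        self.fuse = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead, batch_first=True), num_layers)

    def forward(self, eeg):                             # eeg: (batch, channels, time)
        views = []
        for chans, proj, enc in zip(self.region_channels, self.proj,
                                    self.region_encoders):
            x = eeg[:, chans, :].transpose(1, 2)        # (batch, time, region_channels)
            h = enc(proj(x))                            # per-region temporal encoding
            views.append(h.mean(dim=1))                 # pool over time
        fused = self.fuse(torch.stack(views, dim=1))    # (batch, regions, d_model)
        return fused.mean(dim=1)                        # sentence-level EEG embedding

encoder = MultiViewEEGEncoder(region_channels=[[0, 1, 2], [3, 4], [5, 6, 7]])
print(encoder(torch.randn(2, 8, 100)).shape)            # torch.Size([2, 128])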

Optimal transport (OT) and the related Wasserstein metric (W) are powerful and ubiquitous tools for comparing distributions. However, computing pairwise Wasserstein distances rapidly becomes intractable as cohort size grows. An attractive alternative would be to find an embedding space in which pairwise Euclidean distances map to OT distances, akin to standard multidimensional scaling (MDS). We present Wasserstein Wormhole, a transformer-based autoencoder that embeds empirical distributions into a latent space wherein Euclidean distances approximate OT distances. Extending MDS theory, we show that our objective function implies a bound on the error incurred when embedding non-Euclidean distances. Empirically, distances between Wormhole embeddings closely match Wasserstein distances, enabling linear time computation of OT distances. Along with an encoder that maps distributions to embeddings, Wasserstein Wormhole includes a decoder that maps embeddings back to distributions, allowing for operations in the embedding space to generalize to OT spaces, such as Wasserstein barycenter estimation and OT interpolation. By lending scalability and interpretability to OT approaches, Wasserstein Wormhole unlocks new avenues for data analysis in the fields of computational geometry and single-cell biology.
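A minimal sketch of the Wormhole-style objective, under the assumptions that each distribution is summarized by a fixed-size feature vector, pairwise Wasserstein distances W are precomputed, and a small MLP stands in for the paper's transformer encoder: the loss is an MDS-style stress between pairwise embedding distances and W.

# Minimal sketch (not the authors' implementation): train an encoder so that
# pairwise Euclidean distances between embeddings match a precomputed matrix W
# of pairwise Wasserstein distances.

import torch
import torch.nn as nn

def wormhole_loss(encoder, features, W):
    """features: (n, d) per-distribution features; W: (n, n) OT distances."""
    z = encoder(features)                      # (n, k) embeddings
    D = torch.cdist(z, z)                      # pairwise Euclidean distances
    return ((D - W) ** 2).mean()               # MDS-style stress objective

n, d, k = 64, 32, 8
features = torch.randn(n, d)
W = torch.cdist(features, features)            # stand-in for true OT distances
encoder = nn.Sequential(nn.Linear(d, 64), nn.ReLU(), nn.Linear(64, k))
opt = torch.optim.Adam(encoder.parameters(), lr=1e-3)
for _ in range(200):
    opt.zero_grad()
    loss = wormhole_loss(encoder, features, W)
    loss.backward()
    opt.step()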

In recent years, generative artificial intelligence models, represented by Large Language Models (LLMs) and Diffusion Models (DMs), have revolutionized content production methods. Artificial intelligence-generated content (AIGC) has become deeply embedded in various aspects of daily life and work. However, these technologies have also led to the emergence of Fake Artificial Intelligence Generated Content (FAIGC), posing new challenges to distinguishing genuine information. It is crucial to recognize that AIGC technology is akin to a double-edged sword; its potent generative capabilities, while beneficial, also pose risks for the creation and dissemination of FAIGC. In this survey, we propose a new taxonomy that provides a more comprehensive breakdown of the space of FAIGC methods today. Next, we explore the modalities and generative technologies of FAIGC. We introduce FAIGC detection methods and summarize the related benchmarks from various perspectives. Finally, we discuss outstanding challenges and promising areas for future research.

Natural language explanations have become a proxy for evaluating explainable and multi-step Natural Language Inference (NLI) models. However, assessing the validity of explanations for NLI is challenging as it typically involves the crowd-sourcing of apposite datasets, a process that is time-consuming and prone to logical errors. To address existing limitations, this paper investigates the verification and refinement of natural language explanations through the integration of Large Language Models (LLMs) and Theorem Provers (TPs). Specifically, we present a neuro-symbolic framework, named Explanation-Refiner, that augments a TP with LLMs to generate and formalise explanatory sentences and suggest potential inference strategies for NLI. In turn, the TP is employed to provide formal guarantees on the logical validity of the explanations and to generate feedback for subsequent improvements. We demonstrate how Explanation-Refiner can be jointly used to evaluate explanatory reasoning, autoformalisation, and error correction mechanisms of state-of-the-art LLMs as well as to automatically enhance the quality of human-annotated explanations of variable complexity in different domains.
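The verify-and-refine loop described above can be sketched as follows, with llm_formalise, llm_refine, and prover_check as hypothetical stand-ins for the LLM client and the theorem-prover interface; the real Explanation-Refiner pipeline is more involved.

# Hedged sketch of an LLM + theorem-prover refinement loop.
# llm_formalise(), llm_refine(), and prover_check() are hypothetical stand-ins.

def refine_explanation(premise, hypothesis, explanation,
                       llm_formalise, llm_refine, prover_check, max_rounds=5):
    """Iteratively verify and repair a natural language NLI explanation."""
    for _ in range(max_rounds):
        theory = llm_formalise(premise, hypothesis, explanation)   # autoformalisation
        valid, feedback = prover_check(theory)                     # formal proof attempt
        if valid:
            return explanation, theory                             # logically valid
        explanation = llm_refine(explanation, feedback)            # fold TP feedback back in
    return explanation, None                                       # no valid proof found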

Retrieval-Augmented Generation (RAG) has recently emerged as a method to extend beyond the pre-trained knowledge of Large Language Models by augmenting the original prompt with relevant passages or documents retrieved by an Information Retrieval (IR) system. RAG has become increasingly important for Generative AI solutions, especially in enterprise settings or in any domain in which knowledge is constantly refreshed and cannot be memorized in the LLM. We argue here that the retrieval component of RAG systems, be it dense or sparse, deserves increased attention from the research community, and accordingly, we conduct the first comprehensive and systematic examination of the retrieval strategy of RAG systems. We focus, in particular, on the type of passages IR systems within a RAG solution should retrieve. Our analysis considers multiple factors, such as the relevance of the passages included in the prompt context, their position, and their number. One counter-intuitive finding of this work is that the retriever's highest-scoring documents that are not directly relevant to the query (e.g., do not contain the answer) negatively impact the effectiveness of the LLM. Even more surprising, we discovered that adding random documents in the prompt improves the LLM accuracy by up to 35%. These results highlight the need to investigate the appropriate strategies when integrating retrieval with LLMs, thereby laying the groundwork for future research in this area.
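The prompt-construction choices the study varies (which passages to include, their position, and random-document padding) can be illustrated with the sketch below; retrieve is a hypothetical IR call and the template is an assumption, not the paper's exact prompt.

# Illustrative sketch of RAG prompt assembly; retrieve(question, k) -> list[str]
# is a hypothetical retriever and the template is an assumption.

import random

def build_rag_prompt(question, retrieve, corpus, k=3, n_random=0, reverse_order=False):
    passages = retrieve(question, k)                    # top-k scored passages
    if n_random:
        passages += random.sample(corpus, n_random)     # "random document" padding
    if reverse_order:
        passages = passages[::-1]                       # vary the top passage's position
    context = "\n\n".join(f"[{i+1}] {p}" for i, p in enumerate(passages))
    return (f"Answer the question using the context below.\n\n"
            f"Context:\n{context}\n\nQuestion: {question}\nAnswer:")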

We introduce AdaMoLE, a novel method for fine-tuning large language models (LLMs) through an Adaptive Mixture of Low-Rank Adaptation (LoRA) Experts. Moving beyond conventional methods that employ a static top-k strategy for activating experts, AdaMoLE dynamically adjusts the activation threshold using a dedicated threshold network, adaptively responding to the varying complexities of different tasks. By replacing a single LoRA in a layer with multiple LoRA experts and integrating a gating function with the threshold mechanism, AdaMoLE effectively selects and activates the most appropriate experts based on the input context. Our extensive evaluations across a variety of commonsense reasoning and natural language processing tasks show that AdaMoLE exceeds baseline performance. This enhancement highlights the advantages of AdaMoLE's adaptive selection of LoRA experts, improving model effectiveness without a corresponding increase in the expert count. The experimental validation not only confirms AdaMoLE as a robust approach for enhancing LLMs but also suggests valuable directions for future research in adaptive expert selection mechanisms, potentially broadening the scope for optimizing model performance across diverse language processing tasks.
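A rough sketch of the AdaMoLE mechanism, assuming illustrative ranks, dimensions, and normalization details rather than the paper's exact design: several LoRA experts sit on one linear layer, a gate scores them per token, and a small threshold network decides which scores count as active instead of a fixed top-k.

# Rough PyTorch sketch of an adaptive mixture of LoRA experts on one linear layer.
# Ranks, dimensions, and normalization details are illustrative assumptions.

import torch
import torch.nn as nn

class AdaMoLELinear(nn.Module):
    def __init__(self, base: nn.Linear, num_experts=4, rank=8, alpha=16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():               # pretrained weights stay frozen
            p.requires_grad_(False)
        d_in, d_out = base.in_features, base.out_features
        self.num_experts = num_experts
        self.A = nn.Parameter(torch.randn(num_experts, d_in, rank) * 0.01)
        self.B = nn.Parameter(torch.zeros(num_experts, rank, d_out))
        self.gate = nn.Linear(d_in, num_experts)       # per-token expert scores
        self.threshold = nn.Sequential(nn.Linear(d_in, 1), nn.Sigmoid())
        self.scaling = alpha / rank

    def forward(self, x):                              # x: (..., d_in)
        scores = torch.softmax(self.gate(x), dim=-1)              # (..., E)
        thr = self.threshold(x) / self.num_experts                # adaptive threshold
        weights = torch.relu(scores - thr)                        # drop experts below it
        weights = weights / (weights.sum(-1, keepdim=True) + 1e-6)
        # expert_out[..., e, :] = (x @ A[e]) @ B[e]
        expert_out = torch.einsum('...i,eir,ero->...eo', x, self.A, self.B)
        return self.base(x) + self.scaling * torch.einsum(
            '...e,...eo->...o', weights, expert_out)

layer = AdaMoLELinear(nn.Linear(768, 768))
print(layer(torch.randn(2, 16, 768)).shape)            # torch.Size([2, 16, 768])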

The emergence of large language models (LLMs) has marked a significant breakthrough in natural language processing (NLP), leading to remarkable advancements in text understanding and generation. Nevertheless, alongside these strides, LLMs exhibit a critical tendency to produce hallucinations, resulting in content that is inconsistent with real-world facts or user inputs. This phenomenon poses substantial challenges to their practical deployment and raises concerns over the reliability of LLMs in real-world scenarios, attracting increasing attention to the detection and mitigation of these hallucinations. In this survey, we aim to provide a thorough and in-depth overview of recent advances in the field of LLM hallucinations. We begin with an innovative taxonomy of LLM hallucinations, then delve into the factors contributing to hallucinations. Subsequently, we present a comprehensive overview of hallucination detection methods and benchmarks. Additionally, representative approaches designed to mitigate hallucinations are introduced accordingly. Finally, we analyze the challenges that highlight the current limitations and formulate open questions, aiming to delineate pathways for future research on hallucinations in LLMs.

Reasoning with knowledge expressed in natural language and Knowledge Bases (KBs) is a major challenge for Artificial Intelligence, with applications in machine reading, dialogue, and question answering. General neural architectures that jointly learn representations and transformations of text are very data-inefficient, and it is hard to analyse their reasoning process. These issues are addressed by end-to-end differentiable reasoning systems such as Neural Theorem Provers (NTPs), although they can only be used with small-scale symbolic KBs. In this paper we first propose Greedy NTPs (GNTPs), an extension to NTPs addressing their complexity and scalability limitations, thus making them applicable to real-world datasets. This result is achieved by dynamically constructing the computation graph of NTPs and including only the most promising proof paths during inference, thus obtaining orders of magnitude more efficient models. Then, we propose a novel approach for jointly reasoning over KBs and textual mentions, by embedding logic facts and natural language sentences in a shared embedding space. We show that GNTPs perform on par with NTPs at a fraction of their cost while achieving competitive link prediction results on large datasets, providing explanations for predictions, and inducing interpretable models. Source code, datasets, and supplementary material are available online at https://github.com/uclnlp/gntp.
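The pruning idea behind GNTPs can be pictured with this small sketch: rather than unifying a goal with every fact in the KB, keep only the k facts whose embeddings are nearest to the goal, so only the most promising proof paths are expanded. The random embeddings and the plain Euclidean nearest-neighbour criterion are simplifying assumptions; in the model, embeddings are learned and scores come from the NTP unification kernel.

# Sketch of GNTP-style proof-path pruning; embeddings here are random stand-ins.

import torch

def topk_candidate_facts(goal_emb, fact_embs, k=5):
    """goal_emb: (d,), fact_embs: (n_facts, d) -> indices of the k nearest facts."""
    # NTP unification scores are monotone in embedding similarity, so the
    # nearest neighbours are the facts most likely to yield high proof scores.
    dists = torch.cdist(goal_emb[None, :], fact_embs)[0]     # (n_facts,)
    return torch.topk(-dists, k).indices                     # k nearest facts

d, n_facts = 64, 10_000
fact_embs = torch.randn(n_facts, d)
goal_emb = torch.randn(d)
print(topk_candidate_facts(goal_emb, fact_embs, k=5))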

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional representations by jointly conditioning on both left and right context in all layers. As a result, the pre-trained BERT representations can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks, such as question answering and language inference, without substantial task-specific architecture modifications. BERT is conceptually simple and empirically powerful. It obtains new state-of-the-art results on eleven natural language processing tasks, including pushing the GLUE benchmark to 80.4% (7.6% absolute improvement), MultiNLI accuracy to 86.7% (5.6% absolute improvement) and the SQuAD v1.1 question answering Test F1 to 93.2 (1.5% absolute improvement), outperforming human performance by 2.0%.
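A minimal fine-tuning sketch matching the claim that BERT needs only one additional output layer per task, using the Hugging Face transformers implementation; the hyperparameters and two-sentence toy batch are illustrative, not the paper's setup.

# Minimal sketch: fine-tune BERT for sentence classification with one extra
# output layer (the classification head added by BertForSequenceClassification).

import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased",
                                                      num_labels=2)

batch = tokenizer(["the movie was great", "the movie was terrible"],
                  padding=True, return_tensors="pt")
labels = torch.tensor([1, 0])                  # toy sentiment labels

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
outputs = model(**batch, labels=labels)        # classification head on the [CLS] token
outputs.loss.backward()
optimizer.step()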
