美国式禁忌电影在线观看免费观看_欧美一级久久久免费_你懂的网址色精品国产_亚洲日韩中文字幕在线网址_中文字幕A级电影_精品无码综合一区二区三_国产AV不卡一区二区三区四区

Large Language Models (LLMs) are increasingly deployed in various applications. As their usage grows, concerns regarding their safety are rising, especially in maintaining harmless responses when faced with malicious instructions. Many defense strategies have been developed to enhance the safety of LLMs. However, our research finds that existing defense strategies lead LLMs to predominantly adopt a rejection-oriented stance, thereby diminishing the usability of their responses to benign instructions. To solve this problem, we introduce the MoGU framework, designed to enhance LLMs' safety while preserving their usability. Our MoGU framework transforms the base LLM into two variants: the usable LLM and the safe LLM, and further employs dynamic routing to balance their contribution. When encountering malicious instructions, the router will assign a higher weight to the safe LLM to ensure that responses are harmless. Conversely, for benign instructions, the router prioritizes the usable LLM, facilitating usable and helpful responses. On various open-sourced LLMs, we compare multiple defense strategies to verify the superiority of our MoGU framework. Besides, our analysis provides key insights into the effectiveness of MoGU and verifies that our designed routing mechanism can effectively balance the contribution of each variant by assigning weights. Our work released the safer Llama2, Vicuna, Falcon, Dolphin, and Baichuan2.

相關內容

大語(yu)言模型

關注 56

大(da)語(yu)言(yan)(yan)(yan)模(mo)(mo)型(xing)是基于海量(liang)(liang)文(wen)(wen)(wen)本數(shu)據訓練的(de)深(shen)度學習模(mo)(mo)型(xing)。它不(bu)僅能(neng)(neng)(neng)夠(gou)生(sheng)成(cheng)自(zi)然語(yu)言(yan)(yan)(yan)文(wen)(wen)(wen)本，還能(neng)(neng)(neng)夠(gou)深(shen)入理(li)(li)解文(wen)(wen)(wen)本含義，處(chu)理(li)(li)各種自(zi)然語(yu)言(yan)(yan)(yan)任務，如文(wen)(wen)(wen)本摘(zhai)要、問答、翻(fan)譯等。2023年，大(da)語(yu)言(yan)(yan)(yan)模(mo)(mo)型(xing)及(ji)其(qi)在(zai)(zai)人(ren)工智能(neng)(neng)(neng)領域的(de)應用已成(cheng)為全球科技研究的(de)熱(re)點，其(qi)在(zai)(zai)規模(mo)(mo)上(shang)的(de)增長尤為引人(ren)注目，參(can)(can)數(shu)量(liang)(liang)已從最(zui)初(chu)的(de)十(shi)幾億(yi)躍升到如今的(de)一(yi)萬億(yi)。參(can)(can)數(shu)量(liang)(liang)的(de)提升使得模(mo)(mo)型(xing)能(neng)(neng)(neng)夠(gou)更(geng)(geng)加精細地(di)捕捉人(ren)類語(yu)言(yan)(yan)(yan)微妙之(zhi)處(chu)，更(geng)(geng)加深(shen)入地(di)理(li)(li)解人(ren)類語(yu)言(yan)(yan)(yan)的(de)復(fu)雜(za)性。在(zai)(zai)過去的(de)一(yi)年里，大(da)語(yu)言(yan)(yan)(yan)模(mo)(mo)型(xing)在(zai)(zai)吸納新知(zhi)識、分(fen)解復(fu)雜(za)任務以(yi)及(ji)圖(tu)文(wen)(wen)(wen)對(dui)齊等多方(fang)面都有顯著提升。隨著技術(shu)的(de)不(bu)斷成(cheng)熟，它將不(bu)斷拓(tuo)展其(qi)應用范圍，為人(ren)類提供更(geng)(geng)加智能(neng)(neng)(neng)化(hua)和個性化(hua)的(de)服務，進一(yi)步改善人(ren)們的(de)生(sheng)活和生(sheng)產方(fang)式。

Neural Networks · Networking · 混合計算 · Performer · 模型評估 ·

2024 年 7 月 3 日

ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation

Yipin Guo,Zihao Li,Yilin Lang,Qinyuan Ren

from arxiv, Accepted by 2024 CVPR Workshop : Efficient Deep Learning for Computer Vision

Operators devoid of multiplication, such as Shift and Add, have gained prominence for their compatibility with hardware. However, neural networks (NNs) employing these operators typically exhibit lower accuracy compared to conventional NNs with identical structures. ShiftAddAug uses costly multiplication to augment efficient but less powerful multiplication-free operators, improving performance without any inference overhead. It puts a ShiftAdd tiny NN into a large multiplicative model and encourages it to be trained as a sub-model to obtain additional supervision. In order to solve the weight discrepancy problem between hybrid operators, a new weight sharing method is proposed. Additionally, a novel two stage neural architecture search is used to obtain better augmentation effects for smaller but stronger multiplication-free tiny neural networks. The superiority of ShiftAddAug is validated through experiments in image classification and semantic segmentation, consistently delivering noteworthy enhancements. Remarkably, it secures up to a 4.95% increase in accuracy on the CIFAR100 compared to its directly trained counterparts, even surpassing the performance of multiplicative NNs.

Processing（編程語言） · INFORMS · AIM · 可理解性 · 真實值 ·

2024 年 7 月 3 日

Evaluation of Retrieval-Augmented Generation: A Survey

Hao Yu,Aoran Gan,Kai Zhang,Shiwei Tong,Qi Liu,Zhaofeng Liu

Retrieval-Augmented Generation (RAG) has recently gained traction in natural language processing. Numerous studies and real-world applications are leveraging its ability to enhance generative models through external information retrieval. Evaluating these RAG systems, however, poses unique challenges due to their hybrid structure and reliance on dynamic knowledge sources. To better understand these challenges, we conduct A Unified Evaluation Process of RAG (Auepora) and aim to provide a comprehensive overview of the evaluation and benchmarks of RAG systems. Specifically, we examine and compare several quantifiable metrics of the Retrieval and Generation components, such as relevance, accuracy, and faithfulness, within the current RAG benchmarks, encompassing the possible output and ground truth pairs. We then analyze the various datasets and metrics, discuss the limitations of current benchmarks, and suggest potential directions to advance the field of RAG benchmarks.

有偏 · 語言模型化 · Facebook AI Research · MoDELS · 大語言模型 ·

2024 年 7 月 2 日

CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models

Song Wang,Peng Wang,Tong Zhou,Yushun Dong,Zhen Tan,Jundong Li

from arxiv, 37 pages, 32 figures

As Large Language Models (LLMs) are increasingly deployed to handle various natural language processing (NLP) tasks, concerns regarding the potential negative societal impacts of LLM-generated content have also arisen. To evaluate the biases exhibited by LLMs, researchers have recently proposed a variety of datasets. However, existing bias evaluation efforts often focus on only a particular type of bias and employ inconsistent evaluation metrics, leading to difficulties in comparison across different datasets and LLMs. To address these limitations, we collect a variety of datasets designed for the bias evaluation of LLMs, and further propose CEB, a Compositional Evaluation Benchmark that covers different types of bias across different social groups and tasks. The curation of CEB is based on our newly proposed compositional taxonomy, which characterizes each dataset from three dimensions: bias types, social groups, and tasks. By combining the three dimensions, we develop a comprehensive evaluation strategy for the bias in LLMs. Our experiments demonstrate that the levels of bias vary across these dimensions, thereby providing guidance for the development of specific bias mitigation methods.

Performer · 變換 · MoDELS · Analysis · Extensibility ·

2024 年 7 月 2 日

On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers

Zhengxian Lu,Fangyu Wang,Zhiwei Xu,Fei Yang,Tao Li

Transformer models have emerged as potent solutions to a wide array of multidisciplinary challenges. The deployment of Transformer architectures is significantly hindered by their extensive computational and memory requirements, necessitating the reliance on advanced efficient distributed training methodologies. Prior research has delved into the performance bottlenecks associated with distributed training, aiming to unravel these bottlenecks and suggest optimization directions. However, such analyses often overlook three aspects unique to Transformer models: the specialized architecture, the dependency on various distributed strategies, and the requirement to balance computational and memory overhead. This paper aims to bridge this gap by offering a comprehensive examination of the performance bottlenecks inherent in distributed training of Transformer models, leveraging both theoretical analysis and empirical investigation. We propose an analytical framework tailored to these unique aspects of Transformers, facilitating a holistic evaluation of model architectures, distributed strategies, and resource consumption. Based on this analytical framework, we conduct a comparative analysis of theoretical performances and further systematically explore how various distributed training strategies fare in real-world scenarios. Most of the experimental results can be well explained by the analytical outcomes derived from the analytical framework. Notably, our findings suggest an advantage of pipeline parallelism over data parallelism for Transformer models. Moreover, we shed light on some unexpected outcomes, such as the potential for increased total memory overhead due to suboptimal model partitioning within pipeline parallelism. Additionally, we underscore the significance of communication block size and waiting time to further enhance performance.

SimPLe · 知識 (knowledge) · 圖 · 知識圖譜 · MoDELS ·

2024 年 7 月 2 日

Simple Augmentations of Logical Rules for Neuro-Symbolic Knowledge Graph Completion

Ananjan Nandi,Navdeep Kaur,Parag Singla, Mausam

from arxiv, 12 pages, 15 tables Published in ACL 2023

High-quality and high-coverage rule sets are imperative to the success of Neuro-Symbolic Knowledge Graph Completion (NS-KGC) models, because they form the basis of all symbolic inferences. Recent literature builds neural models for generating rule sets, however, preliminary experiments show that they struggle with maintaining high coverage. In this work, we suggest three simple augmentations to existing rule sets: (1) transforming rules to their abductive forms, (2) generating equivalent rules that use inverse forms of constituent relations and (3) random walks that propose new rules. Finally, we prune potentially low quality rules. Experiments over four datasets and five ruleset-baseline settings suggest that these simple augmentations consistently improve results, and obtain up to 7.1 pt MRR and 8.5 pt Hits@1 gains over using rules without augmentations.

SoP · 大語言模型 · 優化器 · Prompt · Extensibility ·

2024 年 7 月 2 日

SoP: Unlock the Power of Social Facilitation for Automatic Jailbreak Attack

Yan Yang,Zeguan Xiao,Xin Lu,Hongru Wang,Hailiang Huang,Guanhua Chen,Yun Chen

The widespread applications of large language models (LLMs) have brought about concerns regarding their potential misuse. Although aligned with human preference data before release, LLMs remain vulnerable to various malicious attacks. In this paper, we adopt a red-teaming strategy to enhance LLM safety and introduce SoP, a simple yet effective framework to design jailbreak prompts automatically. Inspired by the social facilitation concept, SoP generates and optimizes multiple jailbreak characters to bypass the guardrails of the target LLM. Different from previous work which relies on proprietary LLMs or seed jailbreak templates crafted by human expertise, SoP can generate and optimize the jailbreak prompt in a cold-start scenario using open-sourced LLMs without any seed jailbreak templates. Experimental results show that SoP achieves attack success rates of 88% and 60% in bypassing the safety alignment of GPT-3.5-1106 and GPT-4, respectively. Furthermore, we extensively evaluate the transferability of the generated templates across different LLMs and held-out malicious requests, while also exploring defense strategies against the jailbreak attack designed by SoP. Code is available at //github.com/Yang-Yan-Yang-Yan/SoP.

線性的 · 判別器 · 缺失值 · Weight · Analysis ·

2024 年 6 月 30 日

Weighted Missing Linear Discriminant Analysis: An Explainable Approach for Classification with Missing Data

Tuan L. Vo,Uyen Dang,Thu Nguyen

As Artificial Intelligence (AI) models are gradually being adopted in real-life applications, the explainability of the model used is critical, especially in high-stakes areas such as medicine, finance, etc. Among the commonly used models, Linear Discriminant Analysis (LDA) is a widely used classification tool that is also explainable thanks to its ability to model class distributions and maximize class separation through linear feature combinations. Nevertheless, real-world data is frequently incomplete, presenting significant challenges for classification tasks and model explanations. In this paper, we propose a novel approach to LDA under missing data, termed \textbf{\textit{Weighted missing Linear Discriminant Analysis (WLDA)}}, to directly classify observations in data that contains missing values without imputation effectively by estimating the parameters directly on missing data and use a weight matrix for missing values to penalize missing entries during classification. Furthermore, we also analyze the theoretical properties and examine the explainability of the proposed technique in a comprehensive manner. Experimental results demonstrate that WLDA outperforms conventional methods by a significant margin, particularly in scenarios where missing values are present in both training and test sets.

Analysis · Extensibility · 相互獨立的 · INTERACT · AIM ·

2024 年 6 月 27 日

Failure Diagnosis in Microservice Systems: A Comprehensive Survey and Analysis

Shenglin Zhang,Sibo Xia,Wenzhao Fan,Binpeng Shi,Xiao Xiong,Zhenyu Zhong,Minghua Ma,Yongqian Sun,Dan Pei

Modern microservice systems have gained widespread adoption due to their high scalability, flexibility, and extensibility. However, the characteristics of independent deployment, decentralization, and frequent dynamic interactions also introduce the risk of cascading failures, making it challenging to achieve accurate failure diagnosis and rapid system recovery. These issues severely impact operation efficiency and user experience. Recognizing the crucial role of failure diagnosis in enhancing the stability and reliability of microservice systems, researchers have conducted extensive studies and achieved a series of significant outcomes. This survey provides a comprehensive review and primary analysis of 94 papers from 2003 to the present, including an overview of the fundamental concepts, a research framework, and problem statements. These insights aim to help researchers understand the latest research progress in failure diagnosis. Publicly available datasets, toolkits, and evaluation metrics are also compiled to assist practitioners in selecting and validating various techniques, providing a foundation to advance the domain beyond current practices.

目標檢測 · 學成 · 深度學習 · Performance · BASIC ·

2021 年 5 月 26 日

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey

Feifei Shao,Long Chen,Jian Shao,Wei Ji,Shaoning Xiao,Lu Ye,Yueting Zhuang,Jun Xiao

from arxiv, 13 pages, 4 figures

Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e., detecting multiple and single instances with bounding boxes in an image using image-level labels, are long-standing and challenging tasks in the CV community. With the success of deep neural networks in object detection, both WSOD and WSOL have received unprecedented attention. Hundreds of WSOD and WSOL methods and numerous techniques have been proposed in the deep learning era. To this end, in this paper, we consider WSOL is a sub-task of WSOD and provide a comprehensive survey of the recent achievements of WSOD. Specifically, we firstly describe the formulation and setting of the WSOD, including the background, challenges, basic framework. Meanwhile, we summarize and analyze all advanced techniques and training tricks for improving detection performance. Then, we introduce the widely-used datasets and evaluation metrics of WSOD. Lastly, we discuss the future directions of WSOD. We believe that these summaries can help pave a way for future research on WSOD and WSOL.

多峰值 · 學成 · Extensibility · 深度學習 · Processing（編程語言） ·

2021 年 5 月 24 日

Recent Advances and Trends in Multimodal Deep Learning: A Review

Jabeen Summaira,Xi Li,Amin Muhammad Shoib,Songyuan Li,Jabbar Abdul

Deep Learning has implemented a wide range of applications and has become increasingly popular in recent years. The goal of multimodal deep learning is to create models that can process and link information using various modalities. Despite the extensive development made for unimodal learning, it still cannot cover all the aspects of human learning. Multimodal learning helps to understand and analyze better when various senses are engaged in the processing of information. This paper focuses on multiple types of modalities, i.e., image, video, text, audio, body gestures, facial expressions, and physiological signals. Detailed analysis of past and current baseline approaches and an in-depth study of recent advancements in multimodal deep learning applications has been provided. A fine-grained taxonomy of various multimodal deep learning applications is proposed, elaborating on different applications in more depth. Architectures and datasets used in these applications are also discussed, along with their evaluation metrics. Last, main issues are highlighted separately for each domain along with their possible future research directions.