亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<li id='izLCx'></li>

_{^{<dd id='oqUZw'><tbody id='9yhoX'><td id='OkWWD'><optgroup id='niMXH'><strong id='lNDd1'></strong></optgroup><address id='zwtvv'><ul id='sGffi'></ul></address><big id='aRMTv'></big></td><table id='9YQ76'></table></tbody><pre id='sAFiO'></pre></dd><span id='bbqdM'><b id='LWQ07'></b></span>}}


<dfn id='bO2hM'><optgroup id='4A7JR'></optgroup></dfn><tfoot id='5nBN6'><bdo id='dEFOI'><div id='HCC4H'></div><i id='xX52v'><dt id='NbtMM'></dt></i></bdo></tfoot>

_{<fieldset id='vVaOt'></fieldset>}

·

大語言模型 · MoDELS · 語言模型化 · Analysis · 多樣性 ·

2024 年 8 月 11 日

LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models

Shibo Hao,Yi Gu,Haotian Luo,Tianyang Liu,Xiyan Shao,Xinyuan Wang,Shuhua Xie,Haodi Ma,Adithya Samavedhi,Qiyue Gao,Zhen Wang,Zhiting Hu

from arxiv, Project website: //www.llm-reasoners.net/

Generating accurate step-by-step reasoning is essential for Large Language Models (LLMs) to address complex problems and enhance robustness and interpretability. Despite the flux of research on developing advanced reasoning approaches, systematically analyzing the diverse LLMs and reasoning strategies in generating reasoning chains remains a significant challenge. The difficulties stem from the lack of two key elements: (1) an automatic method for evaluating the generated reasoning chains on different tasks, and (2) a unified formalism and implementation of the diverse reasoning approaches for systematic comparison. This paper aims to close the gap: (1) We introduce AutoRace for fully automated reasoning chain evaluation. Existing metrics rely on expensive human annotations or pre-defined LLM prompts not adaptable to different tasks. In contrast, AutoRace automatically creates detailed evaluation criteria tailored for each task, and uses GPT-4 for accurate evaluation following the criteria. (2) We develop LLM Reasoners, a library for standardized modular implementation of existing and new reasoning algorithms, under a unified formulation of the search, reward, and world model components. With the new evaluation and library, (3) we conduct extensive study of different reasoning approaches (e.g., CoT, ToT, RAP). The analysis reveals interesting findings about different factors contributing to reasoning, including the reward-guidance, breadth-vs-depth in search, world model, and prompt formats, etc.

相關內容

大語言模型

大(da)語言(yan)模型

大語(yu)(yu)言(yan)模(mo)(mo)型(xing)是基于(yu)海量文本數據訓練的(de)(de)深(shen)(shen)度學習(xi)模(mo)(mo)型(xing)。它(ta)不僅(jin)能(neng)夠(gou)生(sheng)成自然(ran)語(yu)(yu)言(yan)文本，還能(neng)夠(gou)深(shen)(shen)入(ru)理(li)(li)解文本含義，處理(li)(li)各種自然(ran)語(yu)(yu)言(yan)任(ren)務，如文本摘(zhai)要、問答(da)、翻譯等(deng)。2023年，大語(yu)(yu)言(yan)模(mo)(mo)型(xing)及其(qi)在(zai)人(ren)工(gong)智(zhi)能(neng)領域的(de)(de)應用已(yi)成為(wei)全球科(ke)技研究的(de)(de)熱點，其(qi)在(zai)規(gui)模(mo)(mo)上(shang)的(de)(de)增(zeng)長(chang)尤(you)為(wei)引人(ren)注目，參數量已(yi)從最初的(de)(de)十幾億(yi)躍升(sheng)到如今的(de)(de)一(yi)萬億(yi)。參數量的(de)(de)提升(sheng)使(shi)得模(mo)(mo)型(xing)能(neng)夠(gou)更(geng)加(jia)精細地捕捉人(ren)類語(yu)(yu)言(yan)微妙之處，更(geng)加(jia)深(shen)(shen)入(ru)地理(li)(li)解人(ren)類語(yu)(yu)言(yan)的(de)(de)復(fu)雜(za)(za)性。在(zai)過去的(de)(de)一(yi)年里，大語(yu)(yu)言(yan)模(mo)(mo)型(xing)在(zai)吸(xi)納新(xin)知識、分解復(fu)雜(za)(za)任(ren)務以及圖文對齊等(deng)多方面(mian)都有顯著提升(sheng)。隨著技術的(de)(de)不斷成熟，它(ta)將不斷拓展其(qi)應用范(fan)圍，為(wei)人(ren)類提供更(geng)加(jia)智(zhi)能(neng)化和個性化的(de)(de)服務，進一(yi)步改善人(ren)們的(de)(de)生(sheng)活(huo)和生(sheng)產(chan)方式。

TOOLS · MoDELS · 評論員 · 回合 · INTERACT ·

2024 年 10 月 3 日

Preparing for Super-Reactivity: Early Fault-Detection in the Development of Exceedingly Complex Reactive Systems

David Harel,Assaf Marron

We introduce the term Super-Reactive Systems to refer to reactive systems whose construction and behavior are complex, constantly changing and evolving, and heavily interwoven with other systems and the physical world. Finding hidden faults in such systems early in planning and development is critical for human safety, the environment, society and the economy. However, the complexity of the system and its interactions and the absence of adequate technical details pose a great obstacle. We propose an architecture for models and tools to overcome such barriers and enable simulation, systematic analysis, and fault detection and handling, early in the development of super-reactive systems. The approach is facilitated by the inference and abstraction capabilities and the power and knowledge afforded by large language models and associated AI tools. It is based on: (i) deferred, just-in-time interpretation of model elements that are stored in natural language form, and (ii) early capture of tacit interdependencies among seemingly orthogonal requirements.

集成 · MoDELS · Performer · 語言模型化 · 全 ·

2024 年 10 月 3 日

Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation

Branislav Pecher,Jan Cegin,Robert Belanec,Jakub Simko,Ivan Srba,Maria Bielikova

from arxiv, Accepted to the Findings of the EMNLP'24 Conference

While fine-tuning of pre-trained language models generally helps to overcome the lack of labelled training samples, it also displays model performance instability. This instability mainly originates from randomness in initialisation or data shuffling. To address this, researchers either modify the training process or augment the available samples, which typically results in increased computational costs. We propose a new mitigation strategy, called Delayed Ensemble with Noisy Interpolation (DENI), that leverages the strengths of ensembling, noise regularisation and model interpolation, while retaining computational efficiency. We compare DENI with 9 representative mitigation strategies across 3 models, 4 tuning strategies and 7 text classification datasets. We show that: 1) DENI outperforms the best performing mitigation strategy (Ensemble), while using only a fraction of its cost; 2) the mitigation strategies are beneficial for parameter-efficient fine-tuning (PEFT) methods, outperforming full fine-tuning in specific cases; and 3) combining DENI with data augmentation often leads to even more effective instability mitigation.

MoDELS · tuning · 穩健性 · 語言模型化 · 評論員 ·

2024 年 10 月 2 日

TuBA: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning

Xuanli He,Jun Wang,Qiongkai Xu,Pasquale Minervini,Pontus Stenetorp,Benjamin I. P. Rubinstein,Trevor Cohn

from arxiv, work in progress

The implications of backdoor attacks on English-centric large language models (LLMs) have been widely examined - such attacks can be achieved by embedding malicious behaviors during training and activated under specific conditions that trigger malicious outputs. Despite the increasing support for multilingual capabilities in open-source and proprietary LLMs, the impact of backdoor attacks on these systems remains largely under-explored. Our research focuses on cross-lingual backdoor attacks against multilingual LLMs, particularly investigating how poisoning the instruction-tuning data for one or two languages can affect the outputs for languages whose instruction-tuning data were not poisoned. Despite its simplicity, our empirical analysis reveals that our method exhibits remarkable efficacy in models like mT5 and GPT-4o, with high attack success rates, surpassing 90% in more than 7 out of 12 languages across various scenarios. Our findings also indicate that more powerful models show increased susceptibility to transferable cross-lingual backdoor attacks, which also applies to LLMs predominantly pre-trained on English data, such as Llama2, Llama3, and Gemma. Moreover, our experiments demonstrate 1) High Transferability: the backdoor mechanism operates successfully in cross-lingual response scenarios across 26 languages, achieving an average attack success rate of 99%, and 2) Robustness: the proposed attack remains effective even after defenses are applied. These findings expose critical security vulnerabilities in multilingual LLMs and highlight the urgent need for more robust, targeted defense strategies to address the unique challenges posed by cross-lingual backdoor transfer.

3D · 逼真度 · 查準率/準確率 · 表示 · 有向 ·

2024 年 10 月 2 日

GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians

Shuyi Jiang,Qihao Zhao,Hossein Rahmani,De Wen Soh,Jun Liu,Na Zhao

Recently, with the development of Neural Radiance Fields and Gaussian Splatting, 3D reconstruction techniques have achieved remarkably high fidelity. However, the latent representations learnt by these methods are highly entangled and lack interpretability. In this paper, we propose a novel part-aware compositional reconstruction method, called GaussianBlock, that enables semantically coherent and disentangled representations, allowing for precise and physical editing akin to building blocks, while simultaneously maintaining high fidelity. Our GaussianBlock introduces a hybrid representation that leverages the advantages of both primitives, known for their flexible actionability and editability, and 3D Gaussians, which excel in reconstruction quality. Specifically, we achieve semantically coherent primitives through a novel attention-guided centering loss derived from 2D semantic priors, complemented by a dynamic splitting and fusion strategy. Furthermore, we utilize 3D Gaussians that hybridize with primitives to refine structural details and enhance fidelity. Additionally, a binding inheritance strategy is employed to strengthen and maintain the connection between the two. Our reconstructed scenes are evidenced to be disentangled, compositional, and compact across diverse benchmarks, enabling seamless, direct and precise editing while maintaining high quality.

語音增強 · Performance · 輸出 · 噪聲 · 回合 ·

2024 年 10 月 2 日

Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules

Hsin-Tien Chiang,Hao Zhang,Yong Xu,Meng Yu,Dong Yu

from arxiv, Paper in submission

In challenging environments with significant noise and reverberation, traditional speech enhancement (SE) methods often lead to over-suppressed speech, creating artifacts during listening and harming downstream tasks performance. To overcome these limitations, we propose a novel approach called Restorative SE (RestSE), which combines a lightweight SE module with a generative codec module to progressively enhance and restore speech quality. The SE module initially reduces noise, while the codec module subsequently performs dereverberation and restores speech using generative capabilities. We systematically explore various quantization techniques within the codec module to optimize performance. Additionally, we introduce a weighted loss function and feature fusion that merges the SE output with the original mixture, particularly at segments where the SE output is heavily distorted. Experimental results demonstrate the effectiveness of our proposed method in enhancing speech quality under adverse conditions. Audio demos are available at: //sophie091524.github.io/RestorativeSE/.

MoDELS · 大語言模型 · GPT-4 · Performer · INFORMS ·

2024 年 10 月 1 日

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Liyan Tang,Philippe Laban,Greg Durrett

from arxiv, EMNLP 2024

Recognizing if LLM output can be grounded in evidence is central to many tasks in NLP: retrieval-augmented generation, summarization, document-grounded dialogue, and more. Current approaches to this kind of fact-checking are based on verifying each piece of a model generation against potential evidence using an LLM. However, this process can be very computationally expensive, requiring many calls to a model to check a single response. In this work, we show how to build small fact-checking models that have GPT-4-level performance but for 400x lower cost. We do this by constructing synthetic training data with GPT-4, which involves creating realistic yet challenging instances of factual errors via a structured generation procedure. Training on this data teaches models to check each fact in the claim and recognize synthesis of information across sentences. For evaluation, we unify datasets from recent work on fact-checking and grounding LLM generations into a new benchmark, LLM-AggreFact. Our best system MiniCheck-FT5 (770M parameters) outperforms all systems of comparable size and reaches GPT-4 accuracy. We release LLM-AggreFact, code for data synthesis, and models.

可理解性 · TOOLS · 3D · 表示 · 機器人 ·

2024 年 9 月 30 日

UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models

Qiaojun Yu,Siyuan Huang,Xibin Yuan,Zhengkai Jiang,Ce Hao,Xin Li,Haonan Chang,Junbo Wang,Liu Liu,Hongsheng Li,Peng Gao,Cewu Lu

Previous studies on robotic manipulation are based on a limited understanding of the underlying 3D motion constraints and affordances. To address these challenges, we propose a comprehensive paradigm, termed UniAff, that integrates 3D object-centric manipulation and task understanding in a unified formulation. Specifically, we constructed a dataset labeled with manipulation-related key attributes, comprising 900 articulated objects from 19 categories and 600 tools from 12 categories. Furthermore, we leverage MLLMs to infer object-centric representations for manipulation tasks, including affordance recognition and reasoning about 3D motion constraints. Comprehensive experiments in both simulation and real-world settings indicate that UniAff significantly improves the generalization of robotic manipulation for tools and articulated objects. We hope that UniAff will serve as a general baseline for unified robotic manipulation tasks in the future. Images, videos, dataset, and code are published on the project website at://sites.google.com/view/uni-aff/home

Jupyter · Taxonomy · state-of-the-art · MoDELS · 設計 ·

2024 年 9 月 28 日

Jupyter Notebook Attacks Taxonomy: Ransomware, Data Exfiltration, and Security Misconfiguration

from arxiv, Accepted to the 11th Annual International Workshop on Innovating the Network for Data-Intensive Science (INDIS 2024). Co-located with the International Conference for High Performance Computing, Networking, Storage, and Analysis (Supercomputing)

Open-science collaboration using Jupyter Notebooks may expose expensively trained AI models, high-performance computing resources, and training data to security vulnerabilities, such as unauthorized access, accidental deletion, or misuse. The ubiquitous deployments of Jupyter Notebooks (~11 million public notebooks on Github have transformed collaborative scientific computing by enabling reproducible research. Jupyter is the main HPC's science gateway interface between AI researchers and supercomputers at academic institutions, such as the National Center for Supercomputing Applications (NCSA), national labs, and the industry. An impactful attack targeting Jupyter could disrupt scientific missions and business operations. This paper describes the network-based attack taxonomy of Jupyter Notebooks, such as ransomware, data exfiltration, security misconfiguration, and resource abuse for cryptocurrency mining. The open nature of Jupyter (direct data access, arbitrary code execution in multiple programming languages kernels) and its vast attack interface (terminal, file browser, untrusted cells) also attract attacks attempting to misuse supercomputing resources and steal state-of-the-art research artifacts. Jupyter uses encrypted datagrams of rapidly evolving WebSocket protocols that challenge even the most state-of-the-art network observability tools, such as Zeek. We envisage even more sophisticated AI-driven attacks can be adapted to target Jupyter, where defenders have limited visibility. In addition, Jupyter's cryptographic design should be adapted to resist emerging quantum threats. On balance, this is the first paper to systematically describe the threat model against Jupyter Notebooks and lay out the design of auditing Jupyter to have better visibility against such attacks.

泛函 · ReLU · 激活函數 · Neural Networks · Networks ·

2024 年 9 月 28 日

Zorro: A Flexible and Differentiable Parametric Family of Activation Functions That Extends ReLU and GELU

Matias Roodschild,Jorge Gotay-Sardi?as,Victor A. Jimenez,Adrian Will

from arxiv, 13 pages, 7 figures, 9 tables

Even in recent neural network architectures such as Transformers and Extended LSTM (xLSTM), and traditional ones like Convolutional Neural Networks, Activation Functions are an integral part of nearly all neural networks. They enable more effective training and capture nonlinear data patterns. More than 400 functions have been proposed over the last 30 years, including fixed or trainable parameters, but only a few are widely used. ReLU is one of the most frequently used, with GELU and Swish variants increasingly appearing. However, ReLU presents non-differentiable points and exploding gradient issues, while testing different parameters of GELU and Swish variants produces varying results, needing more parameters to adapt to datasets and architectures. This article introduces a novel set of activation functions called Zorro, a continuously differentiable and flexible family comprising five main functions fusing ReLU and Sigmoid. Zorro functions are smooth and adaptable, and serve as information gates, aligning with ReLU in the 0-1 range, offering an alternative to ReLU without the need for normalization, neuron death, or gradient explosions. Zorro also approximates functions like Swish, GELU, and DGELU, providing parameters to adjust to different datasets and architectures. We tested it on fully connected, convolutional, and transformer architectures to demonstrate its effectiveness.

圖 · Neural Networks · Networks · AIM · 圖形處理器 ·

2023 年 8 月 31 日

A Survey on Privacy in Graph Neural Networks: Attacks, Preservation, and Applications

Yi Zhang,Yuying Zhao,Zhaoqing Li,Xueqi Cheng,Yu Wang,Olivera Kotevska,Philip S. Yu,Tyler Derr

Graph Neural Networks (GNNs) have gained significant attention owing to their ability to handle graph-structured data and the improvement in practical applications. However, many of these models prioritize high utility performance, such as accuracy, with a lack of privacy consideration, which is a major concern in modern society where privacy attacks are rampant. To address this issue, researchers have started to develop privacy-preserving GNNs. Despite this progress, there is a lack of a comprehensive overview of the attacks and the techniques for preserving privacy in the graph domain. In this survey, we aim to address this gap by summarizing the attacks on graph data according to the targeted information, categorizing the privacy preservation techniques in GNNs, and reviewing the datasets and applications that could be used for analyzing/solving privacy issues in GNNs. We also outline potential directions for future research in order to build better privacy-preserving GNNs.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

大(da)語言模(mo)型

語言模型化(hua)

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<form id='l4g8e'></form>

<bdo id='l4g8e'><sup id='l4g8e'><div id='l4g8e'><bdo id='l4g8e'></bdo></div></sup></bdo>