The International Crises Behavior Events (ICBe) ontology provides high coverage of the thoughts, communications, and actions that constitute international relations. A major disadvantage of that level of detail is the large human labor cost of applying the ontology manually to new texts. Whether such an ontology is practical for international relations research given limited human and financial resources is a pressing concern. We introduce a working proof of concept showing that ICBe codings can be reliably extracted from new texts using the current generation of open-source large language models (LLMs) running on consumer-grade computer hardware. Our solution requires no fine-tuning and only limited prompt engineering. We detail our solution and present benchmarks against the original ICBe codings. We conclude by discussing the implications of very high quality event coding of any text being within reach of individual researchers with limited resources.
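As a hedged illustration of the kind of pipeline this abstract describes (the model file, prompt wording, and output schema below are our assumptions, not the authors' actual ICBe prompts), a local open-source model can be prompted for structured event codings via llama-cpp-python:

```python
from llama_cpp import Llama  # runs quantized open-source LLMs on consumer hardware

# Hypothetical setup: the model path and prompt schema are illustrative only.
llm = Llama(model_path="mistral-7b-instruct.Q4_K_M.gguf")

PROMPT = """Extract every international-relations event in the passage as JSON
objects with fields: actor, action, target, date.
Passage: {passage}
JSON:"""

def code_events(passage: str) -> str:
    # temperature=0.0 keeps the extraction deterministic and benchmarkable
    out = llm(PROMPT.format(passage=passage), max_tokens=512, temperature=0.0)
    return out["choices"][0]["text"]
```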
This work presents a novel Bundle Adjustment (BA) formulation using a Reproducing Kernel Hilbert Space (RKHS) representation, called RKHS-BA. The proposed formulation is correspondence-free, allows the BA to use RGB-D/LiDAR measurements and semantic labels directly in the optimization, and generalizes the photometric loss function commonly used in direct methods. RKHS-BA incorporates appearance and semantic labels within a continuous spatial-semantic functional representation that does not require optimization via image pyramids. We demonstrate its applications in sliding-window odometry and global LiDAR mapping, which show highly robust performance in extremely challenging scenes and the best trade-off between generalization and accuracy.
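The abstract does not state the objective explicitly; in the RKHS registration literature this formulation builds on, a correspondence-free alignment objective typically takes the following general form (the symbols here are ours, a sketch rather than the paper's exact formulation):

\[
F(T) \;=\; \langle f_X, f_{TZ} \rangle_{\mathcal{H}}
\;=\; \sum_{x_i \in X} \sum_{z_j \in Z}
\langle \ell_{x_i}, \ell_{z_j} \rangle \, k(x_i, \, T z_j),
\]

where $T$ is the pose being optimized, $k$ is a spatial kernel, and $\langle \ell_{x_i}, \ell_{z_j} \rangle$ measures the appearance/semantic similarity of the two points' labels. Every point pair contributes through the kernel, so no explicit correspondences are required, and a purely photometric label term recovers a photometric-style loss as a special case.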
Reconstructing visual stimuli from functional Magnetic Resonance Imaging (fMRI) with Latent Diffusion Models (LDMs) enables fine-grained retrieval of what the brain perceives. A persistent challenge is reconstructing a cohesive alignment of details such as structure, background, texture, and color. Moreover, LDMs generate different images even under identical conditions. To address these issues, we first offer a neuroscientific perspective on LDM-based methods: they perform top-down creation based on knowledge pre-trained from massive images, but lack detail-driven bottom-up perception, which results in unfaithful details. We propose NeuralDiffuser, which introduces primary visual feature guidance to provide detail cues in the form of gradients, extending LDM-based methods with a bottom-up process that achieves faithful semantics and details. We also develop a novel guidance strategy that ensures consistency across repeated reconstructions rather than a variety of results. NeuralDiffuser achieves state-of-the-art performance on the Natural Scenes Dataset (NSD), offering more faithful details and more consistent results.
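As a generic, classifier-guidance-style sketch of how gradient-based detail cues can be injected into an LDM sampler (the feature extractor and target below are our assumptions, not NeuralDiffuser's exact formulation), the denoiser prediction at each step can be perturbed by the gradient of a feature-matching loss:

\[
\tilde{\epsilon}_\theta(x_t, t, c) \;=\; \epsilon_\theta(x_t, t, c)
\;+\; s \,\nabla_{x_t}\, \big\lVert \phi\big(\hat{x}_0(x_t)\big) - \phi^{*} \big\rVert_2^2,
\]

where $\epsilon_\theta$ is the denoiser conditioned on semantics $c$ decoded from fMRI, $\hat{x}_0(x_t)$ is the current clean-image estimate, $\phi$ extracts primary (low-level) visual features, $\phi^{*}$ is the feature target decoded from fMRI, and $s$ is the guidance scale. Following the same gradient at each step also anchors repeated reconstructions to the same detail target, which is one way such a strategy can reduce variability across runs.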
We evaluate recent Large Language Models (LLMs) on the challenging task of summarizing short stories, which can be lengthy and include nuanced subtext or scrambled timelines. Importantly, we work directly with authors to ensure that the stories have not been shared online (and are therefore unseen by the models) and to obtain informed evaluations of summary quality using judgments from the authors themselves. Through quantitative and qualitative analysis grounded in narrative theory, we compare GPT-4, Claude-2.1, and Llama-2-70B. We find that all three models make faithfulness mistakes in over 50% of summaries and struggle to interpret difficult subtext. However, at their best, the models can provide thoughtful thematic analysis of stories. We additionally demonstrate that LLM judgments of summary quality do not match the feedback from the writers.
Sample efficiency remains a crucial challenge in applying Reinforcement Learning (RL) to real-world tasks. While recent algorithms have made significant strides in improving sample efficiency, none has achieved consistently superior performance across diverse domains. In this paper, we introduce EfficientZero V2, a general framework for sample-efficient RL. We extend EfficientZero's strong performance to multiple domains, encompassing both continuous and discrete actions as well as visual and low-dimensional inputs. With the series of improvements we propose, EfficientZero V2 outperforms the current state of the art (SOTA) by a significant margin on diverse tasks under the limited-data setting. EfficientZero V2 also exhibits a notable advance over the prevailing general algorithm, DreamerV3, achieving superior outcomes in 50 of 66 evaluated tasks across diverse benchmarks such as Atari 100k, Proprio Control, and Vision Control.
A deep neural network (DNN) typically involves convolutions, pooling, and activation functions. Due to growing concern about privacy, privacy-preserving DNNs have become a hot research topic. Generally, the convolution and pooling operations can be supported by additive homomorphic encryption and secure comparison, but the secure implementation of activation functions is not so straightforward given the requirements of accuracy and efficiency, especially for non-linear functions such as exponential, sigmoid, and tanh. This paper pays special attention to the implementation of such non-linear functions in the semi-honest model with two-party settings, for which SIRNN is the current state of the art. Unlike previous works, we propose improved implementations of these functions by exploiting their intrinsic features together with a few small but effective tricks. First, we propose a novel and efficient protocol for the exponential function using a divide-and-conquer strategy in which most of the computation is executed locally. The exponential protocol is widely used in machine learning tasks such as Poisson regression, and it is also a key component of the sigmoid and tanh functions. Next, we take advantage of the symmetry of sigmoid and tanh and fine-tune the inputs to reduce the number of 2PC building blocks, which helps to save overhead and improve performance. As a result, we implement these functions with fewer fundamental building blocks. Comprehensive evaluations show that our protocols achieve state-of-the-art precision while reducing run-time by approximately 57%, 44%, and 42% for the exponential (with only negative inputs), sigmoid, and tanh functions, respectively.
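The abstract does not spell out the decompositions, but the identities such an approach can exploit are standard. With a non-positive fixed-point input $x = -\sum_i b_i 2^{\,i-f}$ (secret bits $b_i$, $f$ fractional bits), the exponential factors into public constants selected by secret bits, and the symmetries of sigmoid and tanh mean only one half-domain ever needs secure evaluation:

\[
e^{x} \;=\; \prod_i \left( e^{-2^{\,i-f}} \right)^{b_i},
\qquad
\sigma(-x) = 1 - \sigma(x),
\qquad
\tanh(-x) = -\tanh(x).
\]

Each factor $e^{-2^{\,i-f}}$ is a public constant that both parties can compute locally, so only the bit-controlled selections require interaction; likewise, evaluating $\sigma$ or $\tanh$ on $|x|$ and correcting by the sign reduces the negative half-domain to cheap linear post-processing.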
Field-Programmable Gate Array (FPGA) accelerators have proven successful in handling latency- and resource-critical deep neural network (DNN) inference tasks. Among the most computationally intensive operations in a neural network (NN) is the dot product between the feature and weight vectors. Thus, some previous FPGA acceleration works have proposed mapping neurons with quantized inputs and outputs directly to lookup tables (LUTs) for hardware implementation. In these works, the boundaries of the neurons coincide with the boundaries of the LUTs. We propose relaxing these boundaries and mapping entire sub-networks to a single LUT. As the sub-networks are absorbed within the LUT, the NN topology and precision within a partition do not affect the size of the lookup tables generated. Therefore, we utilize fully connected layers with floating-point precision inside each partition, which benefit from being universal function approximators, with rigid sparsity and quantization enforced only between partitions, where the NN topology becomes exposed to the circuit topology. Although cheap to implement, this approach can lead to very deep NNs, so to tackle challenges like vanishing gradients we also introduce skip connections inside the partitions. The resulting methodology can be seen as training DNNs with a specific sparsity pattern that allows them to be mapped to much shallower circuit-level networks, thereby significantly improving latency. We validate our proposed method on a known latency-critical task, jet substructure tagging, and on the classical computer vision task of digit classification using MNIST. Our approach allows for greater function expressivity within the LUTs compared to existing work, leading to lower-latency NNs for the same accuracy.
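As a hedged illustration of the core idea (the function names and quantization scheme below are ours, not the paper's): because only the partition boundaries are quantized, the float-precision sub-network inside a partition can be tabulated exhaustively offline, so its internal depth and precision never appear in the generated hardware.

```python
import itertools
import numpy as np

# Hypothetical sketch: enumerate every quantized input combination at the
# partition boundary and precompute the sub-network's output for each one.
def subnetwork_to_lut(subnet, n_inputs, bits):
    levels = 2 ** bits
    lut = {}
    for inputs in itertools.product(range(levels), repeat=n_inputs):
        x = np.array(inputs, dtype=np.float32) / (levels - 1)  # dequantize to [0, 1]
        lut[inputs] = subnet(x)  # full float-precision forward pass, done once offline
    return lut  # 2**(bits * n_inputs) entries, independent of the subnet's depth

# Toy stand-in for a trained floating-point sub-network with skip connections.
lut = subnetwork_to_lut(lambda x: float(x.sum() > 1.0), n_inputs=2, bits=2)
```

The LUT size grows only with the number and bit-width of the boundary inputs, which is why sparsity and quantization are enforced between partitions while everything inside stays dense and full-precision.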
The rapid advancement of large language models (LLMs) has ushered in a new era marked by the development of autonomous applications in real-world scenarios, driving innovation in the creation of advanced web agents. Existing web agents typically handle only one input modality and are evaluated only in simplified web simulators or static web snapshots, greatly limiting their applicability in real-world scenarios. To bridge this gap, we introduce WebVoyager, an innovative Large Multimodal Model (LMM) powered web agent that can complete user instructions end-to-end by interacting with real-world websites. Moreover, we establish a new benchmark by compiling real-world tasks from 15 popular websites and introduce an automatic evaluation protocol that leverages the multimodal understanding abilities of GPT-4V to evaluate open-ended web agents. We show that WebVoyager achieves a 59.1% task success rate on our benchmark, significantly surpassing the performance of both GPT-4 (All Tools) and the WebVoyager (text-only) setups, underscoring WebVoyager's exceptional capability. The proposed automatic evaluation metric achieves 85.3% agreement with human judgment, indicating its effectiveness in providing reliable and accurate assessments of web agents.
In recent years, large language models have achieved state-of-the-art performance across various NLP tasks. However, investigations have shown that these models tend to rely on shortcut features, leading to inaccurate predictions and making them unreliable when generalizing to out-of-distribution (OOD) samples. For instance, in the context of relation extraction (RE), we would expect a model to identify the same relation independently of the entities involved. Consider the sentence "Leonardo da Vinci painted the Mona Lisa", which expresses the created(Leonardo_da_Vinci, Mona_Lisa) relation. If we substitute "Leonardo da Vinci" with "Barack Obama", the sentence still expresses the created relation, and a robust model should detect the same relation in both cases. In this work, we describe several semantically motivated strategies to generate adversarial examples by replacing entity mentions and investigate how state-of-the-art RE models perform under pressure. Our analyses show that the performance of these models deteriorates significantly on the modified datasets (avg. of -48.5% in F1), which indicates that these models rely to a great extent on shortcuts, such as surface forms (or patterns therein) of entities, without making full use of the information present in the sentences.
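A minimal sketch of the entity-substitution idea, using the abstract's own example (the type inventory and substitute lists below are illustrative assumptions): swapping an entity mention for another surface form of the same semantic type leaves the gold relation unchanged.

```python
import random

# Hypothetical substitute pools, keyed by entity type.
SUBSTITUTES = {
    "PERSON": ["Barack Obama", "Marie Curie", "Ada Lovelace"],
}

def substitute_entity(sentence, mention, entity_type, rng=random):
    # Pick a different surface form of the same type and swap it in;
    # the expressed relation, and thus the gold label, is preserved.
    candidates = [c for c in SUBSTITUTES[entity_type] if c != mention]
    return sentence.replace(mention, rng.choice(candidates))

# created(X, Mona_Lisa) still holds after the swap, so a robust RE model
# should predict the same relation for both sentences.
print(substitute_entity("Leonardo da Vinci painted the Mona Lisa",
                        "Leonardo da Vinci", "PERSON"))
```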
Large Language Models (LLMs) are a class of generative AI models built on the Transformer network, capable of leveraging vast datasets to identify, summarize, translate, predict, and generate language. LLMs promise to revolutionize society, yet training these foundational models poses immense challenges. Semantic vector search within large language models is a potent technique that can significantly enhance the accuracy and relevance of search results. Unlike traditional keyword-based search methods, semantic search uses the meaning and context of words to grasp the intent behind queries and deliver more precise outcomes. Elasticsearch, an exceptionally scalable and robust search engine designed for indexing and searching extensive datasets, has emerged as one of the most popular tools for implementing semantic search. In this article, we delve into the fundamentals of semantic search and explore how to harness Elasticsearch and Transformer models to bolster large language model processing paradigms. We aim to provide a comprehensive understanding of semantic search principles along with practical skills for implementing semantic search in real-world model application scenarios.
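A minimal sketch of the pattern the article describes, assuming an Elasticsearch 8.x cluster at localhost:9200 and the all-MiniLM-L6-v2 sentence-transformer (both are assumptions, not the article's required setup): embed documents with a Transformer, index them in a dense_vector field, and answer queries by kNN similarity instead of keyword matching.

```python
from elasticsearch import Elasticsearch
from sentence_transformers import SentenceTransformer

es = Elasticsearch("http://localhost:9200")      # assumed local 8.x cluster
model = SentenceTransformer("all-MiniLM-L6-v2")  # 384-dim sentence embeddings

# Index with a dense_vector field so Elasticsearch can run kNN search over it.
es.indices.create(index="docs", mappings={"properties": {
    "text": {"type": "text"},
    "embedding": {"type": "dense_vector", "dims": 384,
                  "index": True, "similarity": "cosine"}}})

for i, text in enumerate(["LLMs are built on the Transformer network.",
                          "Elasticsearch indexes and searches large datasets."]):
    es.index(index="docs", id=i,
             document={"text": text, "embedding": model.encode(text).tolist()})
es.indices.refresh(index="docs")

# Semantic retrieval: the query shares no keywords with the best match,
# but the embedding captures its meaning.
hits = es.search(index="docs",
                 knn={"field": "embedding",
                      "query_vector": model.encode("neural language models").tolist(),
                      "k": 1, "num_candidates": 10})
print(hits["hits"]["hits"][0]["_source"]["text"])
```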
Graph Neural Networks (GNNs) have gained momentum in graph representation learning and boosted the state of the art in a variety of areas, such as data mining (\emph{e.g.,} social network analysis and recommender systems), computer vision (\emph{e.g.,} object detection and point cloud learning), and natural language processing (\emph{e.g.,} relation extraction and sequence learning), to name a few. With the emergence of Transformers in natural language processing and computer vision, graph Transformers embed a graph structure into the Transformer architecture to overcome the limitations of local neighborhood aggregation while avoiding strict structural inductive biases. In this paper, we present a comprehensive review of GNNs and graph Transformers in computer vision from a task-oriented perspective. Specifically, we divide their applications in computer vision into five categories according to the modality of input data, \emph{i.e.,} 2D natural images, videos, 3D data, vision + language, and medical images. In each category, we further divide the applications according to a set of vision tasks. Such a task-oriented taxonomy allows us to examine how each task is tackled by different GNN-based approaches and how well these approaches perform. Based on the necessary preliminaries, we provide the definitions and challenges of the tasks, in-depth coverage of the representative approaches, as well as discussions regarding insights, limitations, and future directions.
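For readers unfamiliar with the "local neighborhood aggregation" that graph Transformers generalize, here is a toy numpy sketch of one GCN-style layer (our illustration, not drawn from the survey): each node's new feature is a degree-normalized aggregate of its neighbors' features passed through a learned transform.

```python
import numpy as np

A = np.array([[0, 1, 1],
              [1, 0, 0],
              [1, 0, 0]], dtype=np.float32)   # adjacency of a 3-node graph
X = np.random.randn(3, 4).astype(np.float32)  # node features
W = np.random.randn(4, 4).astype(np.float32)  # learnable weights

A_hat = A + np.eye(3, dtype=np.float32)       # add self-loops
D_inv = np.diag(1.0 / A_hat.sum(axis=1))      # mean-normalize by node degree
H = np.maximum(D_inv @ A_hat @ X @ W, 0.0)    # one GCN-style layer with ReLU
```

Each layer only mixes information between directly connected nodes, which is the locality limitation that motivates embedding graph structure into global Transformer attention.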