亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tfoot id='HgroH'></tfoot>

<legend id='9xTGV'><style id='jORxI'><dir id='4q93q'><q id='qo6Wr'></q></dir></style></legend>

<i id='oQiiH'><tr id='Nr3Qo'><dt id='Z8WuM'><q id='D74Cn'><span id='aFsZM'><b id='dbHQC'><form id='ZYEDn'><ins id='9wlTb'></ins><ul id='ooTC5'></ul><sub id='Ql5Y6'></sub></form><legend id='XtlKO'></legend><bdo id='TytGD'><pre id='LUmny'><center id='aCthO'></center></pre></bdo></b><th id='hUkB9'></th></span></q></dt></tr></i><div id='ciTL3'><tfoot id='8IyKL'></tfoot><dl id='DbCI3'><fieldset id='xHENe'></fieldset></dl></div>

·

大語言模型 · 語言模型化 · MoDELS · Automator · AIM ·

2023 年 12 月 21 日

Context Matters: Data-Efficient Augmentation of Large Language Models for Scientific Applications

Xiang Li,Haoran Tang,Siyu Chen,Ziwei Wang,Anurag Maravi,Marcin Abram

from arxiv, 11 pages, 6 figures, 4 tables, 3 pages of supplementary material

In this paper, we explore the challenges inherent to Large Language Models (LLMs) like GPT-4, particularly their propensity for hallucinations, logic mistakes, and incorrect conclusions when tasked with answering complex questions. The capacity of LLMs to present erroneous answers in a coherent and semantically rigorous manner further complicates the detection of factual inaccuracies. This issue is especially pronounced in fields that require specialized expertise. Our work delves into these challenges, aiming to enhance the understanding and mitigation of such errors, thereby contributing to the improvement of LLM accuracy and reliability in scientific and other specialized domains. Our findings reveal a non-linear relationship between the context's relevancy and the answers' measured quality. In addition, we demonstrate that with the correct calibration, it is possible to automate the grading procedure -- a finding suggesting that, at least to some degree, the LLMs can be used to self-examine the quality of their own performance. Finally, we describe an experimental platform that can be seen as a proof-of-concept of the techniques described in this work.

相關內容

大語言模型

大語言模型

大(da)語言(yan)(yan)模(mo)型是基于海量文本(ben)(ben)數據訓(xun)練的(de)(de)深(shen)(shen)度學習模(mo)型。它不僅能(neng)夠(gou)生成自然(ran)語言(yan)(yan)文本(ben)(ben)，還能(neng)夠(gou)深(shen)(shen)入(ru)理(li)(li)解文本(ben)(ben)含義，處理(li)(li)各種(zhong)自然(ran)語言(yan)(yan)任(ren)(ren)務(wu)，如文本(ben)(ben)摘要、問(wen)答、翻(fan)譯等(deng)。2023年，大(da)語言(yan)(yan)模(mo)型及其在人工智(zhi)能(neng)領(ling)域的(de)(de)應用已成為全(quan)球科(ke)技研究的(de)(de)熱點，其在規(gui)模(mo)上(shang)的(de)(de)增長尤為引人注(zhu)目，參數量已從最(zui)初的(de)(de)十幾億躍(yue)升到如今的(de)(de)一萬(wan)億。參數量的(de)(de)提(ti)升使得模(mo)型能(neng)夠(gou)更加(jia)(jia)精(jing)細地(di)捕捉人類語言(yan)(yan)微(wei)妙(miao)之處，更加(jia)(jia)深(shen)(shen)入(ru)地(di)理(li)(li)解人類語言(yan)(yan)的(de)(de)復(fu)雜性(xing)。在過去的(de)(de)一年里，大(da)語言(yan)(yan)模(mo)型在吸納新知識、分解復(fu)雜任(ren)(ren)務(wu)以及圖文對(dui)齊等(deng)多方(fang)面都有顯著提(ti)升。隨著技術的(de)(de)不斷成熟，它將不斷拓展其應用范(fan)圍(wei)，為人類提(ti)供更加(jia)(jia)智(zhi)能(neng)化(hua)和個性(xing)化(hua)的(de)(de)服務(wu)，進(jin)一步改善人們的(de)(de)生活和生產(chan)方(fang)式(shi)。

MoDELS · 語言模型化 · Performer · 任務對話系統 · Notability ·

2024 年 2 月 9 日

LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model

Yichen Zhu,Minjie Zhu,Ning Liu,Zhicai Ou,Xiaofeng Mou,Jian Tang

from arxiv, The datasets were incomplete as they did not include all the necessary copyrights

In this paper, we introduce LLaVA-$\phi$ (LLaVA-Phi), an efficient multi-modal assistant that harnesses the power of the recently advanced small language model, Phi-2, to facilitate multi-modal dialogues. LLaVA-Phi marks a notable advancement in the realm of compact multi-modal models. It demonstrates that even smaller language models, with as few as 2.7B parameters, can effectively engage in intricate dialogues that integrate both textual and visual elements, provided they are trained with high-quality corpora. Our model delivers commendable performance on publicly available benchmarks that encompass visual comprehension, reasoning, and knowledge-based perception. Beyond its remarkable performance in multi-modal dialogue tasks, our model opens new avenues for applications in time-sensitive environments and systems that require real-time interaction, such as embodied agents. It highlights the potential of smaller language models to achieve sophisticated levels of understanding and interaction, while maintaining greater resource efficiency.The project is available at {//github.com/zhuyiche/llava-phi}.

命名實體識別 · 大語言模型 · 解碼 · 語言模型化 · entity ·

2024 年 2 月 9 日

PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition

Jinghui Lu,Ziwei Yang,Yanjie Wang,Xuejing Liu,Can Huang

In this study, we aim to reduce generation latency for Named Entity Recognition (NER) with Large Language Models (LLMs). The main cause of high latency in LLMs is the sequential decoding process, which autoregressively generates all labels and mentions for NER, significantly increase the sequence length. To this end, we introduce Parallel Decoding in LLM for NE} (PaDeLLM-NER), a approach that integrates seamlessly into existing generative model frameworks without necessitating additional modules or architectural modifications. PaDeLLM-NER allows for the simultaneous decoding of all mentions, thereby reducing generation latency. Experiments reveal that PaDeLLM-NER significantly increases inference speed that is 1.76 to 10.22 times faster than the autoregressive approach for both English and Chinese. Simultaneously it maintains the quality of predictions as evidenced by the performance that is on par with the state-of-the-art across various datasets.

Performer · 生成式人工智能 · MoDELS · 相關系數 · AI ·

2024 年 2 月 9 日

The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate

Juhyun Oh,Eunsu Kim,Inha Cha,Alice Oh

This paper explores the assumption that Large Language Models (LLMs) skilled in generation tasks are equally adept as evaluators. We assess the performance of three LLMs and one open-source LM in Question-Answering (QA) and evaluation tasks using the TriviaQA (Joshi et al., 2017) dataset. Results indicate a significant disparity, with LLMs exhibiting lower performance in evaluation tasks compared to generation tasks. Intriguingly, we discover instances of unfaithful evaluation where models accurately evaluate answers in areas where they lack competence, underscoring the need to examine the faithfulness and trustworthiness of LLMs as evaluators. This study contributes to the understanding of "the Generative AI Paradox" (West et al., 2023), highlighting a need to explore the correlation between generative excellence and evaluation proficiency, and the necessity to scrutinize the faithfulness aspect in model evaluations.

圖 · 語言模型化 · 大語言模型 · 泛函 · 邊 ·

2024 年 2 月 8 日

Let Your Graph Do the Talking: Encoding Structured Data for LLMs

Bryan Perozzi,Bahare Fatemi,Dustin Zelle,Anton Tsitsulin,Mehran Kazemi,Rami Al-Rfou,Jonathan Halcrow

How can we best encode structured data into sequential form for use in large language models (LLMs)? In this work, we introduce a parameter-efficient method to explicitly represent structured data for LLMs. Our method, GraphToken, learns an encoding function to extend prompts with explicit structured information. Unlike other work which focuses on limited domains (e.g. knowledge graph representation), our work is the first effort focused on the general encoding of structured data to be used for various reasoning tasks. We show that explicitly representing the graph structure allows significant improvements to graph reasoning tasks. Specifically, we see across the board improvements - up to 73% points - on node, edge and, graph-level tasks from the GraphQA benchmark.

優化器 · Learning · 最優化 · 泛函 · 代價 ·

2024 年 2 月 8 日

DiffTOP: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning

Weikang Wan,Yufei Wang,Zackory Erickson,David Held

This paper introduces DiffTOP, which utilizes Differentiable Trajectory OPtimization as the policy representation to generate actions for deep reinforcement and imitation learning. Trajectory optimization is a powerful and widely used algorithm in control, parameterized by a cost and a dynamics function. The key to our approach is to leverage the recent progress in differentiable trajectory optimization, which enables computing the gradients of the loss with respect to the parameters of trajectory optimization. As a result, the cost and dynamics functions of trajectory optimization can be learned end-to-end. DiffTOP addresses the ``objective mismatch'' issue of prior model-based RL algorithms, as the dynamics model in DiffTOP is learned to directly maximize task performance by differentiating the policy gradient loss through the trajectory optimization process. We further benchmark DiffTOP for imitation learning on standard robotic manipulation task suites with high-dimensional sensory observations and compare our method to feed-forward policy classes as well as Energy-Based Models (EBM) and Diffusion. Across 15 model-based RL tasks and 13 imitation learning tasks with high-dimensional image and point cloud inputs, DiffTOP outperforms prior state-of-the-art methods in both domains.

泛化理論 · 可理解性 · Better · MoDELS · 泛化誤差 ·

2024 年 2 月 7 日

PAC-Chernoff Bounds: Understanding Generalization in the Interpolation Regime

Andrés R. Masegosa,Luis A. Ortega

from arxiv, 34 pages, 10 figures, Pre-print

In this paper, we present a distribution-dependent PAC-Chernoff bound that is perfectly tight for interpolators even under overparametrized model classes. This bound relies on basic principles of Large Deviation Theory and naturally provides a characterization of the smoothness of a model described as a simple real-valued function. Based on this distribution-dependent bound and the novel definition of smoothness, we propose an unifying theoretical explanation of why some interpolators generalize remarkably well while others not. And why a wide range of modern learning techniques (i.e., $\ell_2$-norm, distance-from-initialization, input-gradient and variance regularization together with data augmentation, invariant architectures, and overparameterization) are able to find them. The emergent conclusion is that all these methods provide complimentary procedures that bias the optimizer to smoother interpolators, which, according to this theoretical analysis, are the ones with better generalization error. One of the main insights of this study is that distribution-dependent bounds serve as a powerful tool better understand the complex dynamics behind the generalization capabilities of highly-overparameterized interpolators.

全局優化 · 優化器 · 樣本 · Lipschitz · Continuity ·

2024 年 2 月 7 日

Stein Boltzmann Sampling: A Variational Approach for Global Optimization

Ga?tan Serré,Argyris Kalogeratos,Nicolas Vayatis

In this paper, we introduce a new flow-based method for global optimization of Lipschitz functions, called Stein Boltzmann Sampling (SBS). Our method samples from the Boltzmann distribution that becomes asymptotically uniform over the set of the minimizers of the function to be optimized. Candidate solutions are sampled via the \emph{Stein Variational Gradient Descent} algorithm. We prove the asymptotic convergence of our method, introduce two SBS variants, and provide a detailed comparison with several state-of-the-art global optimization algorithms on various benchmark functions. The design of our method, the theoretical results, and our experiments, suggest that SBS is particularly well-suited to be used as a continuation of efficient global optimization methods as it can produce better solutions while making a good use of the budget.

泛化理論 · Performer · 目標檢測 · 源領域 · 優化器 ·

2024 年 2 月 7 日

G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection

Fan Wu,Jinling Gao,Lanqing Hong,Xinbing Wang,Chenghu Zhou,Nanyang Ye

from arxiv, Accepted by AAAI24

In this paper, we focus on a realistic yet challenging task, Single Domain Generalization Object Detection (S-DGOD), where only one source domain's data can be used for training object detectors, but have to generalize multiple distinct target domains. In S-DGOD, both high-capacity fitting and generalization abilities are needed due to the task's complexity. Differentiable Neural Architecture Search (NAS) is known for its high capacity for complex data fitting and we propose to leverage Differentiable NAS to solve S-DGOD. However, it may confront severe over-fitting issues due to the feature imbalance phenomenon, where parameters optimized by gradient descent are biased to learn from the easy-to-learn features, which are usually non-causal and spuriously correlated to ground truth labels, such as the features of background in object detection data. Consequently, this leads to serious performance degradation, especially in generalizing to unseen target domains with huge domain gaps between the source domain and target domains. To address this issue, we propose the Generalizable loss (G-loss), which is an OoD-aware objective, preventing NAS from over-fitting by using gradient descent to optimize parameters not only on a subset of easy-to-learn features but also the remaining predictive features for generalization, and the overall framework is named G-NAS. Experimental results on the S-DGOD urban-scene datasets demonstrate that the proposed G-NAS achieves SOTA performance compared to baseline methods. Codes are available at //github.com/wufan-cse/G-NAS.

NeRF · FAST · MoDELS · Extensibility · 相互獨立的 ·

2024 年 2 月 7 日

BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery

Huiqing Zhang,Yifei Xue,Ming Liao,Yizhen Lao

In this study, we introduce BirdNeRF, an adaptation of Neural Radiance Fields (NeRF) designed specifically for reconstructing large-scale scenes using aerial imagery. Unlike previous research focused on small-scale and object-centric NeRF reconstruction, our approach addresses multiple challenges, including (1) Addressing the issue of slow training and rendering associated with large models. (2) Meeting the computational demands necessitated by modeling a substantial number of images, requiring extensive resources such as high-performance GPUs. (3) Overcoming significant artifacts and low visual fidelity commonly observed in large-scale reconstruction tasks due to limited model capacity. Specifically, we present a novel bird-view pose-based spatial decomposition algorithm that decomposes a large aerial image set into multiple small sets with appropriately sized overlaps, allowing us to train individual NeRFs of sub-scene. This decomposition approach not only decouples rendering time from the scene size but also enables rendering to scale seamlessly to arbitrarily large environments. Moreover, it allows for per-block updates of the environment, enhancing the flexibility and adaptability of the reconstruction process. Additionally, we propose a projection-guided novel view re-rendering strategy, which aids in effectively utilizing the independently trained sub-scenes to generate superior rendering results. We evaluate our approach on existing datasets as well as against our own drone footage, improving reconstruction speed by 10x over classical photogrammetry software and 50x over state-of-the-art large-scale NeRF solution, on a single GPU with similar rendering quality.

Learning · 強化學習 · 樣本 · INFORMS · state-of-the-art ·

2024 年 2 月 6 日

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

Ahmadreza Moradipari,Mohammad Pedramfar,Modjtaba Shokrian Zini,Vaneet Aggarwal

from arxiv, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

In this paper, we prove the first Bayesian regret bounds for Thompson Sampling in reinforcement learning in a multitude of settings. We simplify the learning problem using a discrete set of surrogate environments, and present a refined analysis of the information ratio using posterior consistency. This leads to an upper bound of order $\widetilde{O}(H\sqrt{d_{l_1}T})$ in the time inhomogeneous reinforcement learning problem where $H$ is the episode length and $d_{l_1}$ is the Kolmogorov $l_1-$dimension of the space of environments. We then find concrete bounds of $d_{l_1}$ in a variety of settings, such as tabular, linear and finite mixtures, and discuss how how our results are either the first of their kind or improve the state-of-the-art.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

大語(yu)言(yan)模型

語言(yan)模型化

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='9p71c'></tfoot>

<legend id='9p71c'><style id='9p71c'><dir id='9p71c'><q id='9p71c'></q></dir></style></legend>

<i id='9p71c'><tr id='9p71c'><dt id='9p71c'><q id='9p71c'><span id='9p71c'><b id='9p71c'><form id='9p71c'><ins id='9p71c'></ins><ul id='9p71c'></ul><sub id='9p71c'></sub></form><legend id='9p71c'></legend><bdo id='9p71c'><pre id='9p71c'><center id='9p71c'></center></pre></bdo></b><th id='9p71c'></th></span></q></dt></tr></i><div id='9p71c'><tfoot id='9p71c'></tfoot><dl id='9p71c'><fieldset id='9p71c'></fieldset></dl></div>

<li id='9p71c'><abbr id='9p71c'></abbr></li>