亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tfoot id='59p7z'></tfoot>

<legend id='59p7z'><style id='59p7z'><dir id='59p7z'><q id='59p7z'></q></dir></style></legend>

<i id='59p7z'><tr id='59p7z'><dt id='59p7z'><q id='59p7z'><span id='59p7z'><b id='59p7z'><form id='59p7z'><ins id='59p7z'></ins><ul id='59p7z'></ul><sub id='59p7z'></sub></form><legend id='59p7z'></legend><bdo id='59p7z'><pre id='59p7z'><center id='59p7z'></center></pre></bdo></b><th id='59p7z'></th></span></q></dt></tr></i><div id='59p7z'><tfoot id='59p7z'></tfoot><dl id='59p7z'><fieldset id='59p7z'></fieldset></dl></div>

<li id='59p7z'><abbr id='59p7z'></abbr></li>

·

大語言模型 · 語言模型化 · MoDELS · Extensibility · Performer ·

2024 年 3 月 5 日

Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

Sang T. Truong,Duc Q. Nguyen,Toan Nguyen,Dong D. Le,Nhi N. Truong,Tho Quan,Sanmi Koyejo

from arxiv, 33 pages

Recent advancements in large language models (LLMs) have underscored their importance in the evolution of artificial intelligence. However, despite extensive pretraining on multilingual datasets, available open-sourced LLMs exhibit limited effectiveness in processing Vietnamese. The challenge is exacerbated by the absence of systematic benchmark datasets and metrics tailored for Vietnamese LLM evaluation. To mitigate these issues, we have finetuned LLMs specifically for Vietnamese and developed a comprehensive evaluation framework encompassing 10 common tasks and 31 metrics. Our evaluation results reveal that the fine-tuned LLMs exhibit enhanced comprehension and generative capabilities in Vietnamese. Moreover, our analysis indicates that models with more parameters can introduce more biases and uncalibrated outputs and the key factor influencing LLM performance is the quality of the training or fine-tuning datasets. These insights underscore the significance of meticulous fine-tuning with high-quality datasets in enhancing LLM performance.

相關內容

大語言模型

大語(yu)言模型(xing)

大(da)語(yu)(yu)言(yan)(yan)模(mo)型(xing)(xing)是基于海量(liang)文(wen)(wen)本(ben)(ben)數(shu)據訓練(lian)的(de)深(shen)(shen)度學(xue)習模(mo)型(xing)(xing)。它(ta)不(bu)僅能夠生成(cheng)自然語(yu)(yu)言(yan)(yan)文(wen)(wen)本(ben)(ben)，還能夠深(shen)(shen)入理(li)解文(wen)(wen)本(ben)(ben)含義(yi)，處(chu)理(li)各種自然語(yu)(yu)言(yan)(yan)任務(wu)，如文(wen)(wen)本(ben)(ben)摘要、問答、翻譯等。2023年，大(da)語(yu)(yu)言(yan)(yan)模(mo)型(xing)(xing)及其在(zai)人(ren)(ren)工智能領域的(de)應(ying)用已成(cheng)為(wei)(wei)全球科技(ji)研究的(de)熱(re)點，其在(zai)規模(mo)上的(de)增長(chang)尤為(wei)(wei)引人(ren)(ren)注(zhu)目，參(can)數(shu)量(liang)已從最(zui)初的(de)十幾億躍升到(dao)如今的(de)一萬億。參(can)數(shu)量(liang)的(de)提升使得模(mo)型(xing)(xing)能夠更加(jia)(jia)精(jing)細(xi)地捕(bu)捉人(ren)(ren)類(lei)語(yu)(yu)言(yan)(yan)微妙之處(chu)，更加(jia)(jia)深(shen)(shen)入地理(li)解人(ren)(ren)類(lei)語(yu)(yu)言(yan)(yan)的(de)復(fu)雜性(xing)(xing)。在(zai)過去的(de)一年里(li)，大(da)語(yu)(yu)言(yan)(yan)模(mo)型(xing)(xing)在(zai)吸納新知識、分解復(fu)雜任務(wu)以及圖文(wen)(wen)對(dui)齊(qi)等多方面都有顯著(zhu)提升。隨(sui)著(zhu)技(ji)術(shu)的(de)不(bu)斷成(cheng)熟(shu)，它(ta)將不(bu)斷拓展其應(ying)用范圍，為(wei)(wei)人(ren)(ren)類(lei)提供更加(jia)(jia)智能化(hua)和(he)個性(xing)(xing)化(hua)的(de)服務(wu)，進一步(bu)改(gai)善人(ren)(ren)們的(de)生活(huo)和(he)生產方式。

語言模型化 · MoDELS · 向量化 · 輸出 · Extensibility ·

2024 年 4 月 16 日

DESTEIN: Navigating Detoxification of Language Models via Universal Steering Pairs and Head-wise Activation Fusion

Yu Li,Zhihua Wei,Han Jiang,Chuanyang Gong

Despite the remarkable achievements of language models (LMs) across a broad spectrum of tasks, their propensity for generating toxic outputs remains a prevalent concern. Current solutions involving fine-tuning or auxiliary models usually require extensive memory and computational resources, rendering them less practical for deployment in large language models (LLMs). In this paper, we propose DeStein, a novel method that detoxififies LMs by altering their internal representations in the activation space with lower resource and time cost. Specifically, we leverage self-induced steering pairs to identify detoxification vectors through arithmetic operations in the activation space. During inference, detoxification is achieved by blending the detoxification vectors with the original representations. Empirical results demonstrate that our method significantly outperforms previous state-of-the-art approaches on popular detoxification metrics, while also maintaining satisfactory generation quality and diversity. Furthermore, we extend our method to multiple LLMs, demonstrating its practicality and scalability. Warning: some example model outputs contain highly offensive or disturbing text.

Processing（編程語言） · 生成式人工智能 · AI · HTTPS · 語言模型化 ·

2024 年 4 月 13 日

Three Disclaimers for Safe Disclosure: A Cardwriter for Reporting the Use of Generative AI in Writing Process

Won Ik Cho,Eunjung Cho,Hyeonji Shin

from arxiv, 6 pages; an implementation version of PaperCard project

Generative artificial intelligence (AI) and large language models (LLMs) are increasingly being used in the academic writing process. This is despite the current lack of unified framework for reporting the use of machine assistance. In this work, we propose "Cardwriter", an intuitive interface that produces a short report for authors to declare their use of generative AI in their writing process. The demo is available online, at //cardwriter.vercel.app

大語言模型 · 可辨認的 · INTERACT · TOOLS · 可理解性 ·

2024 年 4 月 4 日

Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations

Mahjabin Nahar,Haeseung Seo,Eun-Ju Lee,Aiping Xiong,Dongwon Lee

The widespread adoption and transformative effects of large language models (LLMs) have sparked concerns regarding their capacity to produce inaccurate and fictitious content, referred to as `hallucinations'. Given the potential risks associated with hallucinations, humans should be able to identify them. This research aims to understand the human perception of LLM hallucinations by systematically varying the degree of hallucination (genuine, minor hallucination, major hallucination) and examining its interaction with warning (i.e., a warning of potential inaccuracies: absent vs. present). Participants (N=419) from Prolific rated the perceived accuracy and engaged with content (e.g., like, dislike, share) in a Q/A format. Results indicate that humans rank content as truthful in the order genuine > minor hallucination > major hallucination and user engagement behaviors mirror this pattern. More importantly, we observed that warning improves hallucination detection without significantly affecting the perceived truthfulness of genuine content. We conclude by offering insights for future tools to aid human detection of hallucinations.

有偏 · GROUP · MoDELS · 大語言模型 · Performer ·

2024 年 4 月 4 日

Reevaluating Bias Detection in Language Models: The Role of Implicit Norm

Farnaz Kohankhaki,Jacob-Junqi Tian,David Emerson,Laleh Seyyed-Kalantari,Faiza Khan Khattak

Large language models (LLMs), trained on vast datasets, can carry biases that manifest in various forms, from overt discrimination to implicit stereotypes. One facet of bias is performance disparities in LLMs, often harming underprivileged groups, such as racial minorities. A common approach to quantifying bias is to use template-based bias probes, which explicitly state group membership (e.g. White) and evaluate if the outcome of a task, sentiment analysis for instance, is invariant to the change of group membership (e.g. change White race to Black). This approach is widely used in bias quantification. However, in this work, we find evidence of an unexpectedly overlooked consequence of using template-based probes for LLM bias quantification. We find that in doing so, text examples associated with White ethnicities appear to be classified as exhibiting negative sentiment at elevated rates. We hypothesize that the scenario arises artificially through a mismatch between the pre-training text of LLMs and the templates used to measure bias through reporting bias, unstated norms that imply group membership without explicit statement. Our finding highlights the potential misleading impact of varying group membership through explicit mention in bias quantification

MoDELS · 語言模型化 · 大語言模型 · Processing（編程語言） · 數據集 ·

2024 年 4 月 3 日

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Haoran Sun,Lixin Liu,Junjie Li,Fengyu Wang,Baohua Dong,Ran Lin,Ruohui Huang

The ability of large language models (LLMs) to follow instructions is crucial to real-world applications. Despite recent advances, several studies have highlighted that LLMs struggle when faced with challenging instructions, especially those that include complex constraints, hindering their effectiveness in various tasks. To address this challenge, we introduce Conifer, a novel instruction tuning dataset, designed to enhance LLMs to follow multi-level instructions with complex constraints. Utilizing GPT-4, we curate the dataset by a series of LLM-driven refinement processes to ensure high quality. We also propose a progressive learning scheme that emphasizes an easy-to-hard progression, and learning from process feedback. Models trained with Conifer exhibit remarkable improvements in instruction-following abilities, especially for instructions with complex constraints. On several instruction-following benchmarks, our 7B model outperforms the state-of-the-art open-source 7B models, even exceeds the performance of models 10 times larger on certain metrics. All the code and Conifer dataset are available at //www.github.com/ConiferLM/Conifer.

Agent · INTERACT · 多樣性 · Processing（編程語言） · 設計 ·

2024 年 4 月 3 日

Evolving Agents: Interactive Simulation of Dynamic and Diverse Human Personalities

Jiale Li,Jiayang Li,Jiahao Chen,Yifan Li,Shijie Wang,Hugo Zhou,Minjun Ye,Yunsheng Su

Human-like Agents with diverse and dynamic personality could serve as an important design probe in the process of user-centered design, thereby enabling designers to enhance the user experience of interactive application.In this article, we introduce Evolving Agents, a novel agent architecture that consists of two systems: Personality and Behavior. The Personality system includes three modules: Cognition, Emotion and Character Growth. The Behavior system comprises two modules: Planning and Action. We also build a simulation platform that enables agents to interact with the environment and other agents. Evolving Agents can simulate the human personality evolution process. Compared to its initial state, agents' personality and behavior patterns undergo believable development after several days of simulation. Agents reflect on their behavior to reason and develop new personality traits. These traits, in turn, generate new behavior patterns, forming a feedback loop-like personality evolution.In our experiment, we utilized simulation platform with 10 agents for evaluation. During the evaluation, these agents experienced believable and inspirational personality evolution. Through ablation and control experiments, we demonstrated the outstanding effectiveness of agent personality evolution and all modules of our agent architecture contribute to creating believable human-like agents with diverse and dynamic personalities. We also demonstrated through workshops how Evolving Agents could inspire designers.

語言模型化 · MoDELS · INFORMS · 泛函 · TOOLS ·

2023 年 7 月 31 日

When Large Language Models Meet Personalization: Perspectives of Challenges and Opportunities

Jin Chen,Zheng Liu,Xu Huang,Chenwang Wu,Qi Liu,Gangwei Jiang,Yuanhao Pu,Yuxuan Lei,Xiaolong Chen,Xingmei Wang,Defu Lian,Enhong Chen

The advent of large language models marks a revolutionary breakthrough in artificial intelligence. With the unprecedented scale of training and model parameters, the capability of large language models has been dramatically improved, leading to human-like performances in understanding, language synthesizing, and common-sense reasoning, etc. Such a major leap-forward in general AI capacity will change the pattern of how personalization is conducted. For one thing, it will reform the way of interaction between humans and personalization systems. Instead of being a passive medium of information filtering, large language models present the foundation for active user engagement. On top of such a new foundation, user requests can be proactively explored, and user's required information can be delivered in a natural and explainable way. For another thing, it will also considerably expand the scope of personalization, making it grow from the sole function of collecting personalized information to the compound function of providing personalized services. By leveraging large language models as general-purpose interface, the personalization systems may compile user requests into plans, calls the functions of external tools to execute the plans, and integrate the tools' outputs to complete the end-to-end personalization tasks. Today, large language models are still being developed, whereas the application in personalization is largely unexplored. Therefore, we consider it to be the right time to review the challenges in personalization and the opportunities to address them with LLMs. In particular, we dedicate this perspective paper to the discussion of the following aspects: the development and challenges for the existing personalization system, the newly emerged capabilities of large language models, and the potential ways of making use of large language models for personalization.

語言模型化 · Taxonomy · MoDELS · motivation · 評論員 ·

2023 年 5 月 31 日

Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models

Chen Ling,Xujiang Zhao,Jiaying Lu,Chengyuan Deng,Can Zheng,Junxiang Wang,Tanmoy Chowdhury,Yun Li,Hejie Cui,Xuchao Zhang,Tianjiao Zhao,Amit Panalkar,Wei Cheng,Haoyu Wang,Yanchi Liu,Zhengzhang Chen,Haifeng Chen,Chris White,Quanquan Gu,Carl Yang,Liang Zhao

Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. The great promise of LLMs as general task solvers motivated people to extend their functionality largely beyond just a ``chatbot'', and use it as an assistant or even replacement for domain experts and tools in specific domains such as healthcare, finance, and education. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of domain objectives, and the diversity of the constraints (e.g., various social norms, cultural conformity, religious beliefs, and ethical standards in the domain applications). To fill such a gap, explosively-increase research, and practices have been conducted in very recent years on the domain specialization of LLMs, which, however, calls for a comprehensive and systematic review to better summarizes and guide this promising domain. In this survey paper, first, we propose a systematic taxonomy that categorizes the LLM domain-specialization techniques based on the accessibility to LLMs and summarizes the framework for all the subcategories as well as their relations and differences to each other. We also present a comprehensive taxonomy of critical application domains that can benefit from specialized LLMs, discussing their practical significance and open challenges. Furthermore, we offer insights into the current research status and future trends in this area.

知識 (knowledge) · 圖 · 數學 · 表示 · 知識圖譜 ·

2022 年 11 月 7 日

Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces

Jiahang Cao,Jinyuan Fang,Zaiqiao Meng,Shangsong Liang

from arxiv, 32 pages, 6 figures

Knowledge graph embedding (KGE) is a increasingly popular technique that aims to represent entities and relations of knowledge graphs into low-dimensional semantic spaces for a wide spectrum of applications such as link prediction, knowledge reasoning and knowledge completion. In this paper, we provide a systematic review of existing KGE techniques based on representation spaces. Particularly, we build a fine-grained classification to categorise the models based on three mathematical perspectives of the representation spaces: (1) Algebraic perspective, (2) Geometric perspective, and (3) Analytical perspective. We introduce the rigorous definitions of fundamental mathematical spaces before diving into KGE models and their mathematical properties. We further discuss different KGE methods over the three categories, as well as summarise how spatial advantages work over different embedding needs. By collating the experimental results from downstream tasks, we also explore the advantages of mathematical space in different scenarios and the reasons behind them. We further state some promising research directions from a representation space perspective, with which we hope to inspire researchers to design their KGE models as well as their related applications with more consideration of their mathematical space properties.

知識 (knowledge) · Processing（編程語言） · 圖 · NLP · 知識圖譜 ·

2022 年 9 月 30 日

A Decade of Knowledge Graphs in Natural Language Processing: A Survey

Phillip Schneider,Tim Schopf,Juraj Vladika,Mikhail Galkin,Elena Simperl,Florian Matthes

from arxiv, Accepted to AACL-IJCNLP 2022

In pace with developments in the research field of artificial intelligence, knowledge graphs (KGs) have attracted a surge of interest from both academia and industry. As a representation of semantic relations between entities, KGs have proven to be particularly relevant for natural language processing (NLP), experiencing a rapid spread and wide adoption within recent years. Given the increasing amount of research work in this area, several KG-related approaches have been surveyed in the NLP research community. However, a comprehensive study that categorizes established topics and reviews the maturity of individual research streams remains absent to this day. Contributing to closing this gap, we systematically analyzed 507 papers from the literature on KGs in NLP. Our survey encompasses a multifaceted review of tasks, research types, and contributions. As a result, we present a structured overview of the research landscape, provide a taxonomy of tasks, summarize our findings, and highlight directions for future work.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

大語言(yan)模型

語言(yan)模(mo)型化

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tr id='59p7z'><strong id='59p7z'></strong><small id='59p7z'></small><button id='59p7z'></button><li id='59p7z'><noscript id='59p7z'><big id='59p7z'></big><dt id='59p7z'></dt></noscript></li></tr><ol id='59p7z'><option id='59p7z'><table id='59p7z'><blockquote id='59p7z'><tbody id='59p7z'></tbody></blockquote></table></option></ol><u id='59p7z'></u><kbd id='59p7z'><kbd id='59p7z'></kbd></kbd>

<code id='59p7z'><strong id='59p7z'></strong></code>

<fieldset id='59p7z'></fieldset>

<span id='59p7z'></span>

<ins id='59p7z'></ins>

<acronym id='59p7z'><em id='59p7z'></em><td id='59p7z'><div id='59p7z'></div></td></acronym><address id='59p7z'><big id='59p7z'><big id='59p7z'></big><legend id='59p7z'></legend></big></address>

<i id='59p7z'><div id='59p7z'><ins id='59p7z'></ins></div></i>

<i id='59p7z'></i>