The field of text generation suffers from a severe shortage of labeled data due to the extremely expensive and time-consuming process involved in manual annotation. A natural approach for coping with this problem is active learning (AL), a well-known machine learning technique for improving annotation efficiency by selectively choosing the most informative examples to label. However, while AL has been well-researched in the context of text classification, its application to text generation has remained largely unexplored. In this paper, we present a first systematic study of active learning for text generation, considering a diverse set of tasks and multiple leading AL strategies. Our results indicate that existing AL strategies, despite their success in classification, are largely ineffective for the text generation scenario, and fail to consistently surpass the baseline of random example selection. We highlight some notable differences between the classification and generation scenarios, and analyze the selection behaviors of existing AL strategies. Our findings motivate exploring novel approaches for applying AL to NLG tasks.
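As a rough illustration of the selection step that the abstract contrasts with random selection, the sketch below ranks unlabeled examples by a least-confidence score. It is not taken from the paper: the function names, the `pool` data, and the choice of least-confidence scoring over classification-style probabilities are all assumptions made for this example.

```python
import random


def least_confidence(probs):
    """Uncertainty score: 1 minus the probability of the most likely label."""
    return 1.0 - max(probs)


def select_batch(pool_probs, batch_size, strategy="least_confidence", seed=0):
    """Return sorted indices of the pool examples to send for annotation.

    pool_probs holds one predicted probability distribution per unlabeled
    example (hypothetical model outputs, not real results).
    """
    indices = list(range(len(pool_probs)))
    if strategy == "random":
        # Baseline: pick examples uniformly at random.
        return sorted(random.Random(seed).sample(indices, batch_size))
    # Rank by descending uncertainty and keep the top batch_size examples.
    ranked = sorted(
        indices, key=lambda i: least_confidence(pool_probs[i]), reverse=True
    )
    return sorted(ranked[:batch_size])


# Toy pool of model predictions over two labels (made-up numbers).
pool = [
    [0.98, 0.02],  # confident prediction
    [0.55, 0.45],  # uncertain prediction
    [0.90, 0.10],
    [0.51, 0.49],  # most uncertain prediction
]
chosen = select_batch(pool, batch_size=2)
baseline = select_batch(pool, batch_size=2, strategy="random")
```

Here `chosen` contains the two least confident examples, while `baseline` is a random pair of the same size. For generation tasks there is no single label distribution per example, so a score of this kind would have to be adapted, for instance by aggregating token-level probabilities.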

Related content

Active learning

Active learning is a subfield of machine learning (and, more generally, of artificial intelligence); in statistics it is also called query learning or optimal experimental design. The "learning module" and the "selection strategy" are the two basic and important components of an active learning algorithm. Active learning is "a method of learning in which students are actively or experientially involved in the learning process, with different levels of active learning depending on student involvement" (Bonwell & Eison 1991). Bonwell & Eison (1991) state that "students participate in activities other than passively listening." In a report for the Association for the Study of Higher Education (ASHE), the authors discuss a variety of methods for promoting active learning. They cite literature indicating that, in order to learn, students must do more than just listen: they must read, write, discuss, and engage in solving problems. This process involves three learning domains, namely knowledge, skills, and attitudes (KSA). This taxonomy of learning behaviors can be thought of as "the goals of the learning process." In particular, students must engage in higher-order thinking tasks such as analysis, synthesis, and evaluation.

Performer · MoDELS · Extensibility · Automator · Reducible

July 11, 2023

Faithful Low-Resource Data-to-Text Generation through Cycle Training

Zhuoer Wang,Marcus Collins,Nikhita Vedula,Simone Filice,Shervin Malmasi,Oleg Rokhlenko

from arxiv, 19 pages, 4 figures, ACL 2023

Methods to generate text from structured data have advanced significantly in recent years, primarily due to fine-tuning of pre-trained language models on large datasets. However, such models can fail to produce output faithful to the input data, particularly on out-of-domain data. Sufficient annotated data is often not available for specific domains, leading us to seek an unsupervised approach to improve the faithfulness of output text. Since the problem is fundamentally one of consistency between the representations of the structured data and text, we evaluate the effectiveness of cycle training in this work. Cycle training uses two models which are inverses of each other: one that generates text from structured data, and one that generates the structured data from natural language text. We show that cycle training, when initialized with a small amount of supervised data (100 samples in our case), achieves nearly the same performance as fully supervised approaches for the data-to-text generation task on the WebNLG, E2E, WTQ, and WSQL datasets. We perform extensive empirical analysis with automated evaluation metrics and a newly designed human evaluation schema to reveal how effectively different cycle training strategies reduce various types of generation errors. Our code is publicly available at //github.com/Edillower/CycleNLG.

Performer · Speech recognition · MoDELS · Language modeling · Model performance

July 9, 2023

Can Generative Large Language Models Perform ASR Error Correction?

Rao Ma,Mengjie Qian,Potsawee Manakul,Mark Gales,Kate Knill

ASR error correction continues to serve as an important part of post-processing for speech recognition systems. Traditionally, these models are trained with supervised training using the decoding results of the underlying ASR system and the reference text. This approach is computationally intensive and the model needs to be re-trained when switching the underlying ASR model. Recent years have seen the development of large language models and their ability to perform natural language processing tasks in a zero-shot manner. In this paper, we take ChatGPT as an example to examine its ability to perform ASR error correction in the zero-shot or 1-shot settings. We use the ASR N-best list as model input and propose unconstrained error correction and N-best constrained error correction methods. Results on a Conformer-Transducer model and the pre-trained Whisper model show that we can largely improve the ASR system performance with error correction using the powerful ChatGPT model.

Language modeling · MoDELS · Performer · state-of-the-art · Guidance

July 8, 2023

Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task

Fanyi Qu,Yunfang Wu

Large-scale language models (LLMs) have shown remarkable capability in a variety of Natural Language Processing (NLP) tasks and have attracted much attention recently. However, some studies indicate that large language models fail to surpass state-of-the-art models in English grammatical error correction (GEC) tasks. In this report, we aim to explore how large language models perform on Chinese grammatical error correction tasks and provide guidance for future work. We conduct experiments with 3 LLMs of different model scales on 4 Chinese GEC datasets. Our experimental results indicate that the performance of LLMs on automatic evaluation metrics falls short of the previous SOTA models because of the problem of over-correction. Furthermore, we also discover notable variations in the performance of LLMs when evaluated on different data distributions. Our findings demonstrate that further investigation is required for the application of LLMs to the Chinese GEC task.

Knowledge · Processing (programming language) · Graph · NLP · Knowledge graph

September 30, 2022

A Decade of Knowledge Graphs in Natural Language Processing: A Survey

Phillip Schneider,Tim Schopf,Juraj Vladika,Mikhail Galkin,Elena Simperl,Florian Matthes

from arxiv, Accepted to AACL-IJCNLP 2022

In pace with developments in the research field of artificial intelligence, knowledge graphs (KGs) have attracted a surge of interest from both academia and industry. As a representation of semantic relations between entities, KGs have proven to be particularly relevant for natural language processing (NLP), experiencing a rapid spread and wide adoption within recent years. Given the increasing amount of research work in this area, several KG-related approaches have been surveyed in the NLP research community. However, a comprehensive study that categorizes established topics and reviews the maturity of individual research streams remains absent to this day. Contributing to closing this gap, we systematically analyzed 507 papers from the literature on KGs in NLP. Our survey encompasses a multifaceted review of tasks, research types, and contributions. As a result, we present a structured overview of the research landscape, provide a taxonomy of tasks, summarize our findings, and highlight directions for future work.

Learning · Vision · Deep learning · Attention mechanism · Computer vision

December 22, 2021

A Survey of Natural Language Generation

Chenhe Dong,Yinghui Li,Haifan Gong,Miaoxin Chen,Junxin Li,Ying Shen,Min Yang

from arxiv, 36 pages, 4 tables; Under review

This paper offers a comprehensive review of the research on Natural Language Generation (NLG) over the past two decades, especially in relation to data-to-text generation and text-to-text generation deep learning methods, as well as new applications of NLG technology. This survey aims to (a) give the latest synthesis of deep learning research on the NLG core tasks, as well as the architectures adopted in the field; (b) detail meticulously and comprehensively various NLG tasks and datasets, and draw attention to the challenges in NLG evaluation, focusing on different evaluation methods and their relationships; (c) highlight some future emphasis and relatively recent research issues that arise due to the increasing synergy between NLG and other artificial intelligence areas, such as computer vision, text and computational creativity.

Machine Learning · Principle · Understandability · Learning · Supervision

November 16, 2020

A Survey on the Explainability of Supervised Machine Learning

Nadia Burkart,Marco F. Huber

from arxiv, Accepted for publication at the Journal of Artificial Intelligence Research (JAIR)

Predictions obtained by, e.g., artificial neural networks have a high accuracy, but humans often perceive the models as black boxes. Insights about the decision making are mostly opaque for humans. Understanding the decision making is of paramount importance, particularly in highly sensitive areas such as healthcare or finance. The decision-making behind the black boxes needs to be more transparent, accountable, and understandable for humans. This survey paper provides essential definitions and an overview of the different principles and methodologies of explainable Supervised Machine Learning (SML). We conduct a state-of-the-art survey that reviews past and recent explainable SML approaches and classifies them according to the introduced definitions. Finally, we illustrate principles by means of an explanatory case study and discuss important future directions.

Performer · MoDELS · Integration · seq2seq · Output

October 9, 2020

A Survey of Knowledge-Enhanced Text Generation

Wenhao Yu,Chenguang Zhu,Zaitang Li,Zhiting Hu,Qingyun Wang,Heng Ji,Meng Jiang

from arxiv, 44 pages; Preprint; A paper and code collection is available at //github.com/wyu97/KENLG-Reading

The goal of text generation is to make machines express themselves in human language. It is one of the most important yet challenging tasks in natural language processing (NLP). Since 2014, various neural encoder-decoder models pioneered by Seq2Seq have been proposed to achieve the goal by learning to map input text to output text. However, the input text alone often provides limited knowledge to generate the desired output, so the performance of text generation is still far from satisfactory in many real-world scenarios. To address this issue, researchers have considered incorporating various forms of knowledge beyond the input text into the generation models. This research direction is known as knowledge-enhanced text generation. In this survey, we present a comprehensive review of the research on knowledge-enhanced text generation over the past five years. The main content includes two parts: (i) general methods and architectures for integrating knowledge into text generation; (ii) specific techniques and applications according to different forms of knowledge data. This survey is intended for a broad audience of researchers and practitioners in academia and industry.

Named entity recognition · entity · Learning · Deep learning · Recognizable

March 13, 2020

A Survey on Deep Learning for Named Entity Recognition

Jing Li,Aixin Sun,Jianglei Han,Chenliang Li

from arxiv, 20 pages, 12 figures, 3 tables. arXiv admin note: text overlap with arXiv:1702.02098, arXiv:1904.10503 by other authors

Named entity recognition (NER) is the task of identifying text spans that mention named entities and classifying them into predefined categories such as person, location, organization, etc. NER serves as the basis for a variety of natural language applications such as question answering, text summarization, and machine translation. Although early NER systems are successful in producing decent recognition accuracy, they often require much human effort in carefully designing rules or features. In recent years, deep learning, empowered by continuous real-valued vector representations and semantic composition through nonlinear processing, has been employed in NER systems, yielding state-of-the-art performance. In this paper, we provide a comprehensive review of existing deep learning techniques for NER. We first introduce NER resources, including tagged NER corpora and off-the-shelf NER tools. Then, we systematically categorize existing works based on a taxonomy along three axes: distributed representations for input, context encoder, and tag decoder. Next, we survey the most representative methods for recently applied deep learning techniques in new NER problem settings and applications. Finally, we present readers with the challenges faced by NER systems and outline future directions in this area.
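To make the notions of "text spans" and "predefined categories" concrete, here is a minimal sketch of how per-token tags can be turned into entity spans. It assumes the common BIO tagging scheme, which the abstract does not itself name; the function `bio_to_spans` and the example sentence are hypothetical and not taken from the survey.

```python
def bio_to_spans(tokens, tags):
    """Collect (entity_text, label) pairs from a BIO-tagged token sequence."""
    spans = []
    current_tokens, current_label = [], None
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A "B-" tag starts a new entity, closing any open one first.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_label))
            current_tokens, current_label = [token], tag[2:]
        elif tag.startswith("I-") and current_label == tag[2:]:
            # An "I-" tag with the same label continues the open entity.
            current_tokens.append(token)
        else:
            # An "O" tag or an inconsistent "I-" tag closes any open entity;
            # the token itself is not added to a span.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_label))
            current_tokens, current_label = [], None
    if current_tokens:
        spans.append((" ".join(current_tokens), current_label))
    return spans


tokens = ["Ada", "Lovelace", "lived", "in", "London"]
tags = ["B-PER", "I-PER", "O", "O", "B-LOC"]
entities = bio_to_spans(tokens, tags)
```

With these inputs, `entities` holds one person span ("Ada Lovelace") and one location span ("London"). In a neural NER system the tags would come from the tag decoder rather than being written by hand.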

Few-shot learning · MoDELS · Pivotal (company) · Scenario · Annotation

February 27, 2020

Few-shot Natural Language Generation for Task-Oriented Dialog

Baolin Peng,Chenguang Zhu,Chunyuan Li,Xiujun Li,Jinchao Li,Michael Zeng,Jianfeng Gao

from arxiv, Project website: //aka.ms/scgpt ; Code and data: //github.com/pengbaolin/SC-GPT

As a crucial component in task-oriented dialog systems, the Natural Language Generation (NLG) module converts a dialog act represented in a semantic form into a response in natural language. The success of traditional template-based or statistical models typically relies on heavily annotated data, which is infeasible for new domains. Therefore, it is pivotal for an NLG system to generalize well with limited labelled data in real applications. To this end, we present FewShotWoz, the first NLG benchmark to simulate the few-shot learning setting in task-oriented dialog systems. Further, we develop the SC-GPT model. It is pre-trained on a large annotated NLG corpus to acquire the controllable generation ability, and fine-tuned with only a few domain-specific labels to adapt to new domains. Experiments on FewShotWoz and the large Multi-Domain-WOZ datasets show that the proposed SC-GPT significantly outperforms existing methods, as measured by various automatic metrics and human evaluations.

Graph · INFORMS · Knowledge graph · Transformation · Information extraction

April 4, 2019

Text Generation from Knowledge Graphs with Graph Transformers

Rik Koncel-Kedziorski,Dhanush Bekal,Yi Luan,Mirella Lapata,Hannaneh Hajishirzi

from arxiv, Accepted as a long paper in NAACL 2019

Generating texts which express complex ideas spanning multiple sentences requires a structured representation of their content (document plan), but these representations are prohibitively expensive to manually produce. In this work, we address the problem of generating coherent multi-sentence texts from the output of an information extraction system, and in particular a knowledge graph. Graphical knowledge representations are ubiquitous in computing, but pose a significant challenge for text generation techniques due to their non-hierarchical nature, collapsing of long-distance dependencies, and structural variety. We introduce a novel graph transforming encoder which can leverage the relational structure of such knowledge graphs without imposing linearization or hierarchical constraints. Incorporated into an encoder-decoder setup, we provide an end-to-end trainable system for graph-to-text generation that we apply to the domain of scientific text. Automatic and human evaluations show that our technique produces more informative texts which exhibit better document structure than competitive encoder-decoder methods.