Automatic text summarization has been studied in a variety of domains and languages. However, this is not the case for the Russian language. To fill this gap, we present Gazeta, the first dataset for summarization of Russian news. We describe the properties of this dataset and benchmark several extractive and abstractive models. We demonstrate that the dataset constitutes a valid benchmark for Russian text summarization methods. Additionally, we show that the pretrained mBART model is useful for Russian text summarization.
The new era of technology has brought us to a point where it is convenient for people to share their opinions across an abundance of platforms. These platforms allow users to express themselves in multiple forms of representation, including text, images, videos, and audio. This, however, makes it difficult for users to obtain all the key information about a topic, making the task of automatic multi-modal summarization (MMS) essential. In this paper, we present a comprehensive survey of the existing research in the area of MMS.
Providing pretrained language models with simple task descriptions or prompts in natural language yields impressive few-shot results for a wide range of text classification tasks when combined with gradient-based learning from examples. In this paper, we show that the underlying idea can also be applied to text generation tasks: we adapt Pattern-Exploiting Training (PET), a recently proposed few-shot approach, for fine-tuning generative language models on text generation tasks. On several text summarization and headline generation datasets, our proposed variant of PET gives consistent improvements over a strong baseline in few-shot settings.
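To make the prompt-based fine-tuning idea concrete, here is a minimal sketch of training a generative model on a summarization example with a natural-language task description prepended to the input. The prompt wording, the t5-small checkpoint, and the hyperparameters are illustrative assumptions; PET additionally ensembles multiple patterns and distills them, which is not shown.

```python
# Minimal sketch: fine-tune a seq2seq LM on a prompted summarization example.
# The prompt text and checkpoint are assumptions, not the paper's patterns.
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

PATTERN = "Summarize the following article in one sentence: {text}"

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

def few_shot_step(article: str, summary: str) -> float:
    """One gradient step on a single (article, summary) pair."""
    inputs = tokenizer(PATTERN.format(text=article), return_tensors="pt",
                       truncation=True, max_length=512)
    labels = tokenizer(summary, return_tensors="pt",
                       truncation=True, max_length=64).input_ids
    loss = model(**inputs, labels=labels).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()
```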
Recent work pre-training Transformers with self-supervised objectives on large text corpora has shown great success when fine-tuned on downstream NLP tasks, including text summarization. However, pre-training objectives tailored for abstractive text summarization have not been explored. Furthermore, there is a lack of systematic evaluation across diverse domains. In this work, we propose pre-training large Transformer-based encoder-decoder models on massive text corpora with a new self-supervised objective. In PEGASUS, important sentences are removed/masked from an input document and are generated together as one output sequence from the remaining sentences, similar to an extractive summary. We evaluated our best PEGASUS model on 12 downstream summarization tasks spanning news, science, stories, instructions, emails, patents, and legislative bills. Experiments demonstrate that it achieves state-of-the-art performance on all 12 downstream datasets as measured by ROUGE scores. Our model also shows surprising performance on low-resource summarization, surpassing previous state-of-the-art results on 6 datasets with only 1000 examples. Finally, we validated our results using human evaluation and show that our model summaries achieve human-level performance on multiple datasets.
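As an illustration of the gap-sentence idea described above, the sketch below selects the sentences that overlap most with the rest of the document, masks them in the input, and concatenates them into the generation target. The unigram-overlap scoring and the mask token are simplifying assumptions, not PEGASUS's exact ROUGE-based selection.

```python
# Minimal sketch of a gap-sentence-generation style objective: pick the
# sentences that overlap most with the rest of the document, replace them
# with a mask token in the source, and use them as the generation target.
from collections import Counter

MASK = "<mask_1>"

def gap_sentence_pairs(sentences, ratio=0.3):
    def score(i):
        sent = Counter(sentences[i].lower().split())
        rest = Counter(w for j, s in enumerate(sentences) if j != i
                       for w in s.lower().split())
        overlap = sum(min(c, rest[w]) for w, c in sent.items())
        return overlap / max(sum(sent.values()), 1)

    k = max(1, int(len(sentences) * ratio))
    selected = set(sorted(range(len(sentences)), key=score, reverse=True)[:k])
    source = " ".join(MASK if i in selected else s
                      for i, s in enumerate(sentences))
    target = " ".join(sentences[i] for i in sorted(selected))
    return source, target
```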
Bidirectional Encoder Representations from Transformers (BERT) represents the latest incarnation of pretrained language models which have recently advanced a wide range of natural language processing tasks. In this paper, we showcase how BERT can be usefully applied in text summarization and propose a general framework for both extractive and abstractive models. We introduce a novel document-level encoder based on BERT which is able to express the semantics of a document and obtain representations for its sentences. Our extractive model is built on top of this encoder by stacking several inter-sentence Transformer layers. For abstractive summarization, we propose a new fine-tuning schedule which adopts different optimizers for the encoder and the decoder as a means of alleviating the mismatch between the two (the former is pretrained while the latter is not). We also demonstrate that a two-staged fine-tuning approach can further boost the quality of the generated summaries. Experiments on three datasets show that our model achieves state-of-the-art results across the board in both extractive and abstractive settings. Our code is available at https://github.com/nlpyang/PreSumm
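A minimal sketch of the two-optimizer schedule is shown below: the pretrained encoder receives a smaller learning rate and a longer warmup than the randomly initialized decoder. The parameter-name prefixes, learning rates, and warmup steps are illustrative assumptions rather than the paper's exact settings.

```python
# Minimal sketch: separate Adam optimizers and warmup schedules for the
# pretrained encoder and the randomly initialized decoder.
import torch

def build_optimizers(model, enc_lr=2e-5, dec_lr=1e-4,
                     enc_warmup=20000, dec_warmup=10000):
    # Assumes parameter names are prefixed "encoder"/"decoder".
    enc_params = [p for n, p in model.named_parameters() if n.startswith("encoder")]
    dec_params = [p for n, p in model.named_parameters() if n.startswith("decoder")]
    enc_opt = torch.optim.Adam(enc_params, lr=enc_lr)
    dec_opt = torch.optim.Adam(dec_params, lr=dec_lr)

    def noam(warmup):
        # Linear warmup followed by inverse square-root decay.
        return lambda step: min((step + 1) ** -0.5, (step + 1) * warmup ** -1.5)

    enc_sched = torch.optim.lr_scheduler.LambdaLR(enc_opt, noam(enc_warmup))
    dec_sched = torch.optim.lr_scheduler.LambdaLR(dec_opt, noam(dec_warmup))
    return (enc_opt, dec_opt), (enc_sched, dec_sched)
```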
BERT, a pre-trained Transformer model, has achieved ground-breaking performance on multiple NLP tasks. In this paper, we describe BERTSUM, a simple variant of BERT, for extractive summarization. Our system is the state of the art on the CNN/Dailymail dataset, outperforming the previous best-performing system by 1.65 on ROUGE-L. The code to reproduce our results is available at https://github.com/nlpyang/BertSum
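The extractive scoring step can be sketched roughly as follows: per-sentence [CLS] vectors produced by BERT are passed through a few inter-sentence Transformer layers, and a sigmoid classifier scores each sentence for inclusion in the summary. Layer counts and dimensions are assumptions for illustration, not the paper's exact configuration.

```python
# Minimal sketch: score sentences for extraction from their [CLS] vectors.
import torch
import torch.nn as nn

class SentenceScorer(nn.Module):
    def __init__(self, hidden=768, layers=2, heads=8):
        super().__init__()
        block = nn.TransformerEncoderLayer(d_model=hidden, nhead=heads,
                                           batch_first=True)
        self.inter_sentence = nn.TransformerEncoder(block, num_layers=layers)
        self.score = nn.Linear(hidden, 1)

    def forward(self, cls_vectors):
        # cls_vectors: (batch, num_sentences, hidden) from a BERT encoder.
        h = self.inter_sentence(cls_vectors)
        return torch.sigmoid(self.score(h)).squeeze(-1)  # (batch, num_sentences)
```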
Automatic summarization of natural language is a current topic in computer science research and industry, studied for decades because of its usefulness across multiple domains. For example, summarization is necessary to create reviews such as this one. Research and applications have achieved some success in extractive summarization (where key sentences are curated); however, abstractive summarization (synthesis and re-stating) is a hard problem and remains generally unsolved in computer science. This literature review contrasts historical progress up through the current state of the art, comparing dimensions such as: extractive vs. abstractive, supervised vs. unsupervised, NLP (Natural Language Processing) vs. knowledge-based, deep learning vs. algorithmic, structured vs. unstructured sources, and measurement metrics such as ROUGE and BLEU. Multiple dimensions are contrasted since current research uses combinations of approaches, as seen in the review matrix. Throughout this summary, synthesis and critique are provided. This review concludes with insights for improved abstractive summarization measurement, with surprising implications for detecting understanding and comprehension in general.
Abstractive text summarization is the task of compressing and rewriting a long document into a short summary while maintaining saliency, directed logical entailment, and non-redundancy. In this work, we address these three important aspects of a good summary via a reinforcement learning approach with two novel reward functions, ROUGESal and Entail, on top of a coverage-based baseline. The ROUGESal reward modifies the ROUGE metric by up-weighting the salient phrases/words detected via a keyphrase classifier. The Entail reward gives high (length-normalized) scores to logically entailed summaries using an entailment classifier. Further, we show superior performance improvements when these rewards are combined with traditional metric-based (ROUGE) rewards, via our novel and effective multi-reward approach of optimizing multiple rewards simultaneously in alternate mini-batches. Our method achieves new state-of-the-art results on the CNN/Daily Mail dataset as well as strong improvements in a test-only transfer setup on DUC-2002.
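A rough sketch of the alternating multi-reward training loop is given below: each mini-batch is optimized with a single reward, cycling through the reward list, using a self-critical policy-gradient loss. The `sample_and_greedy` method, `batch.refs` field, and the reward callables are hypothetical placeholders standing in for the paper's ROUGE, ROUGESal, and Entail implementations.

```python
# Minimal sketch: alternate which reward drives the policy-gradient loss on
# successive mini-batches instead of mixing rewards into one weighted sum.
def train(model, batches, rewards, optimizer):
    for step, batch in enumerate(batches):
        reward_fn = rewards[step % len(rewards)]  # e.g. [rouge, rougesal, entail]
        # Hypothetical helper: returns sampled summaries, greedy summaries,
        # and sequence-level log-probabilities of the sampled summaries.
        sampled, greedy, log_probs = model.sample_and_greedy(batch)
        # Self-critical baseline: advantage = sampled reward - greedy reward.
        advantage = reward_fn(sampled, batch.refs) - reward_fn(greedy, batch.refs)
        loss = -(advantage * log_probs).mean()
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```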
We present deep communicating agents in an encoder-decoder architecture to address the challenges of representing a long document for abstractive summarization. With deep communicating agents, the task of encoding a long text is divided across multiple collaborating agents, each in charge of a subsection of the input text. These encoders are connected to a single decoder, trained end-to-end using reinforcement learning to generate a focused and coherent summary. Empirical results demonstrate that multiple communicating encoders lead to a higher quality summary compared to several strong baselines, including those based on a single encoder or multiple non-communicating encoders.
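To illustrate the communicating-encoder idea, the sketch below encodes each document chunk with its own agent and mixes in the mean of the other agents' summary vectors as a message before producing the representations consumed by a single decoder. The layer types, sizes, and message function are assumptions; the reinforcement-learning training of the original model is not shown.

```python
# Minimal sketch: multiple encoders, each over one chunk, exchanging a
# simple mean-pooled "message" before feeding a shared decoder.
import torch
import torch.nn as nn

class CommunicatingEncoders(nn.Module):
    def __init__(self, vocab_size, hidden=256, num_agents=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.agents = nn.ModuleList(
            nn.LSTM(hidden, hidden, batch_first=True) for _ in range(num_agents))
        self.fuse = nn.Linear(2 * hidden, hidden)

    def forward(self, chunks):
        # chunks: list of (batch, seq_len) token-id tensors, one per agent.
        outputs, summaries = [], []
        for agent, chunk in zip(self.agents, chunks):
            out, (h, _) = agent(self.embed(chunk))
            outputs.append(out)                       # (batch, seq, hidden)
            summaries.append(h[-1])                   # (batch, hidden)
        fused = []
        for i, out in enumerate(outputs):
            others = [s for j, s in enumerate(summaries) if j != i]
            message = torch.stack(others).mean(dim=0)             # (batch, hidden)
            message = message.unsqueeze(1).expand(-1, out.size(1), -1)
            fused.append(torch.tanh(self.fuse(torch.cat([out, message], dim=-1))))
        return torch.cat(fused, dim=1)  # contexts attended by a single decoder
```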
We show that generating English Wikipedia articles can be approached as multi-document summarization of source documents. We use extractive summarization to coarsely identify salient information and a neural abstractive model to generate the article. For the abstractive model, we introduce a decoder-only architecture that can scalably attend to very long sequences, much longer than the typical encoder-decoder architectures used in sequence transduction. We show that this model can generate fluent, coherent multi-sentence paragraphs and even whole Wikipedia articles. When given reference documents, we show it can extract relevant factual information, as reflected in perplexity, ROUGE scores, and human evaluations.
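The extract-then-abstract pipeline can be sketched as follows: a cheap ranking step selects the source paragraphs most related to the article title up to a token budget, and only that text is handed to the abstractive generator. The word-overlap ranking and the `generate` callable are placeholders for the paper's tf-idf-style extractors and decoder-only Transformer.

```python
# Minimal sketch: rank source paragraphs against the title, keep the top
# ones within a word budget, and pass them to an abstractive generator.
def extract_then_abstract(title, paragraphs, generate, budget=1000):
    query = set(title.lower().split())

    def score(p):
        words = p.lower().split()
        return sum(w in query for w in words) / max(len(words), 1)

    ranked = sorted(paragraphs, key=score, reverse=True)
    context, used = [], 0
    for p in ranked:
        length = len(p.split())
        if used + length > budget:
            break
        context.append(p)
        used += length
    # `generate` is an assumed callable wrapping the abstractive model.
    return generate(title, " ".join(context))
```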
While advances in computing resources have made processing enormous amounts of data possible, human ability to identify patterns in such data has not scaled accordingly. Thus, efficient computational methods for condensing and simplifying data are becoming vital for extracting actionable insights. In particular, while data summarization techniques have been studied extensively, only recently has summarizing interconnected data, or graphs, become popular. This survey is a structured, comprehensive overview of the state-of-the-art methods for summarizing graph data. We first broach the motivation behind and the challenges of graph summarization. We then categorize summarization approaches by the type of graphs taken as input and further organize each category by core methodology. Finally, we discuss applications of summarization on real-world graphs and conclude by describing some open problems in the field.