
state-of-the-art · learning · Performer · ROUGE · MoDELS

July 24, 2021

Generative Pretraining for Paraphrase Evaluation

Jack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed

from arxiv, Under review

We introduce ParaBLEU, a paraphrase representation learning model and evaluation metric for text generation. Unlike previous approaches, ParaBLEU learns to understand paraphrasis using generative conditioning as a pretraining objective. ParaBLEU correlates more strongly with human judgements than existing metrics, obtaining new state-of-the-art results on the 2017 WMT Metrics Shared Task. We show that our model is robust to data scarcity, exceeding previous state-of-the-art performance using only $50\%$ of the available training data and surpassing BLEU, ROUGE and METEOR with only $40$ labelled examples. Finally, we demonstrate that ParaBLEU can be used to conditionally generate novel paraphrases from a single demonstration, which we use to confirm our hypothesis that it learns abstract, generalized paraphrase representations.

Related content

state-of-the-art

importance sampling · estimation/estimator · sample · MoDELS · unlabelled

September 24, 2021

Sample Efficient Model Evaluation

Emine Yilmaz, Peter Hayes, Raza Habib, Jordan Burgess, David Barber

Labelling data is a major practical bottleneck in training and testing classifiers. Given a collection of unlabelled data points, we address how to select which subset to label to best estimate test metrics such as accuracy, $F_1$ score or micro/macro $F_1$. We consider two sampling-based approaches: the well-known Importance Sampling and a novel application of Poisson Sampling. For both approaches we derive the minimal-error sampling distributions and show how to approximate and use them to form estimators and confidence intervals. We show that Poisson Sampling outperforms Importance Sampling both theoretically and experimentally.
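As a rough illustration of the importance-sampling idea in this abstract, the sketch below estimates accuracy by labelling only a sampled subset of a pool. It is a minimal sketch under stated assumptions, not the authors' method: the function name, the `correct` list standing in for a labelling oracle, and the hand-picked `proposal` distribution are all hypothetical, and the minimal-error distributions derived in the paper are not reproduced here.

```python
import random


def importance_sampling_accuracy(correct, proposal, num_labels, rng):
    """Estimate mean(correct) from `num_labels` draws taken from `proposal`.

    correct  : list of 0/1 flags; a stand-in for asking an annotator (assumption)
    proposal : sampling probabilities q_i over the pool (positive, summing to 1)
    """
    n = len(correct)
    # Draw indices with replacement according to the proposal distribution.
    indices = rng.choices(range(n), weights=proposal, k=num_labels)
    # Reweight each draw by 1 / (n * q_i) so the estimator stays unbiased.
    return sum(correct[i] / (n * proposal[i]) for i in indices) / num_labels


# Hypothetical pool: 150 correctly classified points followed by 50 errors.
pool = [1] * 150 + [0] * 50
# Hypothetical proposal that samples the last 50 points twice as often.
q = [1 / 250] * 150 + [2 / 250] * 50
estimate = importance_sampling_accuracy(pool, q, 50, random.Random(0))
print(f"estimated accuracy from 50 labels: {estimate:.3f}")
```

With a uniform proposal the estimator reduces to the plain average of the sampled labels; a non-uniform proposal changes the variance but not the expected value.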

estimation/estimator · machine translation · correlation coefficient · mean · Performer

September 22, 2021

Pushing the Right Buttons: Adversarial Evaluation of Quality Estimation

Diptesh Kanojia, Marina Fomicheva, Tharindu Ranasinghe, Frédéric Blain, Constantin Orăsan, Lucia Specia

from arxiv, Accepted to WMT 2021 Conference co-located with EMNLP 2021. 14 pages with a 4 page appendix

Current Machine Translation (MT) systems achieve very good results on a growing variety of language pairs and datasets. However, they are known to produce fluent translation outputs that can contain important meaning errors, thus undermining their reliability in practice. Quality Estimation (QE) is the task of automatically assessing the performance of MT systems at test time. Thus, in order to be useful, QE systems should be able to detect such errors. However, this ability is yet to be tested in the current evaluation practices, where QE systems are assessed only in terms of their correlation with human judgements. In this work, we bridge this gap by proposing a general methodology for adversarial testing of QE for MT. First, we show that despite a high correlation with human judgements achieved by the recent SOTA, certain types of meaning errors are still problematic for QE to detect. Second, we show that on average, the ability of a given model to discriminate between meaning-preserving and meaning-altering perturbations is predictive of its overall performance, thus potentially allowing for comparing QE systems without relying on manual quality annotation.
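The correlation-based assessment mentioned in this abstract can be illustrated with a Pearson correlation between system scores and human judgements. This is only a sketch: the helper name and the score lists are hypothetical, and the abstract does not say which correlation coefficient the authors use.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical QE scores and human quality ratings for five translations.
qe_scores = [0.81, 0.40, 0.65, 0.92, 0.30]
human_scores = [78.0, 35.0, 70.0, 88.0, 41.0]
print(f"Pearson r = {pearson(qe_scores, human_scores):.3f}")
```

A high coefficient on such data says nothing about whether individual meaning errors are detected, which is the gap the abstract describes.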

language modeling · entity · MoDELS · posterior distribution · graph

August 21, 2019

Latent Relation Language Models

Hiroaki Hayashi, Zecong Hu, Chenyan Xiong, Graham Neubig

In this paper, we propose Latent Relation Language Models (LRLMs), a class of language models that parameterizes the joint distribution over the words in a document and the entities that occur therein via knowledge graph relations. This model has a number of attractive properties: it not only improves language modeling performance, but is also able to annotate the posterior probability of entity spans for a given text through relations. Experiments demonstrate empirical improvements over both a word-based baseline language model and a previous approach that incorporates knowledge graph information. Qualitative analysis further demonstrates the proposed model's ability to learn to predict appropriate relations in context.

BERT · similarity · tokenizer · image captioning · Better

April 21, 2019

BERTScore: Evaluating Text Generation with BERT

Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, Yoav Artzi

from arxiv, Code available at https://github.com/Tiiiger/bert_score

We propose BERTScore, an automatic evaluation metric for text generation. Analogous to common metrics, BERTScore computes a similarity score for each token in the candidate sentence with each token in the reference. However, instead of looking for exact matches, we compute similarity using contextualized BERT embeddings. We evaluate on several machine translation and image captioning benchmarks, and show that BERTScore correlates better with human judgments than existing metrics, often significantly outperforming even task-specific supervised metrics.
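The token-to-token similarity described above can be sketched with cosine similarity and greedy matching. This is a toy illustration only: the two-dimensional vectors stand in for contextualized BERT embeddings, the function names are hypothetical, and aggregating by the best match per token into precision, recall and F1 is an assumption about how the pairwise scores are combined, since the abstract does not spell it out.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def greedy_match_scores(candidate_vecs, reference_vecs):
    """Return (precision, recall, f1) from best-match cosine similarities."""
    # sim[i][j] is the similarity of candidate token i and reference token j.
    sim = [[cosine(c, r) for r in reference_vecs] for c in candidate_vecs]
    # Precision: each candidate token is matched to its most similar reference token.
    precision = sum(max(row) for row in sim) / len(candidate_vecs)
    # Recall: each reference token is matched to its most similar candidate token.
    recall = sum(
        max(sim[i][j] for i in range(len(candidate_vecs)))
        for j in range(len(reference_vecs))
    ) / len(reference_vecs)
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Toy embeddings (assumption): two candidate tokens, one reference token.
candidate = [[1.0, 0.0], [0.0, 1.0]]
reference = [[1.0, 0.0]]
print(greedy_match_scores(candidate, reference))
```

Because matching is soft, a candidate token that is close to a reference token in embedding space still contributes, which is where the approach departs from exact-match metrics.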

image captioning · entity · LSTM · MoDELS · unimodal

November 7, 2018

Entity-aware Image Caption Generation

Di Lu, Spencer Whitehead, Lifu Huang, Heng Ji, Shih-Fu Chang

from arxiv, In proceedings of EMNLP 2018

Current image captioning approaches generate descriptions which lack specific information, such as named entities that are involved in the images. In this paper we propose a new task which aims to generate informative image captions, given images and hashtags as input. We propose a simple but effective approach to tackle this problem. We first train a convolutional neural network and long short-term memory network (CNN-LSTM) model to generate a template caption based on the input image. Then we use a knowledge-graph-based collective inference algorithm to fill in the template with specific named entities retrieved via the hashtags. Experiments on a new benchmark dataset collected from Flickr show that our model generates news-style image descriptions with much richer information. Our model significantly outperforms unimodal baselines on various evaluation metrics.

learning · reinforcement learning · deep reinforcement learning · MoDELS · inverse reinforcement learning

August 23, 2018

Paraphrase Generation with Deep Reinforcement Learning

Zichao Li, Xin Jiang, Lifeng Shang, Hang Li

from arxiv, EMNLP 2018

Automatic generation of paraphrases from a given sentence is an important yet challenging task in natural language processing (NLP), and plays a key role in a number of applications such as question answering, search, and dialogue. In this paper, we present a deep reinforcement learning approach to paraphrase generation. Specifically, we propose a new framework for the task, which consists of a generator and an evaluator, both of which are learned from data. The generator, built as a sequence-to-sequence learning model, can produce paraphrases given a sentence. The evaluator, constructed as a deep matching model, can judge whether two sentences are paraphrases of each other. The generator is first trained by deep learning and then further fine-tuned by reinforcement learning in which the reward is given by the evaluator. For the learning of the evaluator, we propose two methods based on supervised learning and inverse reinforcement learning respectively, depending on the type of available training data. Empirical study shows that the learned evaluator can guide the generator to produce more accurate paraphrases. Experimental results demonstrate the proposed models (the generators) outperform the state-of-the-art methods in paraphrase generation in both automatic evaluation and human evaluation.

MoDELS · estimation/estimator · machine reading comprehension · baseline · natural language processing

May 30, 2018

Neural Models for Key Phrase Detection and Question Generation

Sandeep Subramanian, Tong Wang, Xingdi Yuan, Saizheng Zhang, Yoshua Bengio, Adam Trischler

from arxiv, Machine Reading for Question Answering workshop at ACL 2018

We propose a two-stage neural model to tackle question generation from documents. First, our model estimates the probability that word sequences in a document are ones that a human would pick when selecting candidate answers by training a neural key-phrase extractor on the answers in a question-answering corpus. Predicted key phrases then act as target answers and condition a sequence-to-sequence question-generation model with a copy mechanism. Empirically, our key-phrase extraction model significantly outperforms an entity-tagging baseline and existing rule-based approaches. We further demonstrate that our question generation system formulates fluent, answerable questions from key phrases. This two-stage system could be used to augment or generate reading comprehension datasets, which may be leveraged to improve machine reading systems or in educational settings.

topic model · MoDELS · topic · Performer · correlation coefficient

April 26, 2018

Lessons from the Bible on Modern Topics: Low-Resource Multilingual Topic Model Evaluation

Shudong Hao, Jordan Boyd-Graber, Michael J. Paul

from arxiv, North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), New Orleans, Louisiana. June 2018

Multilingual topic models enable document analysis across languages through coherent multilingual summaries of the data. However, there is no standard and effective metric to evaluate the quality of multilingual topics. We introduce a new intrinsic evaluation of multilingual topic models that correlates well with human judgments of multilingual topic coherence as well as performance in downstream applications. Importantly, we also study evaluation for low-resource languages. Because standard metrics fail to accurately measure topic quality when robust external resources are unavailable, we propose an adaptation model that improves the accuracy and reliability of these metrics in low-resource settings.

generative adversarial networks · feature space · Networking · Gaussian mixture (model) · Gaussian mixture model

March 27, 2018

An Improved Evaluation Framework for Generative Adversarial Networks

Shaohui Liu, Yi Wei, Jiwen Lu, Jie Zhou

from arxiv, 21 pages, 9 figures, 8 tables

In this paper, we propose an improved quantitative evaluation framework for Generative Adversarial Networks (GANs) on generating domain-specific images, where we improve conventional evaluation methods on two levels: the feature representation and the evaluation metric. Unlike most existing evaluation frameworks, which transfer the representation of the ImageNet Inception model to map images onto the feature space, our framework uses a specialized encoder to acquire fine-grained domain-specific representations. Moreover, for datasets with multiple classes, we propose Class-Aware Fréchet Distance (CAFD), which employs a Gaussian mixture model on the feature space to better fit the multi-manifold feature distribution. Experiments and analysis on both the feature level and the image level were conducted to demonstrate improvements of our proposed framework over the recently proposed state-of-the-art FID method. To the best of our knowledge, we are the first to provide counterexamples where FID gives results inconsistent with human judgments. The experiments show that our framework is able to overcome the shortcomings of FID and improves robustness. Code will be made available.
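For readers unfamiliar with the Fréchet distance that FID and CAFD build on, the sketch below computes it between two Gaussians fitted to feature vectors. It makes a strong simplifying assumption, diagonal covariances, under which the squared distance reduces to the sum over dimensions of the squared gap in means plus the squared gap in standard deviations. It is not CAFD and not the full FID (which uses full covariance matrices); the function name and the feature lists are hypothetical.

```python
from statistics import fmean, pstdev


def frechet_distance_diagonal(features_a, features_b):
    """Squared Fréchet distance between per-dimension (diagonal) Gaussian fits.

    features_a, features_b : lists of equal-width feature vectors
    """
    total = 0.0
    for d in range(len(features_a[0])):
        col_a = [row[d] for row in features_a]
        col_b = [row[d] for row in features_b]
        # Squared gap between the fitted means in this dimension.
        mean_gap = fmean(col_a) - fmean(col_b)
        # Squared gap between the fitted (population) standard deviations.
        std_gap = pstdev(col_a) - pstdev(col_b)
        total += mean_gap ** 2 + std_gap ** 2
    return total


# Hypothetical one-dimensional features for real and generated images.
real = [[0.0], [2.0]]
generated = [[0.0], [4.0]]
print(frechet_distance_diagonal(real, generated))
```

A class-aware variant in the spirit of the abstract would fit one Gaussian per class rather than a single Gaussian for the whole dataset; how CAFD combines the per-class terms is not stated here and is left out of the sketch.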

vocabulary · decoding · sampling method · MoDELS · state-of-the-art

November 30, 2017

Neural Response Generation with Dynamic Vocabularies

Yu Wu, Wei Wu, Dejian Yang, Can Xu, Zhoujun Li, Ming Zhou

from arxiv, accepted by AAAI18

We study response generation for open-domain conversation in chatbots. Existing methods assume that words in responses are generated from an identical vocabulary regardless of their inputs, which not only makes them vulnerable to generic patterns and irrelevant noise, but also causes a high cost in decoding. We propose a dynamic vocabulary sequence-to-sequence (DVS2S) model which allows each input to possess its own vocabulary in decoding. In training, vocabulary construction and response generation are jointly learned by maximizing a lower bound of the true objective with a Monte Carlo sampling method. In inference, the model dynamically allocates a small vocabulary for an input with the word prediction model, and conducts decoding only with the small vocabulary. Because of the dynamic vocabulary mechanism, DVS2S avoids many generic patterns and irrelevant words in generation, and enjoys efficient decoding at the same time. Experimental results on both automatic metrics and human annotations show that DVS2S can significantly outperform state-of-the-art methods in terms of response quality, while requiring only 60% of the decoding time of the most efficient baseline.

