亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tfoot id='7k7sv'></tfoot>

<legend id='7k7sv'><style id='7k7sv'><dir id='7k7sv'><q id='7k7sv'></q></dir></style></legend>

<i id='7k7sv'><tr id='7k7sv'><dt id='7k7sv'><q id='7k7sv'><span id='7k7sv'><b id='7k7sv'><form id='7k7sv'><ins id='7k7sv'></ins><ul id='7k7sv'></ul><sub id='7k7sv'></sub></form><legend id='7k7sv'></legend><bdo id='7k7sv'><pre id='7k7sv'><center id='7k7sv'></center></pre></bdo></b><th id='7k7sv'></th></span></q></dt></tr></i><div id='7k7sv'><tfoot id='7k7sv'></tfoot><dl id='7k7sv'><fieldset id='7k7sv'></fieldset></dl></div>

·

entity · 命名實體識別 · CASE · 可約的 · 學成 ·

2021 年 11 月 15 日

Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition

Xin Zhang,Guangwei Xu,Yueheng Sun,Meishan Zhang,Pengjun Xie

from arxiv, ACL-IJCNLP 2021 main conf, long paper; corrected the wrong reference for "argument retrieval" in first paragraph of Introduction

Crowdsourcing is regarded as one prospective solution for effective supervised learning, aiming to build large-scale annotated training data by crowd workers. Previous studies focus on reducing the influences from the noises of the crowdsourced annotations for supervised models. We take a different point in this work, regarding all crowdsourced annotations as gold-standard with respect to the individual annotators. In this way, we find that crowdsourcing could be highly similar to domain adaptation, and then the recent advances of cross-domain methods can be almost directly applied to crowdsourcing. Here we take named entity recognition (NER) as a study case, suggesting an annotator-aware representation learning model that inspired by the domain adaptation methods which attempt to capture effective domain-aware features. We investigate both unsupervised and supervised crowdsourcing learning, assuming that no or only small-scale expert annotations are available. Experimental results on a benchmark crowdsourced NER dataset show that our method is highly effective, leading to a new state-of-the-art performance. In addition, under the supervised setting, we can achieve impressive performance gains with only a very small scale of expert annotations.

相關內容

entity

主動學習 · 學成 · 樣例 · 未標記 · 高斯混合（模型） ·

2022 年 1 月 18 日

Active Learning for Open-set Annotation

Kun-Peng Ning,Xun Zhao,Yu Li,Sheng-Jun Huang

Existing active learning studies typically work in the closed-set setting by assuming that all data examples to be labeled are drawn from known classes. However, in real annotation tasks, the unlabeled data usually contains a large amount of examples from unknown classes, resulting in the failure of most active learning methods. To tackle this open-set annotation (OSA) problem, we propose a new active learning framework called LfOSA, which boosts the classification performance with an effective sampling strategy to precisely detect examples from known classes for annotation. The LfOSA framework introduces an auxiliary network to model the per-example max activation value (MAV) distribution with a Gaussian Mixture Model, which can dynamically select the examples with highest probability from known classes in the unlabeled set. Moreover, by reducing the temperature $T$ of the loss function, the detection model will be further optimized by exploiting both known and unknown supervision. The experimental results show that the proposed method can significantly improve the selection quality of known classes, and achieve higher classification accuracy with lower annotation cost than state-of-the-art active learning methods. To the best of our knowledge, this is the first work of active learning for open-set annotation.

UDA · domain shift · 未標記 · state-of-the-art · 標注 ·

2021 年 12 月 13 日

A Survey of Unsupervised Domain Adaptation for Visual Recognition

While huge volumes of unlabeled data are generated and made available in many domains, the demand for automated understanding of visual data is higher than ever before. Most existing machine learning models typically rely on massive amounts of labeled training data to achieve high performance. Unfortunately, such a requirement cannot be met in real-world applications. The number of labels is limited and manually annotating data is expensive and time-consuming. It is often necessary to transfer knowledge from an existing labeled domain to a new domain. However, model performance degrades because of the differences between domains (domain shift or dataset bias). To overcome the burden of annotation, Domain Adaptation (DA) aims to mitigate the domain shift problem when transferring knowledge from one domain into another similar but different domain. Unsupervised DA (UDA) deals with a labeled source domain and an unlabeled target domain. The principal objective of UDA is to reduce the domain discrepancy between the labeled source data and unlabeled target data and to learn domain-invariant representations across the two domains during training. In this paper, we first define UDA problem. Secondly, we overview the state-of-the-art methods for different categories of UDA from both traditional methods and deep learning based methods. Finally, we collect frequently used benchmark datasets and report results of the state-of-the-art methods of UDA on visual recognition problem.

entity · 未標記 · 命名實體識別 · Performer · MoDELS ·

2020 年 12 月 14 日

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li,Lemao Liu,Shuming Shi

In many scenarios, named entity recognition (NER) models severely suffer from unlabeled entity problem, where the entities of a sentence may not be fully annotated. Through empirical studies performed on synthetic datasets, we find two causes of the performance degradation. One is the reduction of annotated entities and the other is treating unlabeled entities as negative instances. The first cause has less impact than the second one and can be mitigated by adopting pretraining language models. The second cause seriously misguides a model in training and greatly affects its performances. Based on the above observations, we propose a general approach that is capable of eliminating the misguidance brought by unlabeled entities. The core idea is using negative sampling to keep the probability of training with unlabeled entities at a very low level. Experiments on synthetic datasets and real-world datasets show that our model is robust to unlabeled entity problem and surpasses prior baselines. On well-annotated datasets, our model is competitive with state-of-the-art method.

entity · 命名實體識別 · MoDELS · SOTA · 無監督 ·

2019 年 11 月 22 日

Zero-Resource Cross-Lingual Named Entity Recognition

M Saiful Bari,Shafiq Joty,Prathyusha Jwalapuram

Recently, neural methods have achieved state-of-the-art (SOTA) results in Named Entity Recognition (NER) tasks for many languages without the need for manually crafted features. However, these models still require manually annotated training data, which is not available for many languages. In this paper, we propose an unsupervised cross-lingual NER model that can transfer NER knowledge from one language to another in a completely unsupervised way without relying on any bilingual dictionary or parallel data. Our model achieves this through word-level adversarial learning and augmented fine-tuning with parameter sharing and feature augmentation. Experiments on five different languages demonstrate the effectiveness of our approach, outperforming existing models by a good margin and setting a new SOTA for each language pair.

entity · 命名實體識別 · CASE · Extensibility · MoDELS ·

2019 年 11 月 14 日

Enhanced Meta-Learning for Cross-lingual Named Entity Recognition with Minimal Resources

Qianhui Wu,Zijia Lin,Guoxin Wang,Hui Chen,B?rje F. Karlsson,Biqing Huang,Chin-Yew Lin

from arxiv, This paper is accepted by AAAI2020

For languages with no annotated resources, transferring knowledge from rich-resource languages is an effective solution for named entity recognition (NER). While all existing methods directly transfer from source-learned model to a target language, in this paper, we propose to fine-tune the learned model with a few similar examples given a test case, which could benefit the prediction by leveraging the structural and semantic information conveyed in such similar examples. To this end, we present a meta-learning algorithm to find a good model parameter initialization that could fast adapt to the given test case and propose to construct multiple pseudo-NER tasks for meta-training by computing sentence similarities. To further improve the model's generalization ability across different languages, we introduce a masking scheme and augment the loss function with an additional maximum term during meta-training. We conduct extensive experiments on cross-lingual named entity recognition with minimal resources over five target languages. The results show that our approach significantly outperforms existing state-of-the-art methods across the board.

Performer · 目標領域 · MoDELS · 機器閱讀理解 · Extensibility ·

2019 年 11 月 13 日

Unsupervised Domain Adaptation on Reading Comprehension

Yu Cao,Meng Fang,Baosheng Yu,Joey Tianyi Zhou

from arxiv, 9 pages, 6 figures, 5 tables, Accepted by AAAI 2020

Reading comprehension (RC) has been studied in a variety of datasets with the boosted performance brought by deep neural networks. However, the generalization capability of these models across different domains remains unclear. To alleviate this issue, we are going to investigate unsupervised domain adaptation on RC, wherein a model is trained on labeled source domain and to be applied to the target domain with only unlabeled samples. We first show that even with the powerful BERT contextual representation, the performance is still unsatisfactory when the model trained on one dataset is directly applied to another target dataset. To solve this, we provide a novel conditional adversarial self-training method (CASe). Specifically, our approach leverages a BERT model fine-tuned on the source dataset along with the confidence filtering to generate reliable pseudo-labeled samples in the target domain for self-training. On the other hand, it further reduces domain distribution discrepancy through conditional adversarial learning across domains. Extensive experiments show our approach achieves comparable accuracy to supervised models on multiple large-scale benchmark datasets.

entity · 命名實體識別 · Performer · 學成 · state-of-the-art ·

2019 年 7 月 18 日

Joint Learning of Named Entity Recognition and Entity Linking

Pedro Henrique Martins,Zita Marinho,André F. T. Martins

Named entity recognition (NER) and entity linking (EL) are two fundamentally related tasks, since in order to perform EL, first the mentions to entities have to be detected. However, most entity linking approaches disregard the mention detection part, assuming that the correct mentions have been previously detected. In this paper, we perform joint learning of NER and EL to leverage their relatedness and obtain a more robust and generalisable system. For that, we introduce a model inspired by the Stack-LSTM approach (Dyer et al., 2015). We observe that, in fact, doing multi-task learning of NER and EL improves the performance in both tasks when comparing with models trained with individual objectives. Furthermore, we achieve results competitive with the state-of-the-art in both NER and EL.

entity · 命名實體識別 · 遷移學習 · 參數共享 · 學成 ·

2018 年 12 月 13 日

Dynamic Transfer Learning for Named Entity Recognition

Parminder Bhatia,Kristjan Arumae,Busra Celikkaya

from arxiv, AAAI 2019 Workshop on Health Intelligence

State-of-the-art named entity recognition (NER) systems have been improving continuously using neural architectures over the past several years. However, many tasks including NER require large sets of annotated data to achieve such performance. In particular, we focus on NER from clinical notes, which is one of the most fundamental and critical problems for medical text analysis. Our work centers on effectively adapting these neural architectures towards low-resource settings using parameter transfer methods. We complement a standard hierarchical NER model with a general transfer learning framework consisting of parameter sharing between the source and target tasks, and showcase scores significantly above the baseline architecture. These sharing schemes require an exponential search over tied parameter sets to generate an optimal configuration. To mitigate the problem of exhaustively searching for model optimization, we propose the Dynamic Transfer Networks (DTN), a gated architecture which learns the appropriate parameter sharing scheme between source and target datasets. DTN achieves the improvements of the optimized transfer learning framework with just a single training setting, effectively removing the need for exponential search.

entity · Performer · 命名實體識別 · state-of-the-art · 主動學習 ·

2018 年 2 月 4 日

Deep Active Learning for Named Entity Recognition

Yanyao Shen,Hyokun Yun,Zachary C. Lipton,Yakov Kronrod,Animashree Anandkumar

Deep learning has yielded state-of-the-art performance on many natural language processing tasks including named entity recognition (NER). However, this typically requires large amounts of labeled data. In this work, we demonstrate that the amount of labeled training data can be drastically reduced when deep learning is combined with active learning. While active learning is sample-efficient, it can be computationally expensive since it requires iterative retraining. To speed this up, we introduce a lightweight architecture for NER, viz., the CNN-CNN-LSTM model consisting of convolutional character and word encoders and a long short term memory (LSTM) tag decoder. The model achieves nearly state-of-the-art performance on standard datasets for the task while being computationally much more efficient than best performing models. We carry out incremental active learning, during the training process, and are able to nearly match state-of-the-art performance with just 25\% of the original training data.

entity · MoDELS · Neural Networks · state-of-the-art · 命名實體識別 ·

2018 年 1 月 30 日

Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning

Xuan Wang,Yu Zhang,Xiang Ren,Yuhao Zhang,Marinka Zitnik,Jingbo Shang,Curtis Langlotz,Jiawei Han

Motivation: Biomedical named entity recognition (BioNER) is the most fundamental task in biomedical text mining. State-of-the-art BioNER systems often require handcrafted features specifically designed for each type of biomedical entities. This feature generation process requires intensive labors from biomedical and linguistic experts, and makes it difficult to adapt these systems to new biomedical entity types. Although recent studies explored using neural network models for BioNER to free experts from manual feature generation, these models still require substantial human efforts to annotate massive training data. Results: We propose a multi-task learning framework for BioNER that is based on neural network models to save human efforts. We build a global model by collectively training multiple models that share parameters, each model capturing the characteristics of a different biomedical entity type. In experiments on five BioNER benchmark datasets covering four major biomedical entity types, our model outperforms state-of-the-art systems and other neural network models by a large margin, even when only limited training data are available. Further analysis shows that the large performance gains come from sharing character- and word-level information between different biomedical entities. The approach creates new opportunities for text-mining approaches to help biomedical scientists better exploit knowledge in biomedical literature.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

命名實體識別

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<dir id='7k7sv'><del id='7k7sv'><del id='7k7sv'></del><pre id='7k7sv'><pre id='7k7sv'><option id='7k7sv'><address id='7k7sv'></address><bdo id='7k7sv'><tr id='7k7sv'><acronym id='7k7sv'><pre id='7k7sv'></pre></acronym><div id='7k7sv'></div></tr></bdo></option></pre><small id='7k7sv'><address id='7k7sv'><u id='7k7sv'><legend id='7k7sv'><option id='7k7sv'><abbr id='7k7sv'></abbr><li id='7k7sv'><pre id='7k7sv'></pre></li></option></legend><select id='7k7sv'></select></u></address></small></pre></del><sup id='7k7sv'></sup><blockquote id='7k7sv'><dt id='7k7sv'></dt></blockquote><blockquote id='7k7sv'></blockquote></dir><tt id='7k7sv'></tt><u id='7k7sv'><tt id='7k7sv'><form id='7k7sv'></form></tt><td id='7k7sv'><dt id='7k7sv'></dt></td></u>

<code id='7k7sv'><i id='7k7sv'><q id='7k7sv'><legend id='7k7sv'><pre id='7k7sv'><style id='7k7sv'><acronym id='7k7sv'><i id='7k7sv'><form id='7k7sv'><option id='7k7sv'><center id='7k7sv'></center></option></form></i></acronym></style><tt id='7k7sv'></tt></pre></legend></q></i></code><center id='7k7sv'></center>

<dd id='7k7sv'></dd>

<style id='7k7sv'></style><sub id='7k7sv'><dfn id='7k7sv'><abbr id='7k7sv'><big id='7k7sv'><bdo id='7k7sv'></bdo></big></abbr></dfn></sub>_{<dir id='7k7sv'></dir>}