亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tfoot id='al2yy'></tfoot>

<legend id='al2yy'><style id='al2yy'><dir id='al2yy'><q id='al2yy'></q></dir></style></legend>

<i id='al2yy'><tr id='al2yy'><dt id='al2yy'><q id='al2yy'><span id='al2yy'><b id='al2yy'><form id='al2yy'><ins id='al2yy'></ins><ul id='al2yy'></ul><sub id='al2yy'></sub></form><legend id='al2yy'></legend><bdo id='al2yy'><pre id='al2yy'><center id='al2yy'></center></pre></bdo></b><th id='al2yy'></th></span></q></dt></tr></i><div id='al2yy'><tfoot id='al2yy'></tfoot><dl id='al2yy'><fieldset id='al2yy'></fieldset></dl></div>

·

Machine Translation · 可辨認的 · 特化 · 數據集 · Pair ·

2023 年 9 月 22 日

Audience-specific Explanations for Machine Translation

Renhan Lou,Jan Niehues

In machine translation, a common problem is that the translation of certain words even if translated can cause incomprehension of the target language audience due to different cultural backgrounds. A solution to solve this problem is to add explanations for these words. In a first step, we therefore need to identify these words or phrases. In this work we explore techniques to extract example explanations from a parallel corpus. However, the sparsity of sentences containing words that need to be explained makes building the training dataset extremely difficult. In this work, we propose a semi-automatic technique to extract these explanations from a large parallel corpus. Experiments on English->German language pair show that our method is able to extract sentence so that more than 10% of the sentences contain explanation, while only 1.9% of the original sentences contain explanations. In addition, experiments on English->French and English->Chinese language pairs also show similar conclusions. This is therefore an essential first automatic step to create a explanation dataset. Furthermore we show that the technique is robust for all three language pairs.

相關內容

Machine Translation

Machine Translation

機器翻譯（Machine Translation）涵蓋計算語言學和語言工程的所有分支，包含多語言方面。特色論文涵蓋理論，描述或計算方面的任何下列主題:雙語和多語語料庫的編寫和使用，計算機輔助語言教學，非羅馬字符集的計算含義，連接主義翻譯方法，對比語言學等。官網地址：

語音翻譯 · Learning · 端到端 · SOTA · state-of-the-art ·

2023 年 11 月 7 日

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

Yuhao Zhang,Chen Xu,Bei Li,Hao Chen,Tong Xiao,Chunliang Zhang,Jingbo Zhu

from arxiv, Accepted to EMNLP2023 main conference

Significant improvements in end-to-end speech translation (ST) have been achieved through the application of multi-task learning. However, the extent to which auxiliary tasks are highly consistent with the ST task, and how much this approach truly helps, have not been thoroughly studied. In this paper, we investigate the consistency between different tasks, considering different times and modules. We find that the textual encoder primarily facilitates cross-modal conversion, but the presence of noise in speech impedes the consistency between text and speech representations. Furthermore, we propose an improved multi-task learning (IMTL) approach for the ST task, which bridges the modal gap by mitigating the difference in length and representation. We conduct experiments on the MuST-C dataset. The results demonstrate that our method attains state-of-the-art results. Moreover, when additional data is used, we achieve the new SOTA result on MuST-C English to Spanish task with 20.8% of the training time required by the current SOTA method.

MoDELS · 知識 (knowledge) · 集成 · Performer · 圖 ·

2023 年 11 月 7 日

Ensembling Textual and Structure-Based Models for Knowledge Graph Completion

Ananjan Nandi,Navdeep Kaur,Parag Singla, Mausam

from arxiv, 9 pages, 2 figures, 9 tables

We consider two popular approaches to Knowledge Graph Completion (KGC): textual models that rely on textual entity descriptions, and structure-based models that exploit the connectivity structure of the Knowledge Graph (KG). Preliminary experiments show that these approaches have complementary strengths: structure-based models perform well when the gold answer is easily reachable from the query head in the KG, while textual models exploit descriptions to give good performance even when the gold answer is not reachable. In response, we explore ensembling as a way of combining the best of both approaches. We propose a novel method for learning query-dependent ensemble weights by using the distributions of scores assigned by individual models to all candidate entities. Our ensemble baseline achieves state-of-the-art results on three standard KGC datasets, with up to 6.8 pt MRR and 8.3 pt Hits@1 gains over best individual models.

自動問答 · MoDELS · 生成模型 · 判別式模型 · 判別器 ·

2023 年 11 月 6 日

Adapting Pre-trained Generative Models for Extractive Question Answering

Prabir Mallick,Tapas Nayak,Indrajit Bhattacharya

from arxiv, Accepted in GEM workshop @ EMNLP 2023

Pre-trained Generative models such as BART, T5, etc. have gained prominence as a preferred method for text generation in various natural language processing tasks, including abstractive long-form question answering (QA) and summarization. However, the potential of generative models in extractive QA tasks, where discriminative models are commonly employed, remains largely unexplored. Discriminative models often encounter challenges associated with label sparsity, particularly when only a small portion of the context contains the answer. The challenge is more pronounced for multi-span answers. In this work, we introduce a novel approach that uses the power of pre-trained generative models to address extractive QA tasks by generating indexes corresponding to context tokens or sentences that form part of the answer. Through comprehensive evaluations on multiple extractive QA datasets, including MultiSpanQA, BioASQ, MASHQA, and WikiQA, we demonstrate the superior performance of our proposed approach compared to existing state-of-the-art models.

語言模型化 · MoDELS · entity · 訓練數據 · Extensibility ·

2023 年 11 月 5 日

Quantifying and Analyzing Entity-level Memorization in Large Language Models

Zhenhong Zhou,Jiuyang Xiang,Chaomeng Chen,Sen Su

from arxiv, 9 pages, 7 figures

Large language models (LLMs) have been proven capable of memorizing their training data, which can be extracted through specifically designed prompts. As the scale of datasets continues to grow, privacy risks arising from memorization have attracted increasing attention. Quantifying language model memorization helps evaluate potential privacy risks. However, prior works on quantifying memorization require access to the precise original data or incur substantial computational overhead, making it difficult for applications in real-world language models. To this end, we propose a fine-grained, entity-level definition to quantify memorization with conditions and metrics closer to real-world scenarios. In addition, we also present an approach for efficiently extracting sensitive entities from autoregressive language models. We conduct extensive experiments based on the proposed, probing language models' ability to reconstruct sensitive entities under different settings. We find that language models have strong memorization at the entity level and are able to reproduce the training data even with partial leakages. The results demonstrate that LLMs not only memorize their training data but also understand associations between entities. These findings necessitate that trainers of LLMs exercise greater prudence regarding model memorization, adopting memorization mitigation techniques to preclude privacy violations.

Learning · 點云 · INTERACT · 隨機初始化 · Performer ·

2023 年 11 月 5 日

HACMan: Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation

Wenxuan Zhou,Bowen Jiang,Fan Yang,Chris Paxton,David Held

from arxiv, 7th Conference on Robot Learning (CoRL 2023)

Manipulating objects without grasping them is an essential component of human dexterity, referred to as non-prehensile manipulation. Non-prehensile manipulation may enable more complex interactions with the objects, but also presents challenges in reasoning about gripper-object interactions. In this work, we introduce Hybrid Actor-Critic Maps for Manipulation (HACMan), a reinforcement learning approach for 6D non-prehensile manipulation of objects using point cloud observations. HACMan proposes a temporally-abstracted and spatially-grounded object-centric action representation that consists of selecting a contact location from the object point cloud and a set of motion parameters describing how the robot will move after making contact. We modify an existing off-policy RL algorithm to learn in this hybrid discrete-continuous action representation. We evaluate HACMan on a 6D object pose alignment task in both simulation and in the real world. On the hardest version of our task, with randomized initial poses, randomized 6D goals, and diverse object categories, our policy demonstrates strong generalization to unseen object categories without a performance drop, achieving an 89% success rate on unseen objects in simulation and 50% success rate with zero-shot transfer in the real world. Compared to alternative action representations, HACMan achieves a success rate more than three times higher than the best baseline. With zero-shot sim2real transfer, our policy can successfully manipulate unseen objects in the real world for challenging non-planar goals, using dynamic and contact-rich non-prehensile skills. Videos can be found on the project website: //hacman-2023.github.io.

容差 · Performer · Raft算法 · state-of-the-art · 結點 ·

2023 年 11 月 2 日

Raft-Forensics: High Performance CFT Consensus with Accountability for Byzantine Faults

Weizhao Tang,Peiyao Sheng,Pronoy Roy,Xuechao Wang,Giulia Fanti,Pramod Viswanath

Crash fault tolerant (CFT) consensus algorithms are commonly used in scenarios where system components are trusted, such as enterprise settings. CFT algorithms offer high throughput and low latency, making them an attractive option for centralized operations that require fault tolerance. However, CFT consensus is vulnerable to Byzantine faults, which can be introduced by a single corrupt component. Such faults can break consensus in the system. Byzantine fault tolerant (BFT) consensus algorithms withstand Byzantine faults, but they are not as competitive with CFT algorithms in terms of performance. In this work, we explore a middle ground between BFT and CFT consensus by exploring the role of accountability in CFT protocols. That is, if a CFT protocol node breaks protocol and affects consensus safety, we aim to identify which node was the culprit. Based on Raft, one of the most popular CFT algorithms, we present Raft-Forensics, which provides accountability over Byzantine faults. We theoretically prove that if two honest components fail to reach consensus, the Raft-Forensics auditing algorithm finds the adversarial component that caused the inconsistency. In an empirical evaluation, we demonstrate that Raft-Forensics performs similarly to Raft and significantly better than state-of-the-art BFT algorithms. With 256 byte messages, Raft-Forensics achieves peak throughput 87.8% of vanilla Raft at 46% higher latency, while state-of-the-art BFT protocol Dumbo-NG only achieves 18.9% peak throughput at nearly $6\times$ higher latency.

MoDELS · CLUES · INTERACT · 圖形處理器 · Neural Networks ·

2021 年 1 月 28 日

A Graph-based Relevance Matching Model for Ad-hoc Retrieval

Yufeng Zhang,Jinghao Zhang,Zeyu Cui,Shu Wu,Liang Wang

from arxiv, To appear at AAAI 2021

To retrieve more relevant, appropriate and useful documents given a query, finding clues about that query through the text is crucial. Recent deep learning models regard the task as a term-level matching problem, which seeks exact or similar query patterns in the document. However, we argue that they are inherently based on local interactions and do not generalise to ubiquitous, non-consecutive contextual relationships.In this work, we propose a novel relevance matching model based on graph neural networks to leverage the document-level word relationships for ad-hoc retrieval. In addition to the local interactions, we explicitly incorporate all contexts of a term through the graph-of-word text format. Matching patterns can be revealed accordingly to provide a more accurate relevance score. Our approach significantly outperforms strong baselines on two ad-hoc benchmarks. We also experimentally compare our model with BERT and show our ad-vantages on long documents.

MoDELS · entity · CC · Performer · 學成 ·

2020 年 3 月 12 日

Learning Conceptual-Contextual Embeddings for Medical Text

Xiao Zhang,Dejing Dou,Ji Wu

External knowledge is often useful for natural language understanding tasks. We introduce a contextual text representation model called Conceptual-Contextual (CC) embeddings, which incorporates structured knowledge into text representations. Unlike entity embedding methods, our approach encodes a knowledge graph into a context model. CC embeddings can be easily reused for a wide range of tasks just like pre-trained language models. Our model effectively encodes the huge UMLS database by leveraging semantic generalizability. Experiments on electronic health records (EHRs) and medical text processing benchmarks showed our model gives a major boost to the performance of supervised medical NLP tasks.

圖卷積神經網絡/圖卷積網絡 · 情感分類 · 圖卷積 · INFORMS · 卷積 ·

2019 年 9 月 8 日

Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks

Chen Zhang,Qiuchi Li,Dawei Song

from arxiv, 11 pages, 4 figures, accepted to EMNLP 2019

Due to their inherent capability in semantic alignment of aspects and their context words, attention mechanism and Convolutional Neural Networks (CNNs) are widely applied for aspect-based sentiment classification. However, these models lack a mechanism to account for relevant syntactical constraints and long-range word dependencies, and hence may mistakenly recognize syntactically irrelevant contextual words as clues for judging aspect sentiment. To tackle this problem, we propose to build a Graph Convolutional Network (GCN) over the dependency tree of a sentence to exploit syntactical information and word dependencies. Based on it, a novel aspect-specific sentiment classification framework is raised. Experiments on three benchmarking collections illustrate that our proposed model has comparable effectiveness to a range of state-of-the-art models, and further demonstrate that both syntactical information and long-range word dependencies are properly captured by the graph convolution structure.

MoDELS · entity · CC · Performer · 學成 ·

2019 年 8 月 16 日

Learning Conceptual-Contexual Embeddings for Medical Text

Xiao Zhang,Dejing Dou,Ji Wu

External knowledge is often useful for natural language understanding tasks. We introduce a contextual text representation model called Conceptual-Contextual (CC) embeddings, which incorporates structured knowledge into text representations. Unlike entity embedding methods, our approach encodes a knowledge graph into a context model. CC embeddings can be easily reused for a wide range of tasks just like pre-trained language models. Our model effectively encodes the huge UMLS database by leveraging semantic generalizability. Experiments on electronic health records (EHRs) and medical text processing benchmarks showed our model gives a major boost to the performance of supervised medical NLP tasks.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

Machine Translation

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='al2yy'></tfoot>

<legend id='al2yy'><style id='al2yy'><dir id='al2yy'><q id='al2yy'></q></dir></style></legend>

<i id='al2yy'><tr id='al2yy'><dt id='al2yy'><q id='al2yy'><span id='al2yy'><b id='al2yy'><form id='al2yy'><ins id='al2yy'></ins><ul id='al2yy'></ul><sub id='al2yy'></sub></form><legend id='al2yy'></legend><bdo id='al2yy'><pre id='al2yy'><center id='al2yy'></center></pre></bdo></b><th id='al2yy'></th></span></q></dt></tr></i><div id='al2yy'><tfoot id='al2yy'></tfoot><dl id='al2yy'><fieldset id='al2yy'></fieldset></dl></div>