亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<li id='bEJsR'></li>

_{^{<dd id='2pcM8'><tbody id='bhI4o'><td id='a8Lga'><optgroup id='w1XyN'><strong id='fQfEH'></strong></optgroup><address id='OJdN7'><ul id='wsbht'></ul></address><big id='jn0lt'></big></td><table id='xXpVS'></table></tbody><pre id='s8WhF'></pre></dd><span id='48FnE'><b id='ljHft'></b></span>}}


<dfn id='UdWii'><optgroup id='8a1bg'></optgroup></dfn><tfoot id='mZFPI'><bdo id='C7oqN'><div id='udMbC'></div><i id='v5E8d'><dt id='GASOs'></dt></i></bdo></tfoot>

_{<fieldset id='ZCR7y'></fieldset>}

·

命名實體識別 · 置信度 · 標注 · MoDELS · Learning ·

2023 年 7 月 27 日

A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition

Limao Xiong,Jie Zhou,Qunxi Zhu,Xiao Wang,Yuanbin Wu,Qi Zhang,Tao Gui,Xuanjing Huang,Jin Ma,Ying Shan

Existing models for named entity recognition (NER) are mainly based on large-scale labeled datasets, which always obtain using crowdsourcing. However, it is hard to obtain a unified and correct label via majority voting from multiple annotators for NER due to the large labeling space and complexity of this task. To address this problem, we aim to utilize the original multi-annotator labels directly. Particularly, we propose a Confidence-based Partial Label Learning (CPLL) method to integrate the prior confidence (given by annotators) and posterior confidences (learned by models) for crowd-annotated NER. This model learns a token- and content-dependent confidence via an Expectation-Maximization (EM) algorithm by minimizing empirical risk. The true posterior estimator and confidence estimator perform iteratively to update the true posterior and confidence respectively. We conduct extensive experimental results on both real-world and synthetic datasets, which show that our model can improve performance effectively compared with strong baselines.

相關內容

命名實體識別

命名實體識(shi)別

命名(ming)(ming)實體(ti)識別（NER）（也(ye)稱為實體(ti)標識，實體(ti)組塊和實體(ti)提取）是信息抽取的子任務，旨(zhi)在將非結構(gou)(gou)化(hua)文(wen)本中(zhong)提到的命名(ming)(ming)實體(ti)定(ding)位和分類(lei)為預定(ding)義(yi)類(lei)別，例如人員姓名(ming)(ming)、地名(ming)(ming)、機構(gou)(gou)名(ming)(ming)、專有(you)名(ming)(ming)詞(ci)等。

知識薈萃

精(jing)品入門和進階教(jiao)程(cheng)、論文和代碼(ma)整理(li)等(deng)

更多

查(cha)看相關VIP內容(rong)、論文、資(zi)訊(xun)等(deng)

MoDELS · Learning · 機器學習建模 · contrastive · ML ·

2023 年 9 月 18 日

Towards Better Modeling with Missing Data: A Contrastive Learning-based Visual Analytics Perspective

Laixin Xie,Yang Ouyang,Longfei Chen,Ziming Wu,Quan Li

from arxiv, 18 pages, 11 figures. This paper is accepted by IEEE Transactions on Visualization and Computer Graphics (TVCG)

Missing data can pose a challenge for machine learning (ML) modeling. To address this, current approaches are categorized into feature imputation and label prediction and are primarily focused on handling missing data to enhance ML performance. These approaches rely on the observed data to estimate the missing values and therefore encounter three main shortcomings in imputation, including the need for different imputation methods for various missing data mechanisms, heavy dependence on the assumption of data distribution, and potential introduction of bias. This study proposes a Contrastive Learning (CL) framework to model observed data with missing values, where the ML model learns the similarity between an incomplete sample and its complete counterpart and the dissimilarity between other samples. Our proposed approach demonstrates the advantages of CL without requiring any imputation. To enhance interpretability, we introduce CIVis, a visual analytics system that incorporates interpretable techniques to visualize the learning process and diagnose the model status. Users can leverage their domain knowledge through interactive sampling to identify negative and positive pairs in CL. The output of CIVis is an optimized model that takes specified features and predicts downstream tasks. We provide two usage scenarios in regression and classification tasks and conduct quantitative experiments, expert interviews, and a qualitative user study to demonstrate the effectiveness of our approach. In short, this study offers a valuable contribution to addressing the challenges associated with ML modeling in the presence of missing data by providing a practical solution that achieves high predictive accuracy and model interpretability.

語音增強 · Performer · Processing（編程語言） · 泛函 · 估計/估計量 ·

2023 年 9 月 18 日

Single and Few-step Diffusion for Generative Speech Enhancement

Bunlong Lay,Jean-Marie Lemercier,Julius Richter,Timo Gerkmann

from arxiv, 5 pages, 1 figure, 1 table

Diffusion models have shown promising results in speech enhancement, using a task-adapted diffusion process for the conditional generation of clean speech given a noisy mixture. However, at test time, the neural network used for score estimation is called multiple times to solve the iterative reverse process. This results in a slow inference process and causes discretization errors that accumulate over the sampling trajectory. In this paper, we address these limitations through a two-stage training approach. In the first stage, we train the diffusion model the usual way using the generative denoising score matching loss. In the second stage, we compute the enhanced signal by solving the reverse process and compare the resulting estimate to the clean speech target using a predictive loss. We show that using this second training stage enables achieving the same performance as the baseline model using only 5 function evaluations instead of 60 function evaluations. While the performance of usual generative diffusion algorithms drops dramatically when lowering the number of function evaluations (NFEs) to obtain single-step diffusion, we show that our proposed method keeps a steady performance and therefore largely outperforms the diffusion baseline in this setting and also generalizes better than its predictive counterpart.

Siamese · Extensibility · INFORMS · state-of-the-art · Performer ·

2023 年 9 月 15 日

DSRRTracker: Dynamic Search Region Refinement for Attention-based Siamese Multi-Object Tracking

JiaXu Wan,Hong Zhang,Jin Zhang,Yuan Ding,Yifan Yang,Yan Li,Xuliang Li

from arxiv, The paper contained some errors in the legends and visualisations, such as incorrectly using the visualisations of the next generation model we studied. We have rewritten our paper on its next-generation model based on that paper. Since we do not want readers to misunderstand the next-generation paper due to the errors in this preprint paper, we have decided to withdraw this preprint paper

Many multi-object tracking (MOT) methods follow the framework of "tracking by detection", which associates the target objects-of-interest based on the detection results. However, due to the separate models for detection and association, the tracking results are not optimal.Moreover, the speed is limited by some cumbersome association methods to achieve high tracking performance. In this work, we propose an end-to-end MOT method, with a Gaussian filter-inspired dynamic search region refinement module to dynamically filter and refine the search region by considering both the template information from the past frames and the detection results from the current frame with little computational burden, and a lightweight attention-based tracking head to achieve the effective fine-grained instance association. Extensive experiments and ablation study on MOT17 and MOT20 datasets demonstrate that our method can achieve the state-of-the-art performance with reasonable speed.

Attention · 流 · 語音識別 · MoDELS · 可約的 ·

2023 年 9 月 14 日

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

Yang Li,Liangzhen Lai,Yuan Shangguan,Forrest N. Iandola,Ernie Chang,Yangyang Shi,Vikas Chandra

Transformer-based models excel in speech recognition. Existing efforts to optimize Transformer inference, typically for long-context applications, center on simplifying attention score calculations. However, streaming speech recognition models usually process a limited number of tokens each time, making attention score calculation less of a bottleneck. Instead, the bottleneck lies in the linear projection layers of multi-head attention and feedforward networks, constituting a substantial portion of the model size and contributing significantly to computation, memory, and power usage. To address this bottleneck, we propose folding attention, a technique targeting these linear layers, significantly reducing model size and improving memory and power efficiency. Experiments on on-device Transformer-based streaming speech recognition models show that folding attention reduces model size (and corresponding memory consumption) by up to 24% and power consumption by up to 23%, all without compromising model accuracy or computation overhead.

Learning · MoDELS · Integration · Networking · 遷移學習 ·

2023 年 9 月 14 日

Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning

Zhiwu Qing,Shiwei Zhang,Ziyuan Huang,Yingya Zhang,Changxin Gao,Deli Zhao,Nong Sang

from arxiv, ICCV2023. Code: //github.com/alibaba-mmai-research/DiST

Recently, large-scale pre-trained language-image models like CLIP have shown extraordinary capabilities for understanding spatial contents, but naively transferring such models to video recognition still suffers from unsatisfactory temporal modeling capabilities. Existing methods insert tunable structures into or in parallel with the pre-trained model, which either requires back-propagation through the whole pre-trained model and is thus resource-demanding, or is limited by the temporal reasoning capability of the pre-trained structure. In this work, we present DiST, which disentangles the learning of spatial and temporal aspects of videos. Specifically, DiST uses a dual-encoder structure, where a pre-trained foundation model acts as the spatial encoder, and a lightweight network is introduced as the temporal encoder. An integration branch is inserted between the encoders to fuse spatio-temporal information. The disentangled spatial and temporal learning in DiST is highly efficient because it avoids the back-propagation of massive pre-trained parameters. Meanwhile, we empirically show that disentangled learning with an extra network for integration benefits both spatial and temporal understanding. Extensive experiments on five benchmarks show that DiST delivers better performance than existing state-of-the-art methods by convincing gaps. When pre-training on the large-scale Kinetics-710, we achieve 89.7% on Kinetics-400 with a frozen ViT-L model, which verifies the scalability of DiST. Codes and models can be found in //github.com/alibaba-mmai-research/DiST.

entity · 語言模型化 · E2E · MoDELS · 分解的 ·

2023 年 9 月 14 日

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

Peng Wang,Yifan Yang,Zheng Liang,Tian Tan,Shiliang Zhang,Xie Chen

In spite of the excellent strides made by end-to-end (E2E) models in speech recognition in recent years, named entity recognition is still challenging but critical for semantic understanding. In order to enhance the ability to recognize named entities in E2E models, previous studies mainly focus on various rule-based or attention-based contextual biasing algorithms. However, their performance might be sensitive to the biasing weight or degraded by excessive attention to the named entity list, along with a risk of false triggering. Inspired by the success of the class-based language model (LM) in named entity recognition in conventional hybrid systems and the effective decoupling of acoustic and linguistic information in the factorized neural Transducer (FNT), we propose a novel E2E model to incorporate class-based LMs into FNT, which is referred as C-FNT. In C-FNT, the language model score of named entities can be associated with the name class instead of its surface form. The experimental results show that our proposed C-FNT presents significant error reduction in named entities without hurting performance in general word recognition.

MoDELS · 語言模型化 · 語音識別 · 分離的 · 自動語音識別 ·

2023 年 9 月 14 日

Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation

Shaoshi Ling,Guoli Ye,Rui Zhao,Yifan Gong

Attention-based encoder-decoder (AED) speech recognition model has been widely successful in recent years. However, the joint optimization of acoustic model and language model in end-to-end manner has created challenges for text adaptation. In particular, effectively, quickly and inexpensively adapting text has become a primary concern for deploying AED systems in industry. To address this issue, we propose a novel model, the hybrid attention-based encoder-decoder (HAED) speech recognition model that preserves the modularity of conventional hybrid automatic speech recognition systems. Our HAED model separates the acoustic and language models, allowing for the use of conventional text-based language model adaptation techniques. We demonstrate that the proposed HAED model yields 21\% Word Error Rate (WER) improvements in relative when out-of-domain text data is used for language model adaptation, and with only a minor degradation in WER on a general test set compared with conventional AED model.

Performer · 損失 · Learning · Networking · 可辨認的 ·

2023 年 9 月 13 日

Multi-Modal Hybrid Learning and Sequential Training for RGB-T Saliency Detection

Guangyu Ren,Jitesh Joshi,Youngjun Cho

from arxiv, 8 Pages main text, 3 pages supplementary information, 12 figures

RGB-T saliency detection has emerged as an important computer vision task, identifying conspicuous objects in challenging scenes such as dark environments. However, existing methods neglect the characteristics of cross-modal features and rely solely on network structures to fuse RGB and thermal features. To address this, we first propose a Multi-Modal Hybrid loss (MMHL) that comprises supervised and self-supervised loss functions. The supervised loss component of MMHL distinctly utilizes semantic features from different modalities, while the self-supervised loss component reduces the distance between RGB and thermal features. We further consider both spatial and channel information during feature fusion and propose the Hybrid Fusion Module to effectively fuse RGB and thermal features. Lastly, instead of jointly training the network with cross-modal features, we implement a sequential training strategy which performs training only on RGB images in the first stage and then learns cross-modal features in the second stage. This training strategy improves saliency detection performance without computational overhead. Results from performance evaluation and ablation studies demonstrate the superior performance achieved by the proposed method compared with the existing state-of-the-art methods.

學成 · 表示學習 · MoDELS · CASES · contrastive ·

2021 年 6 月 3 日

Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement

Xinyu Zuo,Pengfei Cao,Yubo Chen,Kang Liu,Jun Zhao,Weihua Peng,Yuguang Chen

from arxiv, Accepted to Findings of ACL 2021

Current models for event causality identification (ECI) mainly adopt a supervised framework, which heavily rely on labeled data for training. Unfortunately, the scale of current annotated datasets is relatively limited, which cannot provide sufficient support for models to capture useful indicators from causal statements, especially for handing those new, unseen cases. To alleviate this problem, we propose a novel approach, shortly named CauSeRL, which leverages external causal statements for event causality identification. First of all, we design a self-supervised framework to learn context-specific causal patterns from external causal statements. Then, we adopt a contrastive transfer strategy to incorporate the learned context-specific causal patterns into the target ECI model. Experimental results show that our method significantly outperforms previous methods on EventStoryLine and Causal-TimeBank (+2.0 and +3.4 points on F1 value respectively).

Networking · INTERACT · INFORMS · 卷積 · MoDELS ·

2021 年 1 月 21 日

Self-Supervised Multi-Channel Hypergraph Convolutional Network for Social Recommendation

Junliang Yu,Hongzhi Yin,Jundong Li,Qinyong Wang,Nguyen Quoc Viet Hung,Xiangliang Zhang

from arxiv, 12 pages, Accepted to WWW'21

Social relations are often used to improve recommendation quality when user-item interaction data is sparse in recommender systems. Most existing social recommendation models exploit pairwise relations to mine potential user preferences. However, real-life interactions among users are very complicated and user relations can be high-order. Hypergraph provides a natural way to model complex high-order relations, while its potentials for improving social recommendation are under-explored. In this paper, we fill this gap and propose a multi-channel hypergraph convolutional network to enhance social recommendation by leveraging high-order user relations. Technically, each channel in the network encodes a hypergraph that depicts a common high-order user relation pattern via hypergraph convolution. By aggregating the embeddings learned through multiple channels, we obtain comprehensive user representations to generate recommendation results. However, the aggregation operation might also obscure the inherent characteristics of different types of high-order connectivity information. To compensate for the aggregating loss, we innovatively integrate self-supervised learning into the training of the hypergraph convolutional network to regain the connectivity information with hierarchical mutual information maximization. The experimental results on multiple real-world datasets show that the proposed model outperforms the SOTA methods, and the ablation study verifies the effectiveness of the multi-channel setting and the self-supervised task. The implementation of our model is available via //github.com/Coder-Yu/RecQ.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

命(ming)名實體識別

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='5g3iw'></tfoot>

<legend id='5g3iw'><style id='5g3iw'><dir id='5g3iw'><q id='5g3iw'></q></dir></style></legend>

<i id='5g3iw'><tr id='5g3iw'><dt id='5g3iw'><q id='5g3iw'><span id='5g3iw'><b id='5g3iw'><form id='5g3iw'><ins id='5g3iw'></ins><ul id='5g3iw'></ul><sub id='5g3iw'></sub></form><legend id='5g3iw'></legend><bdo id='5g3iw'><pre id='5g3iw'><center id='5g3iw'></center></pre></bdo></b><th id='5g3iw'></th></span></q></dt></tr></i><div id='5g3iw'><tfoot id='5g3iw'></tfoot><dl id='5g3iw'><fieldset id='5g3iw'></fieldset></dl></div>