久久久久久久精品少妇9999-五月婷婷开心之中文字幕

Federated learning enables multiple decentralized clients to learn collaboratively without sharing the local training data. However, the expensive annotation cost to acquire data labels on local clients remains an obstacle in utilizing local data. In this paper, we propose a federated active learning paradigm to efficiently learn a global model with limited annotation budget while protecting data privacy in a decentralized learning way. The main challenge faced by federated active learning is the mismatch between the active sampling goal of the global model on the server and that of the asynchronous local clients. This becomes even more significant when data is distributed non-IID across local clients. To address the aforementioned challenge, we propose Knowledge-Aware Federated Active Learning (KAFAL), which consists of Knowledge-Specialized Active Sampling (KSAS) and Knowledge-Compensatory Federated Update (KCFU). KSAS is a novel active sampling method tailored for the federated active learning problem. It deals with the mismatch challenge by sampling actively based on the discrepancies between local and global models. KSAS intensifies specialized knowledge in local clients, ensuring the sampled data to be informative for both the local clients and the global model. KCFU, in the meantime, deals with the client heterogeneity caused by limited data and non-IID data distributions. It compensates for each client's ability in weak classes by the assistance of the global model. Extensive experiments and analyses are conducted to show the superiority of KSAS over the state-of-the-art active learning methods and the efficiency of KCFU under the federated active learning framework.

相關內容

主動學(xue)習(xi)

關注 240

主動(dong)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)是(shi)(shi)機器學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)（更普遍的(de)說(shuo)是(shi)(shi)人工智(zhi)能(neng)）的(de)一個(ge)子領(ling)域(yu)，在統(tong)計學(xue)(xue)(xue)(xue)(xue)(xue)領(ling)域(yu)也(ye)叫查詢學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)、最優實驗(yan)設計。“學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)模(mo)塊(kuai)”和“選(xuan)擇(ze)策(ce)略(lve)”是(shi)(shi)主動(dong)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)算法的(de)2個(ge)基本且(qie)重要的(de)模(mo)塊(kuai)。主動(dong)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)是(shi)(shi)“一種學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)方(fang)(fang)法，在這種方(fang)(fang)法中(zhong)，學(xue)(xue)(xue)(xue)(xue)(xue)生(sheng)(sheng)會主動(dong)或(huo)體驗(yan)性地參(can)與(yu)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)過程(cheng)(cheng)(cheng)，并(bing)且(qie)根據學(xue)(xue)(xue)(xue)(xue)(xue)生(sheng)(sheng)的(de)參(can)與(yu)程(cheng)(cheng)(cheng)度(du)，有不同程(cheng)(cheng)(cheng)度(du)的(de)主動(dong)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)。” （Bonwell＆Eison 1991）Bonwell＆Eison（1991）指出：“學(xue)(xue)(xue)(xue)(xue)(xue)生(sheng)(sheng)除(chu)了(le)被動(dong)地聽課以(yi)(yi)外，還從事(shi)(shi)其他活動(dong)。” 在高(gao)(gao)等教育研究協(xie)會（ASHE）的(de)一份報告(gao)中(zhong)，作者(zhe)討論了(le)各種促進主動(dong)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)的(de)方(fang)(fang)法。他們(men)引用(yong)了(le)一些文(wen)獻(xian)，這些文(wen)獻(xian)表明學(xue)(xue)(xue)(xue)(xue)(xue)生(sheng)(sheng)不僅要做(zuo)聽，還必須做(zuo)更多的(de)事(shi)(shi)情(qing)才能(neng)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)。他們(men)必須閱(yue)讀，寫作，討論并(bing)參(can)與(yu)解決(jue)問題。此(ci)過程(cheng)(cheng)(cheng)涉及三個(ge)學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)領(ling)域(yu)，即知識，技能(neng)和態度(du)（KSA）。這種學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)行為分類法可以(yi)(yi)被認(ren)為是(shi)(shi)“學(xue)(xue)(xue)(xue)(xue)(xue)習(xi)過程(cheng)(cheng)(cheng)的(de)目標”。特別是(shi)(shi)，學(xue)(xue)(xue)(xue)(xue)(xue)生(sheng)(sheng)必須從事(shi)(shi)諸如分析，綜合和評估之(zhi)類的(de)高(gao)(gao)級(ji)思維任(ren)務。

data integrity · Integration · 多峰值 · MoDELS · CASE ·

2023 年 10 月 10 日

Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration

Piyush Singh Pasi,Karthikeya Battepati,Preethi Jyothi,Ganesh Ramakrishnan,Tanmay Mahapatra,Manoj Singh

from arxiv, Work Accepted in IJCAI-23- AI and Social Good Track

The problem of audio-to-text alignment has seen significant amount of research using complete supervision during training. However, this is typically not in the context of long audio recordings wherein the text being queried does not appear verbatim within the audio file. This work is a collaboration with a non-governmental organization called CARE India that collects long audio health surveys from young mothers residing in rural parts of Bihar, India. Given a question drawn from a questionnaire that is used to guide these surveys, we aim to locate where the question is asked within a long audio recording. This is of great value to African and Asian organizations that would otherwise have to painstakingly go through long and noisy audio recordings to locate questions (and answers) of interest. Our proposed framework, INDENT, uses a cross-attention-based model and prior information on the temporal ordering of sentences to learn speech embeddings that capture the semantics of the underlying spoken text. These learnt embeddings are used to retrieve the corresponding audio segment based on text queries at inference time. We empirically demonstrate the significant effectiveness (improvement in R-avg of about 3%) of our model over those obtained using text-based heuristics. We also show how noisy ASR, generated using state-of-the-art ASR models for Indian languages, yields better results when used in place of speech. INDENT, trained only on Hindi data is able to cater to all languages supported by the (semantically) shared text space. We illustrate this empirically on 11 Indic languages.

損失 · 路徑 · Networking · 稀疏 · Learning ·

2023 年 10 月 10 日

Transformer-Based Neural Surrogate for Link-Level Path Loss Prediction from Variable-Sized Maps

Thomas M. Hehn,Tribhuvanesh Orekondy,Ori Shental,Arash Behboodi,Juan Bucheli,Akash Doshi,June Namgoong,Taesang Yoo,Ashwin Sampath,Joseph B. Soriaga

from arxiv, Accepted at IEEE GLOBECOM 2023, v2: Changed license on arxiv

Estimating path loss for a transmitter-receiver location is key to many use-cases including network planning and handover. Machine learning has become a popular tool to predict wireless channel properties based on map data. In this work, we present a transformer-based neural network architecture that enables predicting link-level properties from maps of various dimensions and from sparse measurements. The map contains information about buildings and foliage. The transformer model attends to the regions that are relevant for path loss prediction and, therefore, scales efficiently to maps of different size. Further, our approach works with continuous transmitter and receiver coordinates without relying on discretization. In experiments, we show that the proposed model is able to efficiently learn dominant path losses from sparse training data and generalizes well when tested on novel maps.

估計/估計量 · 優化器 · Learning · 置信度 · MoDELS ·

2023 年 10 月 10 日

Bi-Level Offline Policy Optimization with Limited Exploration

Wenzhuo Zhou

We study offline reinforcement learning (RL) which seeks to learn a good policy based on a fixed, pre-collected dataset. A fundamental challenge behind this task is the distributional shift due to the dataset lacking sufficient exploration, especially under function approximation. To tackle this issue, we propose a bi-level structured policy optimization algorithm that models a hierarchical interaction between the policy (upper-level) and the value function (lower-level). The lower level focuses on constructing a confidence set of value estimates that maintain sufficiently small weighted average Bellman errors, while controlling uncertainty arising from distribution mismatch. Subsequently, at the upper level, the policy aims to maximize a conservative value estimate from the confidence set formed at the lower level. This novel formulation preserves the maximum flexibility of the implicitly induced exploratory data distribution, enabling the power of model extrapolation. In practice, it can be solved through a computationally efficient, penalized adversarial estimation procedure. Our theoretical regret guarantees do not rely on any data-coverage and completeness-type assumptions, only requiring realizability. These guarantees also demonstrate that the learned policy represents the "best effort" among all policies, as no other policies can outperform it. We evaluate our model using a blend of synthetic, benchmark, and real-world datasets for offline RL, showing that it performs competitively with state-of-the-art methods.

Learning · 表示學習 · 不變 · 表示 · 潛在 ·

2023 年 10 月 9 日

Multi-Domain Causal Representation Learning via Weak Distributional Invariances

Kartik Ahuja,Amin Mansouri,Yixin Wang

Causal representation learning has emerged as the center of action in causal machine learning research. In particular, multi-domain datasets present a natural opportunity for showcasing the advantages of causal representation learning over standard unsupervised representation learning. While recent works have taken crucial steps towards learning causal representations, they often lack applicability to multi-domain datasets due to over-simplifying assumptions about the data; e.g. each domain comes from a different single-node perfect intervention. In this work, we relax these assumptions and capitalize on the following observation: there often exists a subset of latents whose certain distributional properties (e.g., support, variance) remain stable across domains; this property holds when, for example, each domain comes from a multi-node imperfect intervention. Leveraging this observation, we show that autoencoders that incorporate such invariances can provably identify the stable set of latents from the rest across different settings.

Conformer · U-Net · 語音增強 · MoDELS · 塊 ·

2023 年 10 月 8 日

Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement

Shafique Ahmed,Chia-Wei Chen,Wenze Ren,Chin-Jou Li,Ernie Chu,Jun-Cheng Chen,Amir Hussain,Hsin-Min Wang,Yu Tsao,Jen-Cheng Hou

Recent studies have increasingly acknowledged the advantages of incorporating visual data into speech enhancement (SE) systems. In this paper, we introduce a novel audio-visual SE approach, termed DCUC-Net (deep complex U-Net with conformer network). The proposed DCUC-Net leverages complex domain features and a stack of conformer blocks. The encoder and decoder of DCUC-Net are designed using a complex U-Net-based framework. The audio and visual signals are processed using a complex encoder and a ResNet-18 model, respectively. These processed signals are then fused using the conformer blocks and transformed into enhanced speech waveforms via a complex decoder. The conformer blocks consist of a combination of self-attention mechanisms and convolutional operations, enabling DCUC-Net to effectively capture both global and local audio-visual dependencies. Our experimental results demonstrate the effectiveness of DCUC-Net, as it outperforms the baseline model from the COG-MHEAR AVSE Challenge 2023 by a notable margin of 0.14 in terms of PESQ. Additionally, the proposed DCUC-Net performs comparably to a state-of-the-art model and outperforms all other compared models on the Taiwan Mandarin speech with video (TMSV) dataset.

Learning · 聯邦學習 · Machine Learning · ML · 可辨認的 ·

2023 年 10 月 6 日

A Comprehensive Empirical Study of Bugs in Open-Source Federated Learning Frameworks

Weijie Shao,Yuyang Gao,Fu Song,Sen Chen,Lingling Fan,JingZhu He

Federated learning (FL) is a distributed machine learning (ML) paradigm, allowing multiple clients to collaboratively train shared machine learning (ML) models without exposing clients' data privacy. It has gained substantial popularity in recent years, especially since the enforcement of data protection laws and regulations in many countries. To foster the application of FL, a variety of FL frameworks have been proposed, allowing non-experts to easily train ML models. As a result, understanding bugs in FL frameworks is critical for facilitating the development of better FL frameworks and potentially encouraging the development of bug detection, localization and repair tools. Thus, we conduct the first empirical study to comprehensively collect, taxonomize, and characterize bugs in FL frameworks. Specifically, we manually collect and classify 1,119 bugs from all the 676 closed issues and 514 merged pull requests in 17 popular and representative open-source FL frameworks on GitHub. We propose a classification of those bugs into 12 bug symptoms, 12 root causes, and 18 fix patterns. We also study their correlations and distributions on 23 functionalities. We identify nine major findings from our study, discuss their implications and future research directions based on our findings.

多峰值 · 穩健性 · Performer · 多模態學習 · Learning ·

2023 年 10 月 6 日

Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation

Md Kaykobad Reza,Ashley Prater-Bennette,M. Salman Asif

from arxiv, 18 pages, 3 figures, 11 tables

Multimodal learning seeks to utilize data from multiple sources to improve the overall performance of downstream tasks. It is desirable for redundancies in the data to make multimodal systems robust to missing or corrupted observations in some correlated modalities. However, we observe that the performance of several existing multimodal networks significantly deteriorates if one or multiple modalities are absent at test time. To enable robustness to missing modalities, we propose simple and parameter-efficient adaptation procedures for pretrained multimodal networks. In particular, we exploit low-rank adaptation and modulation of intermediate features to compensate for the missing modalities. We demonstrate that such adaptation can partially bridge performance drop due to missing modalities and outperform independent, dedicated networks trained for the available modality combinations in some cases. The proposed adaptation requires extremely small number of parameters (e.g., fewer than 0.7% of the total parameters in most experiments). We conduct a series of experiments to highlight the robustness of our proposed method using diverse datasets for RGB-thermal and RGB-Depth semantic segmentation, multimodal material segmentation, and multimodal sentiment analysis tasks. Our proposed method demonstrates versatility across various tasks and datasets, and outperforms existing methods for robust multimodal learning with missing modalities.

contrastive · 學成 · 對比學習 · 目標檢測 · 優化器 ·

2021 年 4 月 4 日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Xinlong Wang,Rufeng Zhang,Chunhua Shen,Tao Kong,Lei Li

from arxiv, 11 pages. Accepted to IEEE/CVF Conf. Comp. Vision Pattern Recognition (CVPR) 2021; Oral paper

To date, most existing self-supervised learning methods are designed and optimized for image classification. These pre-trained models can be sub-optimal for dense prediction tasks due to the discrepancy between image-level prediction and pixel-level prediction. To fill this gap, we aim to design an effective, dense self-supervised learning method that directly works at the level of pixels (or local features) by taking into account the correspondence between local features. We present dense contrastive learning, which implements self-supervised learning by optimizing a pairwise contrastive (dis)similarity loss at the pixel level between two views of input images. Compared to the baseline method MoCo-v2, our method introduces negligible computation overhead (only <1% slower), but demonstrates consistently superior performance when transferring to downstream dense prediction tasks including object detection, semantic segmentation and instance segmentation; and outperforms the state-of-the-art methods by a large margin. Specifically, over the strong MoCo-v2 baseline, our method achieves significant improvements of 2.0% AP on PASCAL VOC object detection, 1.1% AP on COCO object detection, 0.9% AP on COCO instance segmentation, 3.0% mIoU on PASCAL VOC semantic segmentation and 1.8% mIoU on Cityscapes semantic segmentation. Code is available at: //git.io/AdelaiDet

INFORMS · 學成 · 強化學習 · 分離的 · state-of-the-art ·

2021 年 2 月 7 日

MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration

Jin Zhang,Jianhao Wang,Hao Hu,Tong Chen,Yingfeng Chen,Changjie Fan,Chongjie Zhang

Meta reinforcement learning (meta-RL) extracts knowledge from previous tasks and achieves fast adaptation to new tasks. Despite recent progress, efficient exploration in meta-RL remains a key challenge in sparse-reward tasks, as it requires quickly finding informative task-relevant experiences in both meta-training and adaptation. To address this challenge, we explicitly model an exploration policy learning problem for meta-RL, which is separated from exploitation policy learning, and introduce a novel empowerment-driven exploration objective, which aims to maximize information gain for task identification. We derive a corresponding intrinsic reward and develop a new off-policy meta-RL framework, which efficiently learns separate context-aware exploration and exploitation policies by sharing the knowledge of task inference. Experimental evaluation shows that our meta-RL method significantly outperforms state-of-the-art baselines on various sparse-reward MuJoCo locomotion tasks and more complex sparse-reward Meta-World tasks.

小樣本學習 · 學成 · 示例 · 泛函 · 訓練實例 ·

2018 年 12 月 10 日

Learning Embedding Adaptation for Few-Shot Learning

Han-Jia Ye,Hexiang Hu,De-Chuan Zhan,Fei Sha

Learning with limited data is a key challenge for visual recognition. Few-shot learning methods address this challenge by learning an instance embedding function from seen classes and apply the function to instances from unseen classes with limited labels. This style of transfer learning is task-agnostic: the embedding function is not learned optimally discriminative with respect to the unseen classes, where discerning among them is the target task. In this paper, we propose a novel approach to adapt the embedding model to the target classification task, yielding embeddings that are task-specific and are discriminative. To this end, we employ a type of self-attention mechanism called Transformer to transform the embeddings from task-agnostic to task-specific by focusing on relating instances from the test instances to the training instances in both seen and unseen classes. Our approach also extends to both transductive and generalized few-shot classification, two important settings that have essential use cases. We verify the effectiveness of our model on two standard benchmark few-shot classification datasets --- MiniImageNet and CUB, where our approach demonstrates state-of-the-art empirical performance.