Personalized federated learning is tasked with training machine learning models for multiple clients, each with its own data distribution. The goal is to train personalized models in a collaborative way while accounting for data disparities across clients and reducing communication costs. We propose a novel approach to this problem using hypernetworks, termed pFedHN for personalized Federated HyperNetworks. In this approach, a central hypernetwork model is trained to generate a set of models, one model for each client. This architecture provides effective parameter sharing across clients, while maintaining the capacity to generate unique and diverse personal models. Furthermore, since hypernetwork parameters are never transmitted, this approach decouples the communication cost from the trainable model size. We test pFedHN empirically in several personalized federated learning challenges and find that it outperforms previous methods. Finally, since hypernetworks share information across clients we show that pFedHN can generalize better to new clients whose distributions differ from any client observed during training.
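The parameter-sharing idea in the abstract can be illustrated with a toy sketch. It assumes a purely linear hypernetwork and flat weight vectors; the class name `ToyHyperNetwork`, its methods, and all sizes are hypothetical and are not taken from the pFedHN paper. Each client has its own embedding, while a single shared map turns any embedding into that client's personal weights. Only the generated weights would need to travel to a client, so the payload size depends on the client model and not on the hypernetwork.

```python
import random


class ToyHyperNetwork:
    """Hypothetical linear hypernetwork: client embedding -> flat weight vector."""

    def __init__(self, n_clients, embed_dim, target_size, seed=0):
        rng = random.Random(seed)
        # One embedding per client (client-specific part).
        self.embeddings = [[rng.gauss(0.0, 1.0) for _ in range(embed_dim)]
                           for _ in range(n_clients)]
        # One matrix shared by every client (parameter sharing).
        self.weights = [[rng.gauss(0.0, 0.1) for _ in range(embed_dim)]
                        for _ in range(target_size)]

    def generate(self, client_id):
        """Return the personal model weights for one client."""
        v = self.embeddings[client_id]
        return [sum(w * x for w, x in zip(row, v)) for row in self.weights]

    def num_parameters(self):
        """Count hypernetwork parameters, which stay on the server."""
        n_embed = sum(len(v) for v in self.embeddings)
        n_shared = sum(len(row) for row in self.weights)
        return n_embed + n_shared


hn = ToyHyperNetwork(n_clients=3, embed_dim=4, target_size=10)
personal = [hn.generate(c) for c in range(3)]
payload_size = len(personal[0])  # what a client would receive
```

In this sketch the clients differ only through their embeddings, which is one simple way to obtain distinct personal models from shared parameters.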

Related content

Federated Learning


Federated learning is an emerging foundational artificial intelligence technology. It was first proposed by Google in 2016, originally to solve the problem of Android phone users updating models locally. Its design goal is to carry out efficient machine learning among multiple participants or computing nodes while safeguarding information security during large-scale data exchange, protecting terminal data and personal data privacy, and ensuring legal and regulatory compliance. The machine learning algorithms that can be used in federated learning are not limited to neural networks and also include important algorithms such as random forests. Federated learning is expected to become the foundation of the next generation of collaborative AI algorithms and cooperative networks.

Performer · Wireless Networks · Networks · Loss function (machine learning) · Better

April 30, 2021

On In-network learning. A Comparative Study with Federated and Split Learning

Matei Moldoveanu,Abdellatif Zaidi

from arxiv, Submitted to the 2021 IEEE 22nd International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), special session on Machine learning at the Edge

In this paper, we consider a problem in which distributively extracted features are used for performing inference in wireless networks. We elaborate on our proposed architecture, which we herein refer to as "in-network learning", provide a suitable loss function and discuss its optimization using neural networks. We compare its performance with both Federated- and Split learning; and show that this architecture offers both better accuracy and bandwidth savings.

Cluster · Federated learning · Identifiable · Learning · Graph

April 29, 2021

Cluster-driven Graph Federated Learning over Multiple Domains

Debora Caldarola,Massimiliano Mancini,Fabio Galasso,Marco Ciccone,Emanuele Rodolà,Barbara Caputo

from arxiv, Accepted to CVPR21 Workshop Learning from Limited or Imperfect Data (L^2ID)

Federated Learning (FL) deals with learning a central model (i.e. the server) in privacy-constrained scenarios, where data are stored on multiple devices (i.e. the clients). The central model has no direct access to the data, but only to the updates of the parameters computed locally by each client. This raises a problem, known as statistical heterogeneity, because the clients may have different data distributions (i.e. domains). This is only partly alleviated by clustering the clients. Clustering may reduce heterogeneity by identifying the domains, but it deprives each cluster model of the data and supervision of others. Here we propose a novel Cluster-driven Graph Federated Learning (FedCG). In FedCG, clustering serves to address statistical heterogeneity, while Graph Convolutional Networks (GCNs) enable sharing knowledge across them. FedCG: i) identifies the domains via an FL-compliant clustering and instantiates domain-specific modules (residual branches) for each domain; ii) connects the domain-specific modules through a GCN at training to learn the interactions among domains and share knowledge; and iii) learns to cluster unsupervised via teacher-student classifier-training iterations and to address novel unseen test domains via their domain soft-assignment scores. Thanks to the unique interplay of GCN over clusters, FedCG achieves the state-of-the-art on multiple FL benchmarks.

Federated learning · Round · MoDELS · Learning · Inference

April 29, 2021

PPFL: Privacy-preserving Federated Learning with Trusted Execution Environments

Fan Mo,Hamed Haddadi,Kleomenis Katevas,Eduard Marin,Diego Perino,Nicolas Kourtellis

from arxiv, 15 pages, 8 figures, accepted to MobiSys 2021

We propose and implement a Privacy-preserving Federated Learning (PPFL) framework for mobile systems to limit privacy leakages in federated learning. Leveraging the widespread presence of Trusted Execution Environments (TEEs) in high-end and mobile devices, we utilize TEEs on clients for local training, and on servers for secure aggregation, so that model/gradient updates are hidden from adversaries. Challenged by the limited memory size of current TEEs, we leverage greedy layer-wise training to train each model's layer inside the trusted area until its convergence. The performance evaluation of our implementation shows that PPFL can significantly improve privacy while incurring small system overheads at the client-side. In particular, PPFL can successfully defend the trained model against data reconstruction, property inference, and membership inference attacks. Furthermore, it can achieve comparable model utility with fewer communication rounds (0.54x) and a similar amount of network traffic (1.002x) compared to the standard federated learning of a complete model. This is achieved while only introducing up to ~15% CPU time, ~18% memory usage, and ~21% energy consumption overhead in PPFL's client-side.

Machine Learning · Learning · Distributed machine learning · Federated learning · Taxonomy

April 29, 2021

From Distributed Machine Learning to Federated Learning: A Survey

Ji Liu,Jizhou Huang,Yang Zhou,Xuhong Li,Shilei Ji,Haoyi Xiong,Dejing Dou

from arxiv, 31 pages, 8 figures

In recent years, data and computing resources are typically distributed in the devices of end users, various regions or organizations. Because of laws or regulations, the distributed data and computing resources cannot be directly shared among different regions or organizations for machine learning tasks. Federated learning emerges as an efficient approach to exploit distributed data and computing resources, so as to collaboratively train machine learning models, while obeying the laws and regulations and ensuring data security and data privacy. In this paper, we provide a comprehensive survey of existing works for federated learning. We propose a functional architecture of federated learning systems and a taxonomy of related techniques. Furthermore, we present the distributed training, data communication, and security of FL systems. Finally, we analyze their limitations and propose future research directions.

Federated learning · Learning · Extensibility · MoDELS · Performer

March 30, 2021

Model-Contrastive Federated Learning

Qinbin Li,Bingsheng He,Dawn Song

from arxiv, Accepted by CVPR 2021

Federated learning enables multiple parties to collaboratively train a machine learning model without communicating their local data. A key challenge in federated learning is to handle the heterogeneity of local data distribution across parties. Although many studies have been proposed to address this challenge, we find that they fail to achieve high performance in image datasets with deep learning models. In this paper, we propose MOON: model-contrastive federated learning. MOON is a simple and effective federated learning framework. The key idea of MOON is to utilize the similarity between model representations to correct the local training of individual parties, i.e., conducting contrastive learning in model-level. Our extensive experiments show that MOON significantly outperforms the other state-of-the-art federated learning algorithms on various image classification tasks.
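The abstract does not spell out the loss, so the following is only a sketch of what "contrastive learning at the model level" could look like. It assumes cosine similarity between representation vectors and a temperature `tau`; the function names and the default value are hypothetical. The representation from the current local model is pulled toward the global model's representation of the same input and pushed away from the one produced by the party's previous local model.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def model_contrastive_loss(z_local, z_global, z_prev, tau=0.5):
    """Low when z_local is closer to z_global than to z_prev (assumed form)."""
    pos = math.exp(cosine(z_local, z_global) / tau)  # positive pair
    neg = math.exp(cosine(z_local, z_prev) / tau)    # negative pair
    return -math.log(pos / (pos + neg))


# Local representation agrees with the global model: small loss.
aligned = model_contrastive_loss([1.0, 0.0], [1.0, 0.0], [-1.0, 0.0])
# Local representation has drifted back toward the old local model: large loss.
drifted = model_contrastive_loss([1.0, 0.0], [-1.0, 0.0], [1.0, 0.0])
```

In a full training loop a term like this would be added to the usual supervised loss with some weighting, which is again an assumption here.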

Few-shot learning · Attention mechanism · GPU · GNN · Learning

July 14, 2020

Attentive Graph Neural Networks for Few-Shot Learning

Hao Cheng,Joey Tianyi Zhou,Wee Peng Tay,Bihan Wen

Graph Neural Networks (GNNs) have demonstrated superior performance in many challenging applications, including few-shot learning tasks. Despite their powerful capacity to learn and generalize from few samples, GNNs usually suffer from severe over-fitting and over-smoothing as the model becomes deep, which limits model scalability. In this work, we propose a novel Attentive GNN to tackle these challenges by incorporating a triple-attention mechanism, i.e., node self-attention, neighborhood attention, and layer memory attention. We explain why the proposed attentive modules can improve GNN for few-shot learning with theoretical analysis and illustrations. Extensive experiments show that the proposed Attentive GNN outperforms the state-of-the-art GNN-based methods for few-shot learning over the mini-ImageNet and Tiered-ImageNet datasets, in both inductive and transductive settings.

Cluster · Performer · Dataset · MoDELS · DBSCAN

October 30, 2019

Meta-Learning to Cluster

Yibo Jiang,Nakul Verma

Clustering is one of the most fundamental and widespread techniques in exploratory data analysis. Yet, the basic approach to clustering has not really changed: a practitioner hand-picks a task-specific clustering loss to optimize and fit the given data to reveal the underlying cluster structure. Some types of losses, such as k-means or its non-linear version, kernelized k-means (centroid based), and DBSCAN (density based), are popular choices due to their good empirical performance on a range of applications. Every so often, however, the clustering output using these standard losses fails to reveal the underlying structure, and the practitioner has to custom-design their own variation. In this work we take an intrinsically different approach to clustering: rather than fitting a dataset to a specific clustering loss, we train a recurrent model that learns how to cluster. The model uses as training pairs examples of datasets (as input) and their corresponding cluster identities (as output). By providing multiple types of training datasets as inputs, our model has the ability to generalize well on unseen datasets (new clustering tasks). Our experiments reveal that by training on simple synthetically generated datasets or on existing real datasets, we can achieve better clustering performance on unseen real-world datasets when compared with standard benchmark clustering techniques. Our meta clustering model works well even for small datasets where the usual deep learning models tend to perform worse.

Learning · MoDELS · Class · Generative model · Wasserstein GAN

September 10, 2019

A Meta-Learning Framework for Generalized Zero-Shot Learning

Vinay Kumar Verma,Dhanajit Brahma,Piyush Rai

from arxiv, Under Submission

Learning to classify unseen class samples at test time is popularly referred to as zero-shot learning (ZSL). If test samples can be from training (seen) as well as unseen classes, it is a more challenging problem due to the existence of strong bias towards seen classes. This problem is generally known as generalized zero-shot learning (GZSL). Thanks to the recent advances in generative models such as VAEs and GANs, sample synthesis based approaches have gained considerable attention for solving this problem. These approaches are able to handle the problem of class bias by synthesizing unseen class samples. However, these ZSL/GZSL models suffer due to the following key limitations: (i) Their training stage learns a class-conditioned generator using only seen class data and the training stage does not explicitly learn to generate the unseen class samples; (ii) They do not learn a generic optimal parameter which can easily generalize for both seen and unseen class generation; and (iii) If we only have access to a very few samples per seen class, these models tend to perform poorly. In this paper, we propose a meta-learning based generative model that naturally handles these limitations. The proposed model is based on integrating model-agnostic meta learning with a Wasserstein GAN (WGAN) to handle (i) and (iii), and uses a novel task distribution to handle (ii). Our proposed model yields significant improvements on standard ZSL as well as more challenging GZSL setting. In ZSL setting, our model yields 4.5%, 6.0%, 9.8%, and 27.9% relative improvements over the current state-of-the-art on CUB, AWA1, AWA2, and aPY datasets, respectively.

Learning · Federated learning · Identifiable · Ensemble learning · AUC

March 5, 2019

One-Shot Federated Learning

Neel Guha,Ameet Talwalkar,Virginia Smith

from arxiv, 5 pages, 3 figures, 1 table. 2nd Workshop on Machine Learning on the Phone and other Consumer Devices, NeurIPS 2018

We present one-shot federated learning, where a central server learns a global model over a network of federated devices in a single round of communication. Our approach - drawing on ensemble learning and knowledge aggregation - achieves an average relative gain of 51.5% in AUC over local baselines and comes within 90.1% of the (unattainable) global ideal. We discuss these methods and identify several promising directions of future work.
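A minimal sketch of the single-round idea, under loudly stated assumptions: the data are made-up 1-D points, each device fits a nearest-centroid classifier, and the server combines the uploaded models by majority vote. None of these choices come from the paper, which only says it draws on ensemble learning and knowledge aggregation; all function names are hypothetical.

```python
from statistics import mean


def train_local(data):
    """Fit a 1-D nearest-centroid classifier on one device's (x, label) pairs."""
    c0 = mean(x for x, y in data if y == 0)
    c1 = mean(x for x, y in data if y == 1)
    return (c0, c1)


def predict_local(model, x):
    """Predict the label whose centroid is closer to x."""
    c0, c1 = model
    return 1 if abs(x - c1) < abs(x - c0) else 0


def predict_ensemble(models, x):
    """Server-side majority vote over all uploaded local models."""
    votes = sum(predict_local(m, x) for m in models)
    return 1 if 2 * votes > len(models) else 0


# Hypothetical local datasets, one per device.
devices = [
    [(0.0, 0), (1.0, 0), (4.0, 1), (5.0, 1)],
    [(0.5, 0), (1.5, 0), (6.0, 1), (7.0, 1)],
    [(-1.0, 0), (0.0, 0), (3.0, 1), (4.0, 1)],
]

# The only communication: each device uploads its model once.
models = [train_local(d) for d in devices]
```

At the point x = 2.0 the third device's model disagrees with the other two, and the vote settles the prediction, which is the kind of correction an ensemble can provide without further rounds.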

Learning · Few-shot learning · Networking · Training instance · ONCE

December 25, 2018

Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Yanbin Liu,Juho Lee,Minseop Park,Saehoon Kim,Eunho Yang,Sungju Hwang,Yi Yang

from arxiv, Accepted in ICLR 2019; code available at //github.com/csyanbin/TPN

The goal of few-shot learning is to learn a classifier that generalizes well even when trained with a limited number of training instances per class. The recently introduced meta-learning approaches tackle this problem by learning a generic classifier across a large number of multiclass classification tasks and generalizing the model to a new task. Yet, even with such meta-learning, the low-data problem in the novel classification task still remains. In this paper, we propose Transductive Propagation Network (TPN), a novel meta-learning framework for transductive inference that classifies the entire test set at once to alleviate the low-data problem. Specifically, we propose to learn to propagate labels from labeled instances to unlabeled test instances, by learning a graph construction module that exploits the manifold structure in the data. TPN jointly learns both the parameters of feature embedding and the graph construction in an end-to-end manner. We validate TPN on multiple benchmark datasets, on which it largely outperforms existing few-shot learning approaches and achieves the state-of-the-art results.
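Label propagation over a graph can be sketched as follows. This is classic iterative propagation on a fixed, hand-written similarity graph; in TPN the graph construction is learned, which this sketch does not attempt. The function name, the mixing weight `alpha`, the iteration count, and the example graph are all hypothetical.

```python
import math


def propagate_labels(W, Y, alpha=0.5, n_iter=50):
    """Iterate F <- alpha * S @ F + (1 - alpha) * Y on a fixed graph.

    W: symmetric non-negative similarity matrix (list of lists).
    Y: one-hot rows for labeled nodes, all-zero rows for unlabeled nodes.
    """
    n, n_classes = len(W), len(Y[0])
    deg = [sum(row) for row in W]
    # Symmetrically normalised similarities: S = D^(-1/2) W D^(-1/2).
    S = [[W[i][j] / math.sqrt(deg[i] * deg[j]) if W[i][j] else 0.0
          for j in range(n)] for i in range(n)]
    F = [row[:] for row in Y]
    for _ in range(n_iter):
        F = [[alpha * sum(S[i][k] * F[k][c] for k in range(n))
              + (1 - alpha) * Y[i][c]
              for c in range(n_classes)] for i in range(n)]
    return F


# Chain 0 - 1 - 2 - 3 with a weak link between nodes 1 and 2.
W = [[0.0, 1.0, 0.0, 0.0],
     [1.0, 0.0, 0.2, 0.0],
     [0.0, 0.2, 0.0, 1.0],
     [0.0, 0.0, 1.0, 0.0]]
# Node 0 is labeled class 0, node 3 is labeled class 1, nodes 1 and 2 are unlabeled.
Y = [[1.0, 0.0], [0.0, 0.0], [0.0, 0.0], [0.0, 1.0]]

scores = propagate_labels(W, Y)
predicted = [row.index(max(row)) for row in scores]
```

All unlabeled nodes are scored together in one pass, which mirrors the transductive setting described above: each unlabeled node takes the label of the labeled node it is most strongly connected to.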