
Federated Averaging (FedAvg) and its variants are the most popular optimization algorithms in federated learning (FL). Previous convergence analyses of FedAvg either assume full client participation or partial client participation where the clients can be uniformly sampled. However, in practical cross-device FL systems, only a subset of clients that satisfy local criteria such as battery status, network connectivity, and maximum participation frequency requirements (to ensure privacy) are available for training at a given time. As a result, client availability follows a natural cyclic pattern. We provide (to our knowledge) the first theoretical framework to analyze the convergence of FedAvg with cyclic client participation with several different client optimizers such as GD, SGD, and shuffled SGD. Our analysis discovers that cyclic client participation can achieve a faster asymptotic convergence rate than vanilla FedAvg with uniform client participation under suitable conditions, providing valuable insights into the design of client sampling protocols.
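As a rough illustration of the participation pattern analyzed here, the sketch below partitions clients into groups that become available in a fixed cyclic order and runs plain FedAvg within each round. It is a minimal sketch under our own assumptions, not the paper's algorithm: the group construction, the `local_update` callable, and the list-of-floats model representation are all illustrative choices.

```python
import random

def fedavg_cyclic(global_model, clients, num_groups, rounds, sample_size, local_update):
    """Minimal FedAvg with cyclic client participation (illustrative sketch).

    Clients are partitioned into `num_groups` groups; in round t only group
    (t mod num_groups) is available, and `sample_size` of its clients are
    sampled to run local training.
    """
    groups = [clients[i::num_groups] for i in range(num_groups)]
    for t in range(rounds):
        active = groups[t % num_groups]          # cyclic availability pattern
        sampled = random.sample(active, min(sample_size, len(active)))
        local_models = [local_update(global_model, c) for c in sampled]
        # FedAvg aggregation: elementwise average of the returned parameters
        global_model = [sum(ws) / len(ws) for ws in zip(*local_models)]
    return global_model
```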

Related content

In this paper we carry out a numerical investigation of forced convection heat transfer from a heated elliptical cylinder in a uniform free stream with angle of inclination $\theta^{\circ}$. Numerical simulations were carried out for $10 \leq Re \leq 120$, $0^{\circ} \leq \theta \leq 180^{\circ}$, and $Pr = 0.71$. Results are reported for both the steady and unsteady regimes in terms of streamlines, vorticity contours, isotherms, drag and lift coefficients, Strouhal number, and Nusselt number. In the process, we also propose a novel method of computing the Nusselt number by merely gathering flow information along the normal to the ellipse boundary. The critical $Re$ at which the flow becomes unsteady, $Re_c$, is reported for all values of $\theta$ considered and is found to be the same for $\theta$ and $180^\circ - \theta$ for $0^\circ \leq \theta \leq 90^\circ$. In the steady regime, the $Re$ at which flow separation occurs progressively decreases as $\theta$ increases. The surface-averaged Nusselt number ($Nu_{\text{av}}$) increases with $Re$, whereas the drag force experienced by the cylinder decreases with $Re$. The transient regime is characterized by periodic vortex shedding, which is quantified by the Strouhal number ($St$). The vortex shedding frequency increases with $Re$ and decreases with $\theta$ for a given $Re$. $Nu_{\text{av}}$ also exhibits a time-varying oscillatory behaviour with a time period that is half that of the vortex shedding. The amplitude of oscillation of $Nu_{\text{av}}$ increases with $\theta$.
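For reference, the two reported quantities under their standard definitions (the symbols $f$, $D$, $U_\infty$, $\theta^*$, and $n$ below are our notation, not necessarily the paper's):

\[
St = \frac{f D}{U_\infty}, \qquad
Nu_{\text{av}} = \frac{1}{S}\oint_S Nu \, dS, \qquad
Nu = -\left.\frac{\partial \theta^*}{\partial n}\right|_{\text{surface}},
\]

where $f$ is the vortex-shedding frequency, $D$ a characteristic length of the ellipse, $U_\infty$ the free-stream speed, $\theta^*$ the nondimensional temperature, and $n$ the outward normal to the boundary; the paper's proposed method evaluates this normal derivative directly from flow information gathered along the normal.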

This paper proposes a client selection (CS) method to tackle the communication bottleneck of federated learning (FL) while concurrently coping with FL's data heterogeneity issue. Specifically, we first analyze the effect of CS in FL and show that FL training can be accelerated by adequately choosing participants to diversify the training dataset in each round of training. Based on this, we leverage data profiling and determinantal point process (DPP) sampling techniques to develop an algorithm termed Federated Learning with DPP-based Participant Selection (FL-DP$^3$S). This algorithm effectively diversifies the participants' datasets in each round of training while preserving their data privacy. We conduct extensive experiments to examine the efficacy of our proposed method. The results show that our scheme attains a faster convergence rate, as well as a smaller communication overhead than several baselines.
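To make the selection mechanism concrete, here is a minimal sketch of diversity-driven client selection via a greedy MAP approximation of a DPP. It is not the paper's FL-DP$^3$S procedure: the greedy approximation, the label-histogram profiles, and the kernel construction are our assumptions for illustration.

```python
import numpy as np

def greedy_dpp_selection(kernel, k):
    """Greedy MAP approximation of DPP sampling (illustrative): repeatedly
    add the client that most increases the log-determinant of the selected
    kernel submatrix, which favors mutually dissimilar data profiles."""
    n = kernel.shape[0]
    selected = []
    for _ in range(k):
        best, best_gain = None, -np.inf
        for i in range(n):
            if i in selected:
                continue
            idx = selected + [i]
            sign, logdet = np.linalg.slogdet(kernel[np.ix_(idx, idx)])
            gain = logdet if sign > 0 else -np.inf
            if gain > best_gain:
                best, best_gain = i, gain
        selected.append(best)
    return selected

# Example: kernel built from hypothetical client data profiles
# (e.g., per-client label histograms)
profiles = np.random.rand(10, 5)
profiles /= profiles.sum(axis=1, keepdims=True)
K = profiles @ profiles.T + 1e-6 * np.eye(10)   # PSD similarity kernel
print(greedy_dpp_selection(K, 3))
```

The determinant of a similarity-kernel submatrix shrinks when the selected profiles are nearly collinear, so maximizing it pushes the selection toward diverse client datasets.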

For real $\alpha\in [0,1)$ and a hypergraph $G$, the $\alpha$-spectral radius of $G$ is the largest eigenvalue of the matrix $A_{\alpha}(G)=\alpha D(G)+(1-\alpha)A(G)$, where $A(G)$ is the adjacency matrix of $G$, a symmetric matrix with zero diagonal such that for distinct vertices $u,v$ of $G$, the $(u,v)$-entry of $A(G)$ is exactly the number of edges containing both $u$ and $v$, and $D(G)$ is the diagonal matrix of row sums of $A(G)$. We study the $\alpha$-spectral radius of a hypergraph that is uniform or not necessarily uniform. We propose some local grafting operations that increase or decrease the $\alpha$-spectral radius of a hypergraph. We determine the unique hypergraphs with maximum $\alpha$-spectral radius among $k$-uniform hypertrees, among $k$-uniform unicyclic hypergraphs, and among $k$-uniform hypergraphs with a fixed number of pendant edges. We also determine the unique hypertrees with maximum $\alpha$-spectral radius among hypertrees with a given number of vertices and edges, the unique hypertrees with the first three largest (two smallest, respectively) $\alpha$-spectral radii among hypertrees with a given number of vertices, the unique hypertrees with minimum $\alpha$-spectral radius among the hypertrees that are not $2$-uniform, the unique hypergraphs with the first two largest (smallest, respectively) $\alpha$-spectral radii among unicyclic hypergraphs with a given number of vertices, and the unique hypergraphs with maximum $\alpha$-spectral radius among hypergraphs with a fixed number of pendant edges.
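For readers who want to experiment, the following small sketch assembles $A(G)$, $D(G)$, and $A_{\alpha}(G)$ directly from the definitions above and returns the largest eigenvalue; the edge-list representation of the hypergraph is our choice.

```python
import numpy as np
from itertools import combinations

def alpha_spectral_radius(n, edges, alpha):
    """Largest eigenvalue of A_alpha(G) = alpha*D(G) + (1-alpha)*A(G) for a
    hypergraph G on vertices 0..n-1; `edges` is a list of vertex sets."""
    A = np.zeros((n, n))
    for e in edges:
        for u, v in combinations(sorted(e), 2):
            A[u, v] += 1          # number of edges containing both u and v
            A[v, u] += 1
    D = np.diag(A.sum(axis=1))    # diagonal matrix of row sums of A
    A_alpha = alpha * D + (1 - alpha) * A
    return np.linalg.eigvalsh(A_alpha).max()

# Example: a 3-uniform hypertree with two hyperedges sharing one vertex
print(alpha_spectral_radius(5, [{0, 1, 2}, {2, 3, 4}], alpha=0.25))
```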

In this study, we address the emerging field of Streaming Federated Learning (SFL) and propose local cache update rules to manage dynamic data distributions and limited cache capacity. Traditional federated learning relies on fixed data sets, whereas in SFL, data is streamed, and its distribution changes over time, leading to discrepancies between the local training dataset and long-term distribution. To mitigate this problem, we propose three local cache update rules - First-In-First-Out (FIFO), Static Ratio Selective Replacement (SRSR), and Dynamic Ratio Selective Replacement (DRSR) - that update the local cache of each client while considering the limited cache capacity. Furthermore, we derive a convergence bound for our proposed SFL algorithm as a function of the distribution discrepancy between the long-term data distribution and the client's local training dataset. We then evaluate our proposed algorithm on two datasets: a network traffic classification dataset and an image classification dataset. Our experimental results demonstrate that our proposed local cache update rules significantly reduce the distribution discrepancy and outperform the baseline methods. Our study advances the field of SFL and provides practical cache management solutions in federated learning.
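Below is a minimal sketch of the simplest of the three rules, FIFO, under the stated cache-capacity constraint; the class interface is our assumption. SRSR and DRSR would additionally replace cached samples selectively according to a static or dynamic ratio rather than purely by arrival order.

```python
from collections import deque

class FIFOCache:
    """First-In-First-Out local cache update rule (illustrative sketch).
    When the cache is full, the oldest streamed sample is evicted to make
    room for the newly arrived one."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.buffer = deque()

    def update(self, sample):
        if len(self.buffer) >= self.capacity:
            self.buffer.popleft()   # evict the oldest sample
        self.buffer.append(sample)

    def training_set(self):
        # Snapshot used as the client's local training dataset this round
        return list(self.buffer)
```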

Deep learning is effective in diagnosing COVID-19 but requires a large amount of data to be trained effectively. Due to data and privacy regulations, hospitals generally have no access to data from other hospitals. Federated learning (FL) has been used to solve this problem, as it uses a distributed setting to train models across hospitals in a privacy-preserving manner. Deploying FL is not always feasible, however, as it requires high computation and network communication resources. This paper evaluates the performance and resource efficiency of five FL algorithms for COVID-19 detection. A decentralized setting with CNN networks is set up, and the performance of the FL algorithms is compared with a centralized environment. We examined the algorithms with varying numbers of participants, federated rounds, and selection algorithms. Our results show that cyclic weight transfer can achieve better overall performance, and that results improve with fewer participating hospitals. Our results demonstrate good performance for detecting COVID-19 patients and may be useful in deploying FL algorithms for COVID-19 detection and medical image analysis in general.
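For concreteness, here is a minimal sketch of cyclic weight transfer as commonly described in the decentralized medical-imaging literature: the model is handed from hospital to hospital in a fixed order and fine-tuned locally at each stop, with no server-side averaging. The visiting order, the `train_locally` callable, and the cycle count are our assumptions, not details from this paper.

```python
def cyclic_weight_transfer(model, hospitals, cycles, train_locally):
    """Illustrative cyclic weight transfer: pass the model around the ring
    of hospitals `cycles` times, fine-tuning on each local dataset."""
    for _ in range(cycles):
        for hospital in hospitals:
            model = train_locally(model, hospital)  # local fine-tuning step
    return model
```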

Federated Learning (FL) is a novel machine learning framework that enables multiple distributed devices to cooperatively train a shared model, scheduled by a central server, while keeping private data local. However, non-independent-and-identically-distributed (Non-IID) data samples and frequent communication among participants slow down the convergence rate and increase communication costs. To achieve fast convergence, we improve the local gradient descent step in the conventional local update rule by introducing the aggregated gradients at each local update epoch, and we propose an adaptive learning rate algorithm that further takes the deviation between local and global parameters into consideration at each iteration. The above strategy requires all clients' local parameters and gradients at each local iteration, which is challenging because there is no communication during local update epochs. Accordingly, we adopt a mean-field approach, introducing two mean-field terms to estimate the average local parameters and gradients respectively, so that clients need not exchange their private information with each other at each local update epoch. Numerical results show that our proposed framework is superior to state-of-the-art schemes in model accuracy and convergence rate on both IID and Non-IID datasets.
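A heavily simplified sketch of the flavor of this local rule, written for a scalar parameter: each local step mixes the client's own gradient with a mean-field estimate of the average gradient, and the step size shrinks as the local parameter drifts away from the global one. The 0.5 mixing weight, the $1/(1+\lambda|\cdot|)$ schedule, and all names are our assumptions, not the paper's exact update.

```python
def local_update(w_local, w_global, grad_fn, mean_grad, base_lr, lam, epochs):
    """Illustrative local rule: aggregated gradient via a mean-field term,
    adaptive learning rate via the local/global parameter deviation."""
    w = w_local
    for _ in range(epochs):
        g = grad_fn(w)
        g_agg = 0.5 * g + 0.5 * mean_grad        # aggregated gradient (assumed mix)
        deviation = abs(w - w_global)            # local/global deviation
        lr = base_lr / (1.0 + lam * deviation)   # adaptive learning rate (assumed form)
        w = w - lr * g_agg
    return w
```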

An abundance of information about cancer exists online, but categorizing and extracting useful information from it is difficult. Almost all research on healthcare data processing concerns formal clinical data, but there is valuable information in non-clinical data too. The present study combines methods from distributed computing, text retrieval, clustering, and classification into a coherent and computationally efficient system that can clarify cancer patient trajectories based on non-clinical, freely available information. We produce a fully functional prototype that can retrieve, cluster, and present information about cancer trajectories from non-clinical forum posts. We evaluate three clustering algorithms (MR-DBSCAN, DBSCAN, and HDBSCAN) and compare them in terms of Adjusted Rand Index and total run time as a function of the number of posts retrieved and the neighborhood radius. Clustering results show that the neighborhood radius has the most significant impact on clustering performance: small values split the data set accordingly, while high values produce a large number of possible partitions, making the search for the best partition time-consuming. With a properly estimated radius, MR-DBSCAN can cluster 50000 forum posts in 46.1 seconds, compared to DBSCAN (143.4 s) and HDBSCAN (282.3 s). We conducted an interview with the Danish Cancer Society and presented our software prototype. The organization sees potential in software that can democratize online information about cancer and foresees that such systems will be required in the future.
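As a small usage sketch of the clustering step (single-machine scikit-learn DBSCAN rather than the paper's distributed MR-DBSCAN), with toy stand-ins for retrieved forum posts:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import DBSCAN

# Toy stand-ins for retrieved forum posts (the real system retrieves these)
posts = [
    "chemotherapy side effects and fatigue after first round",
    "second chemotherapy round, fatigue and nausea",
    "radiation therapy scheduled after breast surgery",
    "recovering from breast surgery before radiation therapy",
]

X = TfidfVectorizer().fit_transform(posts)
# eps is the neighborhood radius the abstract identifies as the most
# sensitive parameter; cosine distance suits sparse text vectors
labels = DBSCAN(eps=0.8, min_samples=2, metric="cosine").fit_predict(X)
print(labels)   # cluster id per post, -1 marks noise
```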

Federated learning (FL) has been proposed to protect data privacy and virtually assemble the isolated data silos by cooperatively training models among organizations without breaching privacy and security. However, FL faces heterogeneity from various aspects, including data space, statistical, and system heterogeneity. For example, collaborative organizations without conflict of interest often come from different areas and have heterogeneous data from different feature spaces. Participants may also want to train heterogeneous personalized local models due to non-IID and imbalanced data distributions and various resource-constrained devices. Therefore, heterogeneous FL is proposed to address the problem of heterogeneity in FL. In this survey, we comprehensively investigate the domain of heterogeneous FL in terms of data space, statistical, system, and model heterogeneity. We first give an overview of FL, including its definition and categorization. Then, we propose a precise taxonomy of heterogeneous FL settings for each type of heterogeneity according to the problem setting and learning objective. We also investigate the transfer learning methodologies to tackle the heterogeneity in FL. We further present the applications of heterogeneous FL. Finally, we highlight the challenges and opportunities and envision promising future research directions toward new framework design and trustworthy approaches.

Federated learning is a new distributed machine learning framework, in which a set of heterogeneous clients collaboratively train a model without sharing training data. In this work, we consider a practical and ubiquitous issue in federated learning: intermittent client availability, where the set of eligible clients may change during the training process. Such an intermittent client availability model would significantly deteriorate the performance of the classical Federated Averaging algorithm (FedAvg for short). We propose a simple distributed non-convex optimization algorithm, called Federated Latest Averaging (FedLaAvg for short), which leverages the latest gradients of all clients, even when the clients are not available, to jointly update the global model in each iteration. Our theoretical analysis shows that FedLaAvg attains the convergence rate of $O(1/(N^{1/4} T^{1/2}))$, achieving a sublinear speedup with respect to the total number of clients. We implement and evaluate FedLaAvg with the CIFAR-10 dataset. The evaluation results demonstrate that FedLaAvg indeed reaches a sublinear speedup and achieves 4.23% higher test accuracy than FedAvg.
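Here is a minimal sketch of the latest-averaging idea: cache every client's most recent gradient and average over all of them at each iteration, so that temporarily unavailable clients still contribute their last report. Initializing missing gradients to zero and the `available`/`grad_fn` callables are our simplifications, not the paper's exact procedure.

```python
import numpy as np

def fedlaavg(w, clients, grad_fn, available, lr, T):
    """Illustrative FedLaAvg: the global update uses the average of all
    clients' latest cached gradients, refreshed only by available clients."""
    latest = {c: np.zeros_like(w) for c in clients}   # assumed zero init
    for t in range(T):
        for c in available(t):                # only eligible clients report
            latest[c] = grad_fn(c, w)         # refresh this client's gradient
        g_avg = sum(latest.values()) / len(clients)
        w = w - lr * g_avg
    return w
```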

With the rapid increase of large-scale, real-world datasets, it becomes critical to address the problem of long-tailed data distribution (i.e., a few classes account for most of the data, while most classes are under-represented). Existing solutions typically adopt class re-balancing strategies such as re-sampling and re-weighting based on the number of observations for each class. In this work, we argue that as the number of samples increases, the additional benefit of a newly added data point will diminish. We introduce a novel theoretical framework to measure data overlap by associating with each sample a small neighboring region rather than a single point. The effective number of samples is defined as the volume of samples and can be calculated by a simple formula $(1-\beta^{n})/(1-\beta)$, where $n$ is the number of samples and $\beta \in [0,1)$ is a hyperparameter. We design a re-weighting scheme that uses the effective number of samples for each class to re-balance the loss, thereby yielding a class-balanced loss. Comprehensive experiments are conducted on artificially induced long-tailed CIFAR datasets and large-scale datasets including ImageNet and iNaturalist. Our results show that when trained with the proposed class-balanced loss, the network is able to achieve significant performance gains on long-tailed datasets.
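The re-weighting scheme is easy to reproduce from the formula: per-class weights are the inverse effective numbers, commonly normalized so they sum to the number of classes (the normalization convention is our assumption; it is not stated in the abstract).

```python
import numpy as np

def class_balanced_weights(samples_per_class, beta=0.999):
    """Per-class weights from the effective number of samples
    E_n = (1 - beta**n) / (1 - beta); weights are the inverse effective
    numbers, normalized to sum to the number of classes."""
    n = np.asarray(samples_per_class, dtype=float)
    effective_num = (1.0 - np.power(beta, n)) / (1.0 - beta)
    weights = 1.0 / effective_num
    return weights * len(n) / weights.sum()

# Example: a long-tailed distribution of class counts
print(class_balanced_weights([5000, 500, 50, 5]))
```

Note the limiting behavior: as $\beta \to 0$ every class gets equal weight (no re-balancing), while as $\beta \to 1$ the weights approach inverse class frequency.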
