Topological data analysis (TDA) is an expanding field that leverages principles and tools from algebraic topology to quantify structural features of data sets or transform them into more manageable forms. As its theoretical foundations have been developed, TDA has shown promise in extracting useful information from high-dimensional, noisy, and complex data such as those used in biomedicine. To improve efficiency, these techniques may employ landmark samplers. The heuristic maxmin procedure obtains a roughly even distribution of sample points by implicitly constructing a cover comprising sets of uniform radius. However, issues arise with data that vary in density or include points with multiplicities, as are common in biomedicine. We propose an analogous procedure, "lastfirst," based on ranked distances, which implies a cover comprising sets of uniform cardinality. We first rigorously define the procedure and prove that it obtains landmarks with desired properties. We then perform benchmark tests and compare its performance to that of maxmin on feature detection and class prediction tasks involving simulated and real-world biomedical data. Lastfirst is more general than maxmin in that it can be applied to any data on which arbitrary (and not necessarily symmetric) pairwise distances can be computed. Lastfirst is more computationally costly, but our implementation scales at the same rate as maxmin. We find that lastfirst achieves comparable performance on prediction tasks and outperforms maxmin on homology detection tasks. Where the numerical values of similarity measures are not meaningful, as in many biomedical contexts, lastfirst sampling may also improve interpretability.
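The maxmin procedure mentioned in the abstract can be sketched as a greedy farthest-point selection. The following is a minimal illustration, not the authors' implementation: the function name `maxmin_landmarks`, the `seed_index` parameter, the Euclidean metric, and the first-index tie-breaking are all assumptions made for this example. It also returns the radius of the implied cover, that is, the largest distance from any point to its nearest landmark.

```python
import math


def maxmin_landmarks(points, k, seed_index=0):
    """Greedy maxmin (farthest-point) landmark selection.

    Starting from points[seed_index], repeatedly add the point whose
    distance to its nearest already-chosen landmark is largest.
    Returns the landmark indices and the resulting cover radius.
    """
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and len(points)")
    landmarks = [seed_index]
    # min_dist[i] is the distance from point i to its nearest landmark so far.
    min_dist = [math.dist(p, points[seed_index]) for p in points]
    while len(landmarks) < k:
        # Ties are broken by lowest index (an assumption of this sketch).
        nxt = max(range(len(points)), key=lambda i: min_dist[i])
        landmarks.append(nxt)
        for i, p in enumerate(points):
            min_dist[i] = min(min_dist[i], math.dist(p, points[nxt]))
    return landmarks, max(min_dist)


points = [(0, 0), (1, 0), (10, 0), (11, 0), (5, 0)]
landmarks, radius = maxmin_landmarks(points, k=3)
print(landmarks, radius)
```

Because every ball in the implied cover has the same radius, the number of points each ball contains depends on local density. That is the behavior the abstract contrasts with lastfirst's cover of uniform cardinality.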

Related content

Learning · Federated learning · MoDELS · Extensibility · Image classification

March 9, 2023

Faster Adaptive Federated Learning

Xidong Wu,Feihu Huang,Zhengmian Hu,Heng Huang

Federated learning has attracted increasing attention with the emergence of distributed data. While extensive federated learning algorithms have been proposed for the non-convex distributed problem, federated learning in practice still faces numerous challenges, such as the large number of training iterations needed to converge as the sizes of models and datasets keep increasing, and the lack of adaptivity in SGD-based model updates. Meanwhile, the study of adaptive methods in federated learning is scarce, and existing works either lack a complete theoretical convergence guarantee or have high sample complexity. In this paper, we propose an efficient adaptive algorithm (i.e., FAFED) based on the momentum-based variance-reduced technique in cross-silo FL. We first explore how to design the adaptive algorithm in the FL setting. By providing a counter-example, we prove that a simple combination of FL and adaptive methods could lead to divergence. More importantly, we provide a convergence analysis for our method and prove that our algorithm is the first adaptive FL algorithm to reach the best-known $O(\epsilon^{-3})$ sample complexity and $O(\epsilon^{-2})$ communication rounds to find an $\epsilon$-stationary point without large batches. The experimental results on the language modeling task and image classification task with heterogeneous data demonstrate the efficiency of our algorithms.

Belief propagation · Factored · Learning · Networking · Neural Networks

March 8, 2023

DNBP: Differentiable Nonparametric Belief Propagation

Anthony Opipari,Jana Pavlasek,Chao Chen,Shoutian Wang,Karthik Desingh,Odest Chadwicke Jenkins

from arxiv, arXiv admin note: text overlap with arXiv:2101.05948

We present a differentiable approach to learn the probabilistic factors used for inference by a nonparametric belief propagation algorithm. Existing nonparametric belief propagation methods rely on domain-specific features encoded in the probabilistic factors of a graphical model. In this work, we replace each crafted factor with a differentiable neural network enabling the factors to be learned using an efficient optimization routine from labeled data. By combining differentiable neural networks with an efficient belief propagation algorithm, our method learns to maintain a set of marginal posterior samples using end-to-end training. We evaluate our differentiable nonparametric belief propagation (DNBP) method on a set of articulated pose tracking tasks and compare performance with learned baselines. Results from these experiments demonstrate the effectiveness of using learned factors for tracking and suggest the practical advantage over hand-crafted approaches. The project webpage is available at: //progress.eecs.umich.edu/projects/dnbp/ .

Weight · Identical · Dual problem · Performer · Precision/accuracy

March 8, 2023

An MP-DWR method for $h$-adaptive finite element methods

Chengyu Liu,Guanghui Hu

In a dual weighted residual method based on the finite element framework, the Galerkin orthogonality is an issue that prevents solving the dual equation in the same space as the one for the primal equation. In the literature, there have been two popular approaches to constructing a new space for the dual problem, i.e., refining mesh grids ($h$-approach) and raising the order of approximate polynomials ($p$-approach). In this paper, a novel approach is proposed for the purpose based on the multiple-precision technique, i.e., the construction of the new finite element space is based on the same configuration as the one for the primal equation, except for the precision in calculations. The feasibility of such a new approach is discussed in detail in the paper. In numerical experiments, the proposed approach can be realized conveniently with C++ templates. Moreover, the new approach shows remarkable improvements in both efficiency and storage compared with the $h$-approach and the $p$-approach. It is worth mentioning that the performance of our approach is comparable with the one through a higher order interpolation ($i$-approach) in the literature. The combination of these two approaches is believed to further enhance the efficiency of the dual weighted residual method.

Decision stump · Optimizer · Feature selection · Analysis · Sparse

March 8, 2023

Optimal Sparse Recovery with Decision Stumps

Kiarash Banihashem,MohammadTaghi Hajiaghayi,Max Springer

from arxiv, Accepted to AAAI 2023

Decision trees are widely used for their low computational cost, good predictive performance, and ability to assess the importance of features. Though often used in practice for feature selection, the theoretical guarantees of these methods are not well understood. We here obtain a tight finite sample bound for the feature selection problem in linear regression using single-depth decision trees. We examine the statistical properties of these "decision stumps" for the recovery of the $s$ active features from $p$ total features, where $s \ll p$. Our analysis provides tight sample performance guarantees on high-dimensional sparse systems which align with the finite sample bound of $O(s \log p)$ as obtained by Lasso, improving upon previous bounds for both the median and optimal splitting criteria. Our results extend to the non-linear regime as well as arbitrary sub-Gaussian distributions, demonstrating that tree based methods attain strong feature selection properties under a wide variety of settings and further shedding light on the success of these methods in practice. As a byproduct of our analysis, we show that we can provably guarantee recovery even when the number of active features $s$ is unknown. We further validate our theoretical results and proof methodology using computational experiments.
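To make the idea of feature selection with decision stumps concrete, here is a small sketch under stated assumptions. It splits each feature at its median (one of the two criteria named in the abstract) and scores the feature by the reduction in squared error of the response; the scoring rule, the helper names `sse`, `stump_score`, and `rank_features`, and the toy data are illustrative choices, not the paper's exact procedure.

```python
from statistics import median


def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values)


def stump_score(column, y):
    """Reduction in squared error of y from one split at the column's median."""
    threshold = median(column)
    left = [yi for xi, yi in zip(column, y) if xi <= threshold]
    right = [yi for xi, yi in zip(column, y) if xi > threshold]
    return sse(y) - sse(left) - sse(right)


def rank_features(X, y):
    """Return feature indices ordered from highest to lowest stump score."""
    p = len(X[0])
    scores = [stump_score([row[j] for row in X], y) for j in range(p)]
    return sorted(range(p), key=lambda j: scores[j], reverse=True)


# Toy data: y depends only on feature 0; feature 2 is constant.
X = [(1, 5, 7), (2, 1, 7), (3, 4, 7), (4, 2, 7)]
y = [2, 4, 6, 8]
ranking = rank_features(X, y)
print(ranking)
```

Selecting the top-ranked indices then plays the role of recovering the active features; how many to keep, and with what guarantee, is what the paper's analysis addresses.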

Processing (programming language) · Networking · Tensor · Neural Networks · Deep learning framework

March 7, 2023

Data-Driven Target Localization Using Adaptive Radar Processing and Convolutional Neural Networks

Shyam Venkatasubramanian,Sandeep Gogineni,Bosung Kang,Ali Pezeshki,Muralidhar Rangaswamy,Vahid Tarokh

from arxiv, 34 pages, 22 figures. Submitted to IEEE Transactions on Aerospace and Electronic Systems

Facilitated by the recent emergence of radio frequency (RF) modeling and simulation tools purposed for adaptive radar processing applications, data-driven approaches to classical problems in radar have rapidly grown in popularity over the past decade. Despite this surge, limited focus has been directed toward the theoretical foundations of these data-driven approaches. In this regard, using adaptive radar processing techniques, we propose a data-driven approach in this work to address the classical problem of radar target localization post adaptive radar detection. To give context to the performance of this data-driven approach, we first analyze the asymptotic breakdown signal-to-clutter-plus-noise ratio (SCNR) threshold of the normalized adaptive matched filter (NAMF) test statistic within the context of radar target localization, and augment this analysis through our proposed deep learning framework for target location estimation. In this procedure, we generate comprehensive datasets by randomly placing targets of variable strengths in predetermined constrained areas using RFView, a site-specific, digital twin, RF modeling and simulation tool. For each radar return from these predefined constrained areas, we generate heatmap tensors in range, azimuth, and elevation of the NAMF test statistic, and of the output power of a generalized sidelobe canceller (GSC). Using our deep learning framework, we estimate target locations from these heatmap tensors to demonstrate the feasibility of and significant improvements provided by our data-driven approach across matched and mismatched settings.

BASIC · Square matrix · Rank · Full · Scenario

March 7, 2023

Adapted AZNN Methods for Time-Varying and Static Matrix Problems

Frank Uhlig

from arxiv, 15 pages, 12 figures with 20 graphs

We present adapted Zhang Neural Networks (AZNN) in which the parameter settings for the exponential decay constant $\eta$ and the length of the start-up phase of basic ZNN are adapted to the problem at hand. Specifically we study experiments with AZNN for time-varying square matrix factorizations as a product of time-varying symmetric matrices and for the time-varying matrix square roots problem. Differing from generally used small $\eta$ values and minimal start-up length phases in ZNN, we adapt the basic ZNN method to work with large or even gigantic $\eta$ settings and arbitrary length start-ups using Euler's low accuracy finite difference formula. These adaptations improve the speed of AZNN's convergence and lower its solution error bounds for our chosen problems significantly to near machine constant or even lower levels. Parameter-varying AZNN also allows us to find full rank symmetrizers of static matrices reliably, for example for the Kahan and Frank matrices and for matrices with highly ill-conditioned eigenvalues and complicated Jordan structures of dimensions from $n = 2$ on up. This helps in cases where full rank static matrix symmetrizers have never been successfully computed before.

GROUP · MoDELS · Group Lasso · Estimation/estimator · Analysis

March 6, 2023

Bayesian Adaptive Selection of Variables for Function-on-Scalar Regression Models

Pedro Henrique T. O. Sousa,Camila P. E. de Souza,Ronaldo Dias

Considering the field of functional data analysis, we developed a new Bayesian method for variable selection in function-on-scalar regression (FOSR). Our approach uses latent variables, allowing an adaptive selection since it can determine the number of variables and which ones should be selected for a function-on-scalar regression model. Simulation studies show the proposed method's main properties, such as its accuracy in estimating the coefficients and high capacity to select variables correctly. Furthermore, we conducted comparative studies with the main competing methods, such as the BGLSS method as well as the group LASSO, the group MCP and the group SCAD. We also used a COVID-19 dataset and some socioeconomic data from Brazil for real data application. In short, the proposed Bayesian variable selection model is extremely competitive, showing significant predictive and selective quality.

Cluster · Target domain · Unlabeled · Performer · Annotation

April 19, 2021

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Jichang Li,Guanbin Li,Yemin Shi,Yizhou Yu

from arxiv, To appear in CVPR2021

In semi-supervised domain adaptation, a few labeled samples per class in the target domain guide features of the remaining target samples to aggregate around them. However, the trained model cannot produce a highly discriminative feature representation for the target domain because the training data is dominated by labeled samples from the source domain. This could lead to disconnection between the labeled and unlabeled target samples as well as misalignment between unlabeled target samples and the source domain. In this paper, we propose a novel approach called Cross-domain Adaptive Clustering to address this problem. To achieve both inter-domain and intra-domain adaptation, we first introduce an adversarial adaptive clustering loss to group features of unlabeled target data into clusters and perform cluster-wise feature alignment across the source and target domains. We further apply pseudo labeling to unlabeled samples in the target domain and retain pseudo-labels with high confidence. Pseudo labeling expands the number of "labeled" samples in each class in the target domain, and thus produces a more robust and powerful cluster core for each class to facilitate adversarial learning. Extensive experiments on benchmark datasets, including DomainNet, Office-Home and Office, demonstrate that our proposed approach achieves the state-of-the-art performance in semi-supervised domain adaptation.

Generalization theory · INFORMS · Performer · Test sample · state-of-the-art

March 29, 2021

Adaptive Methods for Real-World Domain Generalization

Abhimanyu Dubey,Vignesh Ramanathan,Alex Pentland,Dhruv Mahajan

from arxiv, To appear as an oral presentation in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Invariant approaches have been remarkably successful in tackling the problem of domain generalization, where the objective is to perform inference on data distributions different from those used in training. In our work, we investigate whether it is possible to leverage domain information from the unseen test samples themselves. We propose a domain-adaptive approach consisting of two steps: a) we first learn a discriminative domain embedding from unsupervised training examples, and b) use this domain embedding as supplementary information to build a domain-adaptive model, that takes both the input as well as its domain into account while making predictions. For unseen domains, our method simply uses few unlabelled test examples to construct the domain embedding. This enables adaptive classification on any unseen domain. Our approach achieves state-of-the-art performance on various domain generalization benchmarks. In addition, we introduce the first real-world, large-scale domain generalization benchmark, Geo-YFCC, containing 1.1M samples over 40 training, 7 validation, and 15 test domains, orders of magnitude larger than prior work. We show that the existing approaches either do not scale to this dataset or underperform compared to the simple baseline of training a model on the union of data from all training domains. In contrast, our approach achieves a significant improvement.

Graph · Neural Networks · state-of-the-art · SimPLe · Vectorization

October 1, 2018

How Powerful are Graph Neural Networks?

Keyulu Xu,Weihua Hu,Jure Leskovec,Stefanie Jegelka

Graph Neural Networks (GNNs) for representation learning of graphs broadly follow a neighborhood aggregation framework, where the representation vector of a node is computed by recursively aggregating and transforming feature vectors of its neighboring nodes. Many GNN variants have been proposed and have achieved state-of-the-art results on both node and graph classification tasks. However, despite GNNs revolutionizing graph representation learning, there is limited understanding of their representational properties and limitations. Here, we present a theoretical framework for analyzing the expressive power of GNNs in capturing different graph structures. Our results characterize the discriminative power of popular GNN variants, such as Graph Convolutional Networks and GraphSAGE, and show that they cannot learn to distinguish certain simple graph structures. We then develop a simple architecture that is provably the most expressive among the class of GNNs and is as powerful as the Weisfeiler-Lehman graph isomorphism test. We empirically validate our theoretical findings on a number of graph classification benchmarks, and demonstrate that our model achieves state-of-the-art performance.
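The neighborhood aggregation framework described above can be illustrated with a toy example. This is only a sketch, not the architecture from the paper: the `aggregate` helper, the star graphs, and the scalar features are hypothetical, and the transformation step is omitted. It shows one way the choice of aggregator can lose structural information: with identical neighbor features, a mean over neighbors gives the same value at the center of a 2-leaf star and a 3-leaf star, while a sum does not.

```python
def aggregate(adjacency, features, combine):
    """Apply one round of neighborhood aggregation to every node."""
    return {
        node: combine([features[nbr] for nbr in neighbors])
        for node, neighbors in adjacency.items()
    }


def mean(values):
    return sum(values) / len(values)


# Two star graphs: center "c" with two leaves versus three leaves.
star2 = {"c": ["a", "b"], "a": ["c"], "b": ["c"]}
star3 = {"c": ["a", "b", "d"], "a": ["c"], "b": ["c"], "d": ["c"]}
ones2 = {node: 1.0 for node in star2}
ones3 = {node: 1.0 for node in star3}

# Aggregated value at the center of each graph under each aggregator.
mean_center = (aggregate(star2, ones2, mean)["c"], aggregate(star3, ones3, mean)["c"])
sum_center = (aggregate(star2, ones2, sum)["c"], aggregate(star3, ones3, sum)["c"])
print(mean_center, sum_center)
```

Here the mean aggregator yields identical center values for the two graphs, whereas the sum aggregator separates them.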