
Graph Neural Networks (GNNs) have emerged as an effective machine learning tool for multi-disciplinary tasks such as pharmaceutical molecule classification and chemical reaction prediction, because they can model non-Euclidean relationships between entities. Particle crushing, a significant topic in civil engineering, describes the breakage of granular materials caused by the rupture of bonds between particle fragments, as modeled in numerical simulations. This motivates us to characterize the mechanical behaviors of particle crushing through the connectivity of particle fragments with GNNs. However, no open-source large-scale particle crushing dataset is available for research, owing to the expensive cost of laboratory tests or numerical simulations. Therefore, we first generate a dataset with 45,000 numerical simulations and 900 particle types to facilitate research on machine learning for particle crushing. Second, we devise a hybrid framework based on GNNs that predicts particle crushing strength from a particle-fragment view, building on state-of-the-art GNNs. Finally, we compare our hybrid framework against traditional machine learning methods and a plain MLP to verify its effectiveness. The usefulness of different features is further discussed through gradient-attribution explanations of the predictions. Our data and code are released at //github.com/doujiang-zheng/GNN-For-Particle-Crushing.
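As an illustration of the fragment-graph formulation above, here is a minimal sketch (not the authors' released code) of graph-level strength regression: fragments are nodes, fragment bonds are edges, and a pooled graph embedding feeds a regression head. All shapes, names, and the message-passing scheme are illustrative assumptions.

```python
import torch
import torch.nn as nn

class FragmentGNN(nn.Module):
    def __init__(self, in_dim, hid_dim):
        super().__init__()
        self.msg = nn.Linear(in_dim, hid_dim)             # transform neighbor features
        self.upd = nn.Linear(in_dim + hid_dim, hid_dim)   # combine self + aggregated messages
        self.head = nn.Sequential(nn.ReLU(), nn.Linear(hid_dim, 1))

    def forward(self, x, adj):
        # x: (num_fragments, in_dim); adj: (num_fragments, num_fragments) 0/1 bond matrix
        agg = adj @ self.msg(x)                           # sum messages from bonded fragments
        h = torch.relu(self.upd(torch.cat([x, agg], dim=-1)))
        g = h.mean(dim=0)                                 # mean-pool to one graph embedding
        return self.head(g)                               # scalar crushing-strength estimate

x = torch.randn(12, 8)                                    # 12 fragments, 8 toy features each
adj = (torch.rand(12, 12) > 0.7).float()
print(FragmentGNN(8, 32)(x, adj))
```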

Related content

Machine Learning is an international forum for research on computational approaches to learning. The journal publishes articles reporting substantive results on a wide range of learning methods applied to a variety of learning problems. Featured papers describe research on problems and methods, applications research, and issues of research methodology. Papers making claims about learning problems or methods provide solid support via empirical studies, theoretical analysis, or comparison with psychological phenomena. Application papers show how learning methods can be applied to solve important application problems. Research methodology papers improve how machine learning research is conducted. All papers describe their supporting evidence in ways that other researchers can verify or replicate, detail the components of learning, and discuss assumptions regarding knowledge representation and the performance task.

Missing data can pose a challenge for machine learning (ML) modeling. To address this, current approaches are categorized into feature imputation and label prediction and are primarily focused on handling missing data to enhance ML performance. These approaches rely on the observed data to estimate the missing values and therefore encounter three main shortcomings in imputation: the need for different imputation methods for various missing data mechanisms, heavy dependence on assumptions about the data distribution, and the potential introduction of bias. This study proposes a Contrastive Learning (CL) framework to model observed data with missing values, where the ML model learns the similarity between an incomplete sample and its complete counterpart and its dissimilarity from other samples. Our proposed approach demonstrates the advantages of CL without requiring any imputation. To enhance interpretability, we introduce CIVis, a visual analytics system that incorporates interpretable techniques to visualize the learning process and diagnose the model status. Users can leverage their domain knowledge through interactive sampling to identify negative and positive pairs in CL. The output of CIVis is an optimized model that takes specified features and predicts downstream tasks. We provide two usage scenarios in regression and classification tasks and conduct quantitative experiments, expert interviews, and a qualitative user study to demonstrate the effectiveness of our approach. In short, this study offers a valuable contribution to addressing the challenges associated with ML modeling in the presence of missing data by providing a practical solution that achieves high predictive accuracy and model interpretability.
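A hedged sketch of the core idea as described: pull an incomplete sample toward its complete counterpart and push it away from other samples, with no imputation. The encoder, the zero-masking of unobserved entries, and the InfoNCE-style loss are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def masked_infonce(encoder, x, mask, temperature=0.1):
    # x: (batch, dim) complete samples; mask: (batch, dim) with 1 = observed
    z_full = F.normalize(encoder(x), dim=-1)            # complete view
    z_miss = F.normalize(encoder(x * mask), dim=-1)     # incomplete view (unobserved zeroed)
    logits = z_miss @ z_full.t() / temperature          # (batch, batch) similarities
    labels = torch.arange(x.size(0))                    # positive pair sits on the diagonal
    return F.cross_entropy(logits, labels)

encoder = torch.nn.Sequential(torch.nn.Linear(16, 32), torch.nn.ReLU(), torch.nn.Linear(32, 32))
x = torch.randn(8, 16)
mask = (torch.rand(8, 16) > 0.3).float()
print(masked_infonce(encoder, x, mask))
```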

Advances in passive acoustic monitoring and machine learning have led to the procurement of vast datasets for computational bioacoustic research. Nevertheless, data scarcity is still an issue for rare and underrepresented species. This study investigates how meta-information can improve zero-shot audio classification, utilising bird species as an example case study due to the availability of rich and diverse metadata. We investigate three different sources of metadata: textual bird sound descriptions encoded via (S)BERT, functional traits (AVONET), and bird life-history (BLH) characteristics. As audio features, we extract audio spectrogram transformer (AST) embeddings and project them to the dimension of the auxiliary information with a single linear layer. Then, we employ the dot product as the compatibility function and a standard zero-shot learning ranking hinge loss to determine the correct class. The best results are achieved by concatenating the AVONET and BLH features, attaining a mean F1-score of 0.233 over five different test sets with 8 to 10 classes.
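The projection/compatibility/hinge-loss pipeline described above is concrete enough to sketch. A minimal version under toy shapes: one linear layer maps audio embeddings into the metadata space, a dot product scores compatibility with each class, and a ranking hinge loss pushes the true class above the rest by a margin. Dimensions and the margin value are assumptions.

```python
import torch

def zsl_hinge_loss(audio_emb, proj, class_meta, y, margin=1.0):
    # audio_emb: (batch, d_audio); class_meta: (num_classes, d_meta); y: (batch,) labels
    scores = proj(audio_emb) @ class_meta.t()       # dot-product compatibility (batch, num_classes)
    true = scores.gather(1, y[:, None])             # score of the correct class
    viol = (margin + scores - true).clamp(min=0)    # hinge violation for every class
    viol.scatter_(1, y[:, None], 0.0)               # no penalty on the true class itself
    return viol.mean()

proj = torch.nn.Linear(768, 20)                     # e.g., AST dim -> 20-d trait vector
audio = torch.randn(4, 768)
meta = torch.randn(10, 20)                          # 10 classes with 20-d metadata each
print(zsl_hinge_loss(audio, proj, meta, torch.tensor([0, 3, 5, 9])))
```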

With the increasing penetration of machine learning applications in critical decision-making areas, calls for algorithmic fairness are more prominent. Although there have been various approaches to improving algorithmic fairness through learning with fairness constraints, their performance does not generalize well to the test set. A fair algorithm with promising performance and better generalizability is needed. This paper proposes a novel adaptive reweighing method to eliminate the impact of the distribution shifts between training and test data on model generalizability. Most previous reweighing methods propose to assign a unified weight to each (sub)group. In contrast, our method granularly models the distance from the sample predictions to the decision boundary. Our adaptive reweighing method prioritizes samples closer to the decision boundary and assigns them a higher weight to improve the generalizability of fair classifiers. Extensive experiments are performed to validate the generalizability of our adaptive priority reweighing method for accuracy and fairness measures (i.e., equal opportunity, equalized odds, and demographic parity) on tabular benchmarks. We also highlight the performance of our method in improving the fairness of language and vision models. The code is available at //github.com/che2198/APW.
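An illustrative sketch (not the released APW code) of the reweighing idea: samples whose predictions sit close to the decision boundary receive larger weights. The exponential weighting form and the alpha parameter are assumptions introduced for illustration.

```python
import torch

def boundary_weights(logits, alpha=2.0):
    # logits: (batch,) raw binary-classifier scores; |logit| ~ distance to the boundary
    dist = logits.abs()
    w = torch.exp(-alpha * dist)      # closer to the boundary -> larger weight
    return w / w.sum() * len(w)       # normalize so weights average to 1

logits = torch.tensor([0.1, -2.5, 0.4, 3.0])
targets = torch.tensor([1.0, 0.0, 1.0, 1.0])
w = boundary_weights(logits)
loss = (w * torch.nn.functional.binary_cross_entropy_with_logits(
    logits, targets, reduction="none")).mean()
print(w, loss)
```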

Differential Dynamic Programming (DDP) is an efficient computational tool for solving nonlinear optimal control problems. It was originally designed as a single shooting method and thus is sensitive to the initial guess supplied. This work considers the extension of DDP to multiple shooting (MS), improving its robustness to initial guesses. A novel derivation is proposed that accounts for the defect between shooting segments during the DDP backward pass, while still maintaining quadratic convergence locally. The derivation enables unifying multiple previous MS algorithms, and opens the door to many smaller algorithmic improvements. A penalty method is introduced to strategically control the step size, further improving the convergence performance. An adaptive merit function and a more reliable acceptance condition are employed for globalization. The effects of these improvements are benchmarked for trajectory optimization with a quadrotor, an acrobot, and a manipulator. MS-DDP is also demonstrated for use in Model Predictive Control (MPC) for dynamic jumping with a quadruped robot, showing its benefits over a single shooting approach.
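A toy sketch of the multiple-shooting ingredient mentioned above: the trajectory is split into segments rolled out independently, and the "defect" is the gap between one segment's endpoint and the next segment's initial guess. The single-integrator dynamics and all numbers are placeholders; a real MS-DDP backward pass drives these defects to zero.

```python
import numpy as np

def rollout(x0, u_seq, dt=0.05):
    # placeholder dynamics x' = u, integrated with forward Euler
    x, xs = x0, [x0]
    for u in u_seq:
        x = x + dt * u
        xs.append(x)
    return np.array(xs)

starts = [np.array([0.0]), np.array([0.9]), np.array([2.1])]  # per-segment initial guesses
controls = [np.ones(10), np.ones(10), np.ones(10)]            # per-segment control guesses
defects = []
for k in range(len(starts) - 1):
    end = rollout(starts[k], controls[k])[-1]
    defects.append(end - starts[k + 1])   # accounted for in the DDP backward pass
print(defects)                            # nonzero gaps between shooting segments
```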

Generalized zero-shot learning (GZSL) aims to classify samples from seen and unseen labels, assuming unseen labels are not accessible during training. Recent advancements in GZSL have been expedited by incorporating contrastive-learning-based (instance-based) embedding in generative networks and leveraging the semantic relationship between data points. However, existing embedding architectures suffer from two limitations: (1) limited discriminability of synthetic features' embedding without considering fine-grained cluster structures; and (2) inflexible optimization due to restricted scaling mechanisms on existing contrastive embedding networks, leading to overlapped representations in the embedding space. To enhance the quality of representations in the embedding space, as mentioned in (1), we propose a margin-based prototypical contrastive learning embedding network that reaps the benefits of prototype-data interaction (cluster quality enhancement) and implicit data-data interaction (fine-grained representations) while providing substantial cluster supervision to the embedding network and the generator. To tackle (2), we propose an instance-adaptive contrastive loss that leads to generalized representations for unseen labels with an increased inter-class margin. Through comprehensive experimental evaluation, we show that our method can outperform the current state-of-the-art on three benchmark datasets. Our approach also consistently achieves the best unseen performance in the GZSL setting.
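A hedged sketch of a margin-based prototypical contrastive term of the kind described: each embedding is pulled toward its class prototype and pushed from the others, with an additive margin on the true-class logit. The margin and temperature values are illustrative, not the paper's settings.

```python
import torch
import torch.nn.functional as F

def proto_margin_loss(z, prototypes, y, margin=0.2, tau=0.1):
    # z: (batch, d) embeddings; prototypes: (num_classes, d); y: (batch,) labels
    z = F.normalize(z, dim=-1)
    p = F.normalize(prototypes, dim=-1)
    logits = z @ p.t()                                          # cosine similarity to each prototype
    logits = logits - margin * F.one_hot(y, p.size(0)).float()  # margin on the true class
    return F.cross_entropy(logits / tau, y)

z = torch.randn(8, 64)
prototypes = torch.randn(5, 64)
y = torch.randint(0, 5, (8,))
print(proto_margin_loss(z, prototypes, y))
```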

Apache TVM (Tensor Virtual Machine), an open-source machine learning compiler framework designed to optimize computations across various hardware platforms, provides an opportunity to improve the performance of dense matrix factorizations such as LU (Lower Upper) decomposition and Cholesky decomposition on GPUs and AI (Artificial Intelligence) accelerators. In this paper, we propose a new TVM autotuning framework using Bayesian Optimization and use the TVM tensor expression language to implement linear algebra kernels such as LU, Cholesky, and 3mm. We use these scientific computation kernels to evaluate the effectiveness of our methods on a GPU cluster, called Swing, at Argonne National Laboratory. We compare the proposed autotuning framework with the TVM autotuning framework AutoTVM with four tuners and find that our framework outperforms AutoTVM in most cases.
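A generic sketch of Bayesian-optimization-style autotuning over a kernel's tuning knobs, assuming a toy search space of tile sizes. The measure() function is a stand-in for compiling and timing a TVM schedule; it is not the TVM API, and the GP surrogate with a lower-confidence-bound acquisition is one common choice, not necessarily the paper's.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def measure(cfg):
    # placeholder cost: pretend runtime depends on tile sizes; lower is better
    tile_x, tile_y = cfg
    return (tile_x - 16) ** 2 + (tile_y - 8) ** 2 + np.random.randn() * 0.1

space = np.array([(tx, ty) for tx in [4, 8, 16, 32] for ty in [2, 4, 8, 16]], dtype=float)
X, y = space[:3].tolist(), [measure(c) for c in space[:3]]   # small initial design
gp = GaussianProcessRegressor(alpha=1e-6)                    # surrogate over configs
for _ in range(10):
    gp.fit(np.array(X), np.array(y))
    mu, sigma = gp.predict(space, return_std=True)
    cand = space[np.argmin(mu - 1.0 * sigma)]                # lower confidence bound
    X.append(cand.tolist())
    y.append(measure(cand))
print("best config:", X[int(np.argmin(y))], "time:", min(y))
```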

Unsupervised contrastive learning methods have recently seen significant improvements, particularly through data augmentation strategies that aim to produce robust and generalizable representations. However, prevailing data augmentation methods, whether hand-designed or based on foundation models, tend to rely heavily on prior knowledge or external data. This dependence often compromises their effectiveness and efficiency. Furthermore, the applicability of most existing data augmentation strategies is limited when transitioning to other research domains, especially science-related data. This limitation stems from the paucity of prior knowledge and labeled data available in these domains. To address these challenges, we introduce DiffAug, a novel and efficient Diffusion-based data Augmentation technique. DiffAug aims to ensure that the augmented and original data share a smoothed latent space, which is achieved through diffusion steps. Uniquely, unlike traditional methods, DiffAug first mines sufficient prior semantic knowledge about the neighborhood. This provides a constraint to guide the diffusion steps, eliminating the need for labels, external data/models, or prior knowledge. Designed as an architecture-agnostic framework, DiffAug provides consistent improvements. Specifically, it improves image classification and clustering accuracy by 1.6% to 4.5%. When applied to biological data, DiffAug improves performance by up to 10.1%, with an average improvement of 5.8%. DiffAug shows good performance in both the vision and biological domains.
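A schematic sketch of diffusion-based augmentation in the spirit described: noise a latent with a forward diffusion step, then denoise it so the augmented view stays in a smoothed neighborhood of the original. The noise schedule, the step size, and the stand-in denoiser are all assumptions; a real implementation would use a denoiser trained on the unlabeled data.

```python
import torch

def diffaug(z, denoiser, t=0.3, steps=4):
    # z: (batch, d) latents of the original samples
    noisy = (1 - t) ** 0.5 * z + t ** 0.5 * torch.randn_like(z)  # forward noising step
    for _ in range(steps):                                       # iterative denoising
        noisy = noisy + 0.25 * (denoiser(noisy) - noisy)         # drift toward the data manifold
    return noisy                                                 # augmented view of z

z = torch.randn(8, 32)
aug = diffaug(z, denoiser=lambda v: v * 0.9)                     # placeholder denoiser
print((aug - z).norm(dim=-1))                                    # augmented views stay nearby
```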

Contrastive learning models have achieved great success in unsupervised visual representation learning by maximizing the similarities between feature representations of different views of the same image while minimizing the similarities between feature representations of views of different images. In text summarization, the output summary is a shorter form of the input document, and the two have similar meanings. In this paper, we propose a contrastive learning model for supervised abstractive text summarization, where we view a document, its gold summary, and its model-generated summaries as different views of the same meaning representation and maximize the similarities between them during training. We improve over a strong sequence-to-sequence text generation model (i.e., BART) on three different summarization datasets. Human evaluation also shows that our model achieves better faithfulness ratings compared to its counterpart without contrastive objectives.
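A hedged sketch of the training signal described above: treat the document, its gold summary, and model-generated summaries as views of one meaning and maximize their pairwise cosine similarity. The toy random embeddings stand in for BART encoder outputs.

```python
import torch
import torch.nn.functional as F

def view_agreement_loss(views):
    # views: list of (d,) embeddings for document, gold summary, generated summaries
    z = F.normalize(torch.stack(views), dim=-1)       # (num_views, d)
    sim = z @ z.t()                                   # pairwise cosine similarities
    off_diag = sim[~torch.eye(len(views), dtype=torch.bool)]
    return -off_diag.mean()                           # minimize -> maximize cross-view agreement

doc, gold, gen = torch.randn(3, 128)                  # placeholder view embeddings
print(view_agreement_loss([doc, gold, gen]))
```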

Recent contrastive representation learning methods rely on estimating mutual information (MI) between multiple views of an underlying context. For example, we can derive multiple views of a given image by applying data augmentation, or we can split a sequence into views comprising the past and future of some step in the sequence. Contrastive lower bounds on MI are easy to optimize, but have a strong underestimation bias when estimating large amounts of MI. We propose decomposing the full MI estimation problem into a sum of smaller estimation problems by splitting one of the views into progressively more informed subviews and by applying the chain rule on MI between the decomposed views. This expression contains a sum of unconditional and conditional MI terms, each measuring modest chunks of the total MI, which facilitates approximation via contrastive bounds. To maximize the sum, we formulate a contrastive lower bound on the conditional MI which can be approximated efficiently. We refer to our general approach as Decomposed Estimation of Mutual Information (DEMI). We show that DEMI can capture a larger amount of MI than standard non-decomposed contrastive bounds in a synthetic setting, and learns better representations in a vision domain and for dialogue generation.
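A schematic sketch of the decomposition: split view y into subviews (y1, y2) and estimate I(x; y) = I(x; y1) + I(x; y2 | y1), each term via an InfoNCE-style contrastive bound. The bilinear critics and the way the conditional critic "sees" y1 (by concatenation) are illustrative assumptions, not DEMI's exact estimator.

```python
import torch
import torch.nn.functional as F

def infonce_loss(scores):
    # scores: (batch, batch) critic scores; positive pairs sit on the diagonal
    return F.cross_entropy(scores, torch.arange(scores.size(0)))

x, y1, y2 = torch.randn(3, 16, 32)                   # anchor x and two subviews of y
W1 = torch.randn(32, 32)                             # toy bilinear critic for I(x; y1)
W2 = torch.randn(64, 32)                             # conditional critic also conditioned on y1
loss_marginal = infonce_loss((x @ W1) @ y1.t())      # bound on I(x; y1)
cond = torch.cat([y1, y2], dim=-1) @ W2              # joint subview features (batch, 32)
loss_conditional = infonce_loss(x @ cond.t())        # bound on I(x; y2 | y1)
total = loss_marginal + loss_conditional             # minimizing tightens the summed bound
print(total)
```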

Data augmentation has been widely used to improve the generalizability of machine learning models. However, comparatively little work studies data augmentation for graphs. This is largely due to the complex, non-Euclidean structure of graphs, which limits possible manipulation operations: augmentation operations commonly used in vision and language have no analogs for graphs. Our work studies graph data augmentation for graph neural networks (GNNs) in the context of improving semi-supervised node classification. We discuss practical and theoretical motivations, considerations, and strategies for graph data augmentation. Our work shows that neural edge predictors can effectively encode class-homophilic structure to promote intra-class edges and demote inter-class edges in a given graph structure, and our main contribution introduces the GAug graph data augmentation framework, which leverages these insights to improve performance in GNN-based node classification via edge prediction. Extensive experiments on multiple benchmarks show that augmentation via GAug improves performance across GNN architectures and datasets.
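A minimal sketch of the edge-prediction augmentation idea under toy assumptions: an edge predictor (here, a simple dot-product score on node embeddings) proposes edge probabilities, and the graph is modified by adding the top-scoring missing edges and dropping the lowest-scoring existing ones. Fractions, the scoring model, and the (omitted) symmetry handling are illustrative, not GAug's exact procedure.

```python
import torch

def augment_graph(adj, emb, add_frac=0.05):
    # adj: (n, n) 0/1 adjacency; emb: (n, d) node embeddings from an edge predictor
    probs = torch.sigmoid(emb @ emb.t())          # predicted edge probabilities
    n = adj.size(0)
    k = max(1, int(add_frac * adj.sum() / 2))
    missing = probs * (1 - adj)                   # candidates to add (non-edges)
    existing = probs * adj + (1 - adj)            # existing edges, ranked for removal
    add_idx = missing.flatten().topk(k).indices   # most likely missing edges
    drop_idx = (-existing).flatten().topk(k).indices  # least likely existing edges
    new_adj = adj.clone().flatten()
    new_adj[add_idx] = 1.0                        # promote likely intra-class edges
    new_adj[drop_idx] = 0.0                       # demote unlikely (inter-class) edges
    return new_adj.view(n, n)

adj = (torch.rand(10, 10) > 0.6).float()
adj = ((adj + adj.t()) > 0).float()               # symmetrize the toy graph
print(augment_graph(adj, torch.randn(10, 8)).sum())
```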
