亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tr id='MongO'><strong id='wb8Dd'></strong><small id='a9KZA'></small><button id='wDNUi'></button><li id='ylwQP'><noscript id='H2tFw'><big id='rk8oa'></big><dt id='g5BEZ'></dt></noscript></li></tr><ol id='T7beH'><option id='HnU8I'><table id='3PBJz'><blockquote id='A2GAh'><tbody id='FSCgV'></tbody></blockquote></table></option></ol><u id='1bwZt'></u><kbd id='Cci5y'><kbd id='Bksji'></kbd></kbd>

<code id='9Bhil'><strong id='L2997'></strong></code>

<fieldset id='gPJKK'></fieldset>

<span id='ykoOH'></span>

<ins id='QVMig'></ins>

<acronym id='jZKGj'><em id='030Yx'></em><td id='ypN7n'><div id='P4LQs'></div></td></acronym><address id='2Jzlc'><big id='gxBVi'><big id='X5LJ6'></big><legend id='oZ01M'></legend></big></address>

<i id='qpPzv'><div id='ny6Yh'><ins id='74YCr'></ins></div></i>

<i id='85CPP'></i>

·

圖 · Learning · 匯聚 · contrastive · GPS ·

2024 年 1 月 29 日

GPS: Graph Contrastive Learning via Multi-scale Augmented Views from Adversarial Pooling

Wei Ju,Yiyang Gu,Zhengyang Mao,Ziyue Qiao,Yifang Qin,Xiao Luo,Hui Xiong,Ming Zhang

from arxiv, Accepted by SCIENCE CHINA Information Sciences (SCIS 2024)

Self-supervised graph representation learning has recently shown considerable promise in a range of fields, including bioinformatics and social networks. A large number of graph contrastive learning approaches have shown promising performance for representation learning on graphs, which train models by maximizing agreement between original graphs and their augmented views (i.e., positive views). Unfortunately, these methods usually involve pre-defined augmentation strategies based on the knowledge of human experts. Moreover, these strategies may fail to generate challenging positive views to provide sufficient supervision signals. In this paper, we present a novel approach named Graph Pooling ContraSt (GPS) to address these issues. Motivated by the fact that graph pooling can adaptively coarsen the graph with the removal of redundancy, we rethink graph pooling and leverage it to automatically generate multi-scale positive views with varying emphasis on providing challenging positives and preserving semantics, i.e., strongly-augmented view and weakly-augmented view. Then, we incorporate both views into a joint contrastive learning framework with similarity learning and consistency learning, where our pooling module is adversarially trained with respect to the encoder for adversarial robustness. Experiments on twelve datasets on both graph classification and transfer learning tasks verify the superiority of the proposed method over its counterparts.

相關內容

知識 (knowledge) · 大語言模型 · MoDELS · 語言模型化 · Notability ·

2024 年 3 月 8 日

LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin

Shihan Dou,Enyu Zhou,Yan Liu,Songyang Gao,Jun Zhao,Wei Shen,Yuhao Zhou,Zhiheng Xi,Xiao Wang,Xiaoran Fan,Shiliang Pu,Jiang Zhu,Rui Zheng,Tao Gui,Qi Zhang,Xuanjing Huang

from arxiv, 14 pages, 7 figures

Supervised fine-tuning (SFT) is a crucial step for large language models (LLMs), enabling them to align with human instructions and enhance their capabilities in downstream tasks. Increasing instruction data substantially is a direct solution to align the model with a broader range of downstream tasks or notably improve its performance on a specific task. However, we find that large-scale increases in instruction data can damage the world knowledge previously stored in LLMs. To address this challenge, we propose LoRAMoE, a novelty framework that introduces several low-rank adapters (LoRA) and integrates them by using a router network, like a plugin version of Mixture of Experts (MoE). It freezes the backbone model and forces a portion of LoRAs to focus on leveraging world knowledge to solve downstream tasks, to alleviate world knowledge-edge forgetting. Experimental results show that, as the instruction data increases, LoRAMoE can significantly improve the ability to process downstream tasks, while maintaining the world knowledge stored in the LLM.

容差 · 標注 · 噪聲 · Learning · 圖 ·

2024 年 3 月 8 日

ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance

Ling-Hao Chen,Yuanshuo Zhang,Taohua Huang,Liangcai Su,Zeyi Lin,Xi Xiao,Xiaobo Xia,Tongliang Liu

from arxiv, 24 pages, 14 figures, 15 tables and a project page at //eraseai.github.io/ERASE-page

Deep learning has achieved remarkable success in graph-related tasks, yet this accomplishment heavily relies on large-scale high-quality annotated datasets. However, acquiring such datasets can be cost-prohibitive, leading to the practical use of labels obtained from economically efficient sources such as web searches and user tags. Unfortunately, these labels often come with noise, compromising the generalization performance of deep networks. To tackle this challenge and enhance the robustness of deep learning models against label noise in graph-based tasks, we propose a method called ERASE (Error-Resilient representation learning on graphs for lAbel noiSe tolerancE). The core idea of ERASE is to learn representations with error tolerance by maximizing coding rate reduction. Particularly, we introduce a decoupled label propagation method for learning representations. Before training, noisy labels are pre-corrected through structural denoising. During training, ERASE combines prototype pseudo-labels with propagated denoised labels and updates representations with error resilience, which significantly improves the generalization performance in node classification. The proposed method allows us to more effectively withstand errors caused by mislabeled nodes, thereby strengthening the robustness of deep networks in handling noisy graph data. Extensive experimental results show that our method can outperform multiple baselines with clear margins in broad noise levels and enjoy great scalability. Codes are released at //github.com/eraseai/erase.

MoDELS · 剪枝 · 稀疏 · 模型評估 · 縮放 ·

2024 年 3 月 8 日

Balancing Act: Constraining Disparate Impact in Sparse Models

Meraj Hashemizadeh,Juan Ramirez,Rohan Sukumaran,Golnoosh Farnadi,Simon Lacoste-Julien,Jose Gallego-Posada

from arxiv, Published at ICLR 2024. Code available at //github.com/merajhashemi/balancing-act

Model pruning is a popular approach to enable the deployment of large deep learning models on edge devices with restricted computational or storage capacities. Although sparse models achieve performance comparable to that of their dense counterparts at the level of the entire dataset, they exhibit high accuracy drops for some data sub-groups. Existing methods to mitigate this disparate impact induced by pruning (i) rely on surrogate metrics that address the problem indirectly and have limited interpretability; or (ii) scale poorly with the number of protected sub-groups in terms of computational cost. We propose a constrained optimization approach that directly addresses the disparate impact of pruning: our formulation bounds the accuracy change between the dense and sparse models, for each sub-group. This choice of constraints provides an interpretable success criterion to determine if a pruned model achieves acceptable disparity levels. Experimental results demonstrate that our technique scales reliably to problems involving large models and hundreds of protected sub-groups.

MoDELS · Learning · 深度學習 · Performer · INTERACT ·

2024 年 3 月 7 日

SGNet: Folding Symmetrical Protein Complex with Deep Learning

Zhaoqun Li,Jingcheng Yu,Qiwei Ye

Deep learning has made significant progress in protein structure prediction, advancing the development of computational biology. However, despite the high accuracy achieved in predicting single-chain structures, a significant number of large homo-oligomeric assemblies exhibit internal symmetry, posing a major challenge in structure determination. The performances of existing deep learning methods are limited since the symmetrical protein assembly usually has a long sequence, making structural computation infeasible. In addition, multiple identical subunits in symmetrical protein complex cause the issue of supervision ambiguity in label assignment, requiring a consistent structure modeling for the training. To tackle these problems, we propose a protein folding framework called SGNet to model protein-protein interactions in symmetrical assemblies. SGNet conducts feature extraction on a single subunit and generates the whole assembly using our proposed symmetry module, which largely mitigates computational problems caused by sequence length. Thanks to the elaborate design of modeling symmetry consistently, we can model all global symmetry types in quaternary protein structure prediction. Extensive experimental results on a benchmark of symmetrical protein complexes further demonstrate the effectiveness of our method.

Performer · MoDELS · Networking · 可約的 · Extensibility ·

2024 年 3 月 7 日

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

Jialin Li,Qiang Nie,Weifu Fu,Yuhuan Lin,Guangpin Tao,Yong Liu,Chengjie Wang

from arxiv, 9 pages, 5 figures, 11 tables, CVPR2024 accepted

Deep learning models, particularly those based on transformers, often employ numerous stacked structures, which possess identical architectures and perform similar functions. While effective, this stacking paradigm leads to a substantial increase in the number of parameters, posing challenges for practical applications. In today's landscape of increasingly large models, stacking depth can even reach dozens, further exacerbating this issue. To mitigate this problem, we introduce LORS (LOw-rank Residual Structure). LORS allows stacked modules to share the majority of parameters, requiring a much smaller number of unique ones per module to match or even surpass the performance of using entirely distinct ones, thereby significantly reducing parameter usage. We validate our method by applying it to the stacked decoders of a query-based object detector, and conduct extensive experiments on the widely used MS COCO dataset. Experimental results demonstrate the effectiveness of our method, as even with a 70\% reduction in the parameters of the decoder, our method still enables the model to achieve comparable or

MoDELS · Performer · IP · 剪枝 · 掩碼 ·

2024 年 3 月 7 日

MAP: MAsk-Pruning for Source-Free Model Intellectual Property Protection

Boyang Peng,Sanqing Qu,Yong Wu,Tianpei Zou,Lianghua He,Alois Knoll,Guang Chen,changjun jiang

from arxiv, Accepted to CVPR 2024

Deep learning has achieved remarkable progress in various applications, heightening the importance of safeguarding the intellectual property (IP) of well-trained models. It entails not only authorizing usage but also ensuring the deployment of models in authorized data domains, i.e., making models exclusive to certain target domains. Previous methods necessitate concurrent access to source training data and target unauthorized data when performing IP protection, making them risky and inefficient for decentralized private data. In this paper, we target a practical setting where only a well-trained source model is available and investigate how we can realize IP protection. To achieve this, we propose a novel MAsk Pruning (MAP) framework. MAP stems from an intuitive hypothesis, i.e., there are target-related parameters in a well-trained model, locating and pruning them is the key to IP protection. Technically, MAP freezes the source model and learns a target-specific binary mask to prevent unauthorized data usage while minimizing performance degradation on authorized data. Moreover, we introduce a new metric aimed at achieving a better balance between source and target performance degradation. To verify the effectiveness and versatility, we have evaluated MAP in a variety of scenarios, including vanilla source-available, practical source-free, and challenging data-free. Extensive experiments indicate that MAP yields new state-of-the-art performance.

知識 (knowledge) · Machine Learning · MoDELS · 學成 · Conformer ·

2022 年 5 月 10 日

Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey

Julian W?rmann,Daniel Bogdoll,Etienne Bührle,Han Chen,Evaristus Fuh Chuo,Kostadin Cvejoski,Ludger van Elst,Tobias Glei?ner,Philip Gottschall,Stefan Griesche,Christian Hellert,Christian Hesels,Sebastian Houben,Tim Joseph,Niklas Keil,Johann Kelsch,Hendrik K?nigshof,Erwin Kraft,Leonie Kreuser,Kevin Krone,Tobias Latka,Denny Mattern,Stefan Matthes,Mohsin Munir,Moritz Nekolla,Adrian Paschke,Maximilian Alexander Pintz,Tianming Qiu,Faraz Qureishi,Syed Tahseen Raza Rizvi,J?rg Reichardt,Laura von Rueden,Stefan Rudolph,Alexander Sagel,Gerhard Schunk,Hao Shen,Hendrik Stapelbroek,Vera Stehr,Gurucharan Srinivas,Anh Tuan Tran,Abhishek Vivekanandan,Ya Wang,Florian Wasserrab,Tino Werner,Christian Wirth,Stefan Zwicklbauer

from arxiv, 93 pages

The existence of representative datasets is a prerequisite of many successful artificial intelligence and machine learning models. However, the subsequent application of these models often involves scenarios that are inadequately represented in the data used for training. The reasons for this are manifold and range from time and cost constraints to ethical considerations. As a consequence, the reliable use of these models, especially in safety-critical applications, is a huge challenge. Leveraging additional, already existing sources of knowledge is key to overcome the limitations of purely data-driven approaches, and eventually to increase the generalization capability of these models. Furthermore, predictions that conform with knowledge are crucial for making trustworthy and safe decisions even in underrepresented scenarios. This work provides an overview of existing techniques and methods in the literature that combine data-based models with existing knowledge. The identified approaches are structured according to the categories integration, extraction and conformity. Special attention is given to applications in the field of autonomous driving.

MoDELS · 可約的 · INTERACT · 估計/估計量 · 學成 ·

2022 年 3 月 3 日

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields

Shanyan Guan,Huayu Deng,Yunbo Wang,Xiaokang Yang

Deep learning has shown great potential for modeling the physical dynamics of complex particle systems such as fluids (in Lagrangian descriptions). Existing approaches, however, require the supervision of consecutive particle properties, including positions and velocities. In this paper, we consider a partially observable scenario known as fluid dynamics grounding, that is, inferring the state transitions and interactions within the fluid particle systems from sequential visual observations of the fluid surface. We propose a differentiable two-stage network named NeuroFluid. Our approach consists of (i) a particle-driven neural renderer, which involves fluid physical properties into the volume rendering function, and (ii) a particle transition model optimized to reduce the differences between the rendered and the observed images. NeuroFluid provides the first solution to unsupervised learning of particle-based fluid dynamics by training these two models jointly. It is shown to reasonably estimate the underlying physics of fluids with different initial shapes, viscosity, and densities. It is a potential alternative approach to understanding complex fluid mechanics, such as turbulence, that are difficult to model using traditional methods of mathematical physics.

相關系數 · AUC · 變換 · 示例 · 學成 ·

2021 年 6 月 2 日

TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classication

Zhuchen Shao,Hao Bian,Yang Chen,Yifeng Wang,Jian Zhang,Xiangyang Ji,Yongbing Zhang

Multiple instance learning (MIL) is a powerful tool to solve the weakly supervised classification in whole slide image (WSI) based pathology diagnosis. However, the current MIL methods are usually based on independent and identical distribution hypothesis, thus neglect the correlation among different instances. To address this problem, we proposed a new framework, called correlated MIL, and provided a proof for convergence. Based on this framework, we devised a Transformer based MIL (TransMIL), which explored both morphological and spatial information. The proposed TransMIL can effectively deal with unbalanced/balanced and binary/multiple classification with great visualization and interpretability. We conducted various experiments for three different computational pathology problems and achieved better performance and faster convergence compared with state-of-the-art methods. The test AUC for the binary tumor classification can be up to 93.09% over CAMELYON16 dataset. And the AUC over the cancer subtypes classification can be up to 96.03% and 98.82% over TCGA-NSCLC dataset and TCGA-RCC dataset, respectively.

圖卷積神經網絡/圖卷積網絡 · INFORMS · 圖卷積 · Integration · 描述符 ·

2021 年 5 月 10 日

Z-GCNETs: Time Zigzags at Graph Convolutional Networks for Time Series Forecasting

Yuzhou Chen,Ignacio Segovia-Dominguez,Yulia R. Gel

from arxiv, Accepted at the International Conference on Machine Learning (ICML) 2021

There recently has been a surge of interest in developing a new class of deep learning (DL) architectures that integrate an explicit time dimension as a fundamental building block of learning and representation mechanisms. In turn, many recent results show that topological descriptors of the observed data, encoding information on the shape of the dataset in a topological space at different scales, that is, persistent homology of the data, may contain important complementary information, improving both performance and robustness of DL. As convergence of these two emerging ideas, we propose to enhance DL architectures with the most salient time-conditioned topological information of the data and introduce the concept of zigzag persistence into time-aware graph convolutional networks (GCNs). Zigzag persistence provides a systematic and mathematically rigorous framework to track the most important topological features of the observed data that tend to manifest themselves over time. To integrate the extracted time-conditioned topological descriptors into DL, we develop a new topological summary, zigzag persistence image, and derive its theoretical stability guarantees. We validate the new GCNs with a time-aware zigzag topological layer (Z-GCNETs), in application to traffic forecasting and Ethereum blockchain price prediction. Our results indicate that Z-GCNET outperforms 13 state-of-the-art methods on 4 time series datasets.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tr id='a77QC'><strong id='I7Fe0'></strong><small id='MAK5n'></small><button id='dlCbg'></button><li id='3kync'><noscript id='AkAEC'><big id='RC5EC'></big><dt id='JPKBZ'></dt></noscript></li></tr><ol id='1x7XS'><option id='rYUMc'><table id='43Yuf'><blockquote id='1HTNb'><tbody id='yFbLr'></tbody></blockquote></table></option></ol><u id='jGfCO'></u><kbd id='xLalp'><kbd id='FnKFU'></kbd></kbd>

<code id='8Th5v'><strong id='QICs0'></strong></code>

<fieldset id='gaC0c'></fieldset>

<span id='dFdqn'></span>

<ins id='rSgSN'></ins>

<acronym id='kOv31'><em id='hrbHB'></em><td id='V3v0u'><div id='huJRO'></div></td></acronym><address id='PDiQT'><big id='nP7QB'><big id='kKywc'></big><legend id='F0qCA'></legend></big></address>

<i id='NM77r'><div id='Gh8G1'><ins id='Wz1BU'></ins></div></i>

<i id='kmGu6'></i>