
Networking · Neural Networks · Analysis · Learning · AIM ·

September 6, 2023

Deep Learning for Polycystic Kidney Disease: Utilizing Neural Networks for Accurate and Early Detection through Gene Expression Analysis

Kapil Panda, Anirudh Mazumder

from arXiv, 6 pages, 5 figures

With Polycystic Kidney Disease (PKD) potentially leading to fatal complications in patients due to the formation of cysts in the kidneys, early detection of PKD is crucial for effective management of the condition. However, the various patient-specific factors that play a role in the diagnosis make it an intricate puzzle for clinicians to solve. Therefore, in this study, we aim to utilize a deep learning-based approach for early disease detection. The devised neural network can achieve accurate and robust predictions for possible PKD in patients by analyzing patient gene expressions.

Related content

Networking

Networking: IFIP International Conferences on Networking. Explanation: international networking conference. Publisher: IFIP. SIT:

INFORMS · Information Extraction · MoDELS · Reducible · Training Data ·

October 23, 2023

Efficient Data Learning for Open Information Extraction with Pre-trained Language Models

Zhiyuan Fan, Shizhu He

Open Information Extraction (OpenIE) is a fundamental yet challenging task in Natural Language Processing, which involves extracting all triples (subject, predicate, object) from a given sentence. While labeling-based methods have their merits, generation-based techniques offer unique advantages, such as the ability to generate tokens not present in the original sentence. However, these generation-based methods often require a significant amount of training data to learn the task form of OpenIE and substantial training time to overcome slow model convergence due to the order penalty. In this paper, we introduce a novel framework, OK-IE, that ingeniously transforms the task form of OpenIE into the pre-training task form of the T5 model, thereby reducing the need for extensive training data. Furthermore, we introduce an innovative concept of Anchor to control the sequence of model outputs, effectively eliminating the impact of order penalty on model convergence and significantly reducing training time. Experimental results indicate that, compared to previous SOTA methods, OK-IE requires only 1/100 of the training data (900 instances) and 1/120 of the training time (3 minutes) to achieve comparable results.

MoDELS · Networking · Gating · Gated RNN · Image Classification ·

October 23, 2023

Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate

Pengfei Sun, Jibin Wu, Malu Zhang, Paul Devos, Dick Botteldooren

Recurrent Neural Networks (RNNs) are renowned for their adeptness in modeling temporal dependencies, a trait that has driven their widespread adoption for sequential data processing. Nevertheless, vanilla RNNs are confronted with the well-known issue of gradient vanishing and exploding, posing a significant challenge for learning and establishing long-range dependencies. Additionally, gated RNNs tend to be over-parameterized, resulting in poor network generalization. To address these challenges, we propose a novel Delayed Memory Unit (DMU) in this paper, wherein a delay line structure, coupled with delay gates, is introduced to facilitate temporal interaction and temporal credit assignment, so as to enhance the temporal modeling capabilities of vanilla RNNs. Particularly, the DMU is designed to directly distribute the input information to the optimal time instant in the future, rather than aggregating and redistributing it over time through intricate network dynamics. Our proposed DMU demonstrates superior temporal modeling capabilities across a broad range of sequential modeling tasks, utilizing considerably fewer parameters than other state-of-the-art gated RNN models in applications such as speech recognition, radar gesture recognition, ECG waveform segmentation, and permuted sequential image classification.

Language Modeling · MoDELS · Optimizer · Decomposition · Reducible ·

October 23, 2023

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models

Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, Ming Gao, Baoyuan Wang

from arXiv, accepted to EMNLP 2023

Chain-of-Thought (CoT) prompting has proven to be effective in enhancing the reasoning capabilities of Large Language Models (LLMs) with at least 100 billion parameters. However, it is ineffective or even detrimental when applied to reasoning tasks in Smaller Language Models (SLMs) with less than 10 billion parameters. To address this limitation, we introduce Dialogue-guided Chain-of-Thought (DialCoT) which employs a dialogue format to generate intermediate reasoning steps, guiding the model toward the final answer. Additionally, we optimize the model's reasoning path selection using the Proximal Policy Optimization (PPO) algorithm, further enhancing its reasoning capabilities. Our method offers several advantages compared to previous approaches. Firstly, we transform the process of solving complex reasoning questions by breaking them down into a series of simpler sub-questions, significantly reducing the task difficulty and making it more suitable for SLMs. Secondly, we optimize the model's reasoning path selection through the PPO algorithm. We conduct comprehensive experiments on four arithmetic reasoning datasets, demonstrating that our method achieves significant performance improvements compared to state-of-the-art competitors.

Linear · Importance Sampling · Sample · Random Field · Stationary ·

October 22, 2023

Nonasymptotic Convergence Rate of Quasi-Monte Carlo: Applications to Linear Elliptic PDEs with Lognormal Coefficients and Importance Samplings

Yang Liu, Raúl Tempone

This study analyzes the nonasymptotic convergence behavior of the quasi-Monte Carlo (QMC) method with applications to linear elliptic partial differential equations (PDEs) with lognormal coefficients. Building upon the error analysis presented in (Owen, 2006), we derive a nonasymptotic convergence estimate depending on the specific integrands, the input dimensionality, and the finite number of samples used in the QMC quadrature. We discuss the effects of the variance and dimensionality of the input random variable. Then, we apply the QMC method with importance sampling (IS) to approximate deterministic, real-valued, bounded linear functionals that depend on the solution of a linear elliptic PDE with a lognormal diffusivity coefficient in bounded domains of $\mathbb{R}^d$, where the random coefficient is modeled as a stationary Gaussian random field parameterized by the trigonometric and wavelet-type basis. We propose two types of IS distributions, analyze their effects on the QMC convergence rate, and observe the improvements.

MoDELS · Scenario · Performer · Pairwise · Dataset ·

October 20, 2023

Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

Michael Günther, Louis Milliken, Jonathan Geuter, Georgios Mastrapas, Bo Wang, Han Xiao

from arXiv, 9 pages, 2 page appendix

Jina Embeddings constitutes a set of high-performance sentence embedding models adept at translating textual inputs into numerical representations, capturing the semantics of the text. These models excel in applications like dense retrieval and semantic textual similarity. This paper details the development of Jina Embeddings, starting with the creation of high-quality pairwise and triplet datasets. It underlines the crucial role of data cleaning in dataset preparation, offers in-depth insights into the model training process, and concludes with a comprehensive performance evaluation using the Massive Text Embedding Benchmark (MTEB). Furthermore, to increase the model's awareness of grammatical negation, we construct a novel training and evaluation dataset of negated and non-negated statements, which we make publicly available to the community.

MoDELS · Specialization · Vision · Trace · Facebook AI Research ·

October 20, 2023

Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks

Haobo Song, Soumajit Majumder, Tao Lin

Implicit models such as Deep Equilibrium Models (DEQs) have garnered significant attention in the community for their ability to train infinite layer models with elegant solution-finding procedures and constant memory footprint. However, despite several attempts, these methods are heavily constrained by model inefficiency and optimization instability. Furthermore, fair benchmarking across relevant methods for vision tasks is missing. In this work, we revisit the line of implicit models and trace them back to the original weight-tied models. Surprisingly, we observe that weight-tied models are more effective, stable, as well as efficient on vision tasks, compared to the DEQ variants. Through the lens of these simple-yet-clean weight-tied models, we further study the fundamental limits in the model capacity of such models and propose the use of distinct sparse masks to improve the model capacity. Finally, for practitioners, we offer design guidelines regarding the depth, width, and sparsity selection for weight-tied models, and demonstrate the generalizability of our insights to other learning paradigms.

Projection · Identifiable · MoDELS · Dataset · Analysis ·

October 19, 2023

Experimenting AI Technologies for Disinformation Combat: the IDMO Project

Lorenzo Canale, Alberto Messina

The Italian Digital Media Observatory (IDMO) project, part of a European initiative, focuses on countering disinformation and fake news. This report outlines contributions from Rai-CRITS to the project, including: (i) the creation of novel datasets for testing technologies; (ii) the development of an automatic model for categorizing Pagella Politica verdicts to facilitate broader analysis; (iii) the creation of an automatic model for recognizing textual entailment with exceptional accuracy on the FEVER dataset; (iv) an assessment using GPT-4 to identify textual entailment; and (v) a game to raise awareness about fake news at national events.

Graph · Networking · Processing (programming language) · Graph Convolution · Graph Convolutional Neural Network/Graph Convolutional Network ·

December 27, 2021

Powerful Graph Convolutional Networks with Adaptive Propagation Mechanism for Homophily and Heterophily

Tao Wang, Rui Wang, Di Jin, Dongxiao He, Yuxiao Huang

Graph Convolutional Networks (GCNs) have been widely applied in various fields due to their significant power in processing graph-structured data. The typical GCN and its variants work under a homophily assumption (i.e., nodes of the same class are prone to connect to each other), while ignoring the heterophily that exists in many real-world networks (i.e., nodes of different classes tend to form edges). Existing methods deal with heterophily mainly by aggregating higher-order neighborhoods or combining the immediate representations, which leads to noise and irrelevant information in the result. However, these methods do not change the propagation mechanism, a fundamental part of GCNs, which works under the homophily assumption. This makes it difficult to distinguish the representations of nodes from different classes. To address this problem, in this paper we design a novel propagation mechanism, which can automatically change the propagation and aggregation process according to homophily or heterophily between node pairs. To adaptively learn the propagation process, we introduce two measurements of homophily degree between node pairs, which are learned based on topological and attribute information, respectively. Then we incorporate the learnable homophily degree into the graph convolution framework, which is trained in an end-to-end schema, enabling it to go beyond the assumption of homophily. More importantly, we theoretically prove that our model can constrain the similarity of representations between nodes according to their homophily degree. Experiments on seven real-world datasets demonstrate that this new approach outperforms the state-of-the-art methods under heterophily or low homophily, and achieves competitive performance under homophily.

Generalization Theory · INFORMS · Estimation/Estimator · Mutual Information · Generalization Error ·

June 18, 2021

A Probabilistic Representation of DNNs: Bridging Mutual Information and Generalization

Xinjie Lan, Kenneth Barner

from arXiv, to appear in the ICML 2021 Workshop on Theoretic Foundation, Criticism, and Application Trend of Explainable AI

Recently, Mutual Information (MI) has attracted attention in bounding the generalization error of Deep Neural Networks (DNNs). However, it is intractable to accurately estimate the MI in DNNs, thus most previous works have to relax the MI bound, which in turn weakens the information theoretic explanation for generalization. To address the limitation, this paper introduces a probabilistic representation of DNNs for accurately estimating the MI. Leveraging the proposed MI estimator, we validate the information theoretic explanation for generalization, and derive a tighter generalization bound than the state-of-the-art relaxations.

Object Detection · Learned · Deep Learning · Performance · BASIC ·

May 26, 2021

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey

Feifei Shao, Long Chen, Jian Shao, Wei Ji, Shaoning Xiao, Lu Ye, Yueting Zhuang, Jun Xiao

from arXiv, 13 pages, 4 figures

Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e., detecting multiple and single instances with bounding boxes in an image using image-level labels, are long-standing and challenging tasks in the CV community. With the success of deep neural networks in object detection, both WSOD and WSOL have received unprecedented attention. Hundreds of WSOD and WSOL methods and numerous techniques have been proposed in the deep learning era. To this end, in this paper, we consider WSOL a sub-task of WSOD and provide a comprehensive survey of the recent achievements of WSOD. Specifically, we first describe the formulation and setting of WSOD, including the background, challenges, and basic framework. Meanwhile, we summarize and analyze all advanced techniques and training tricks for improving detection performance. Then, we introduce the widely used datasets and evaluation metrics of WSOD. Lastly, we discuss the future directions of WSOD. We believe that these summaries can help pave the way for future research on WSOD and WSOL.


Related topics

Neural Networks
