Robustness · Neural Networks · Optimizer · Networking · Model evaluation

October 25, 2024

A constrained optimization approach to improve robustness of neural networks

Shudian Zhao, Jan Kronqvist

from arxiv, 29 pages, 4 figures, 5 tables

In this paper, we present a novel nonlinear programming-based approach to fine-tune pre-trained neural networks to improve robustness against adversarial attacks while maintaining high accuracy on clean data. Our method introduces adversary-correction constraints to ensure correct classification of adversarial data and minimizes changes to the model parameters. We propose an efficient cutting-plane-based algorithm to iteratively solve the large-scale nonconvex optimization problem by approximating the feasible region through polyhedral cuts and balancing between robustness and accuracy. Computational experiments on standard datasets such as MNIST and CIFAR10 demonstrate that the proposed approach significantly improves robustness, even with a very small set of adversarial data, while maintaining minimal impact on accuracy.

Related content

Robustness

Regularization term · Stochastic gradient descent · Knowledge · Gradient descent · Mean

December 7, 2024

Hanke-Raus heuristic rule for iteratively regularized stochastic gradient descent

Harshit Bajpai, Gaurav Mittal, Ankik Kumar Giri

In this work, we present a novel variant of the stochastic gradient descent method, termed the iteratively regularized stochastic gradient descent (IRSGD) method, to solve nonlinear ill-posed problems in Hilbert spaces. Under standard assumptions, we demonstrate that the mean square iteration error of the method converges to zero for exact data. In the presence of noisy data, we first propose a heuristic parameter choice rule (HPCR) based on the method suggested by Hanke and Raus, and then apply the IRSGD method in combination with HPCR. Precisely, HPCR selects the regularization parameter without requiring any a priori knowledge of the noise level. We show that the method terminates in finitely many steps in the case of noisy data and has regularizing features. Further, we discuss the convergence rates of the method using well-known source and other related conditions under HPCR as well as the discrepancy principle. To the best of our knowledge, this is the first work that establishes both the regularization properties and convergence rates of a stochastic gradient method using a heuristic-type rule in the setting of infinite-dimensional Hilbert spaces. Finally, we provide numerical experiments to showcase the practical efficacy of the proposed method.

Class · Partition · Weight · Constraint · Scenario

December 6, 2024

On a class of interdiction problems with partition matroids: complexity and polynomial-time algorithms

Sergey S. Ketkov, Oleg A. Prokopyev

In this study, we consider a class of linear matroid interdiction problems, where the feasible sets for the upper-level decision-maker (referred to as a leader) and the lower-level decision-maker (referred to as a follower) are induced by two distinct partition matroids with a common weighted ground set. Unlike classical network interdiction models where the leader is subject to a single budget constraint, in our setting, both the leader and the follower are subject to several independent capacity constraints and engage in a zero-sum game. While the problem of finding a maximum weight independent set in a partition matroid is known to be polynomially solvable, we prove that the considered bilevel problem is $NP$-hard even when the weights of ground elements are all binary. On a positive note, it is revealed that, if the number of capacity constraints is fixed for either the leader or the follower, then the considered class of bilevel problems admits several polynomial-time solution schemes. Specifically, these schemes are based on a single-level dual reformulation, a dynamic programming-based approach, and a greedy algorithm for the leader.
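The abstract notes that a maximum weight independent set in a partition matroid can be found in polynomial time. A minimal sketch of that single-level subproblem follows, assuming the ground set is split into disjoint blocks with one capacity per block; the function name `max_weight_independent_set` and its dictionary arguments are hypothetical and do not come from the paper. It covers only the follower-style subproblem, not the bilevel interdiction problem.

```python
from collections import defaultdict


def max_weight_independent_set(weights, block_of, capacity):
    """Greedy maximum weight independent set in a partition matroid.

    weights:  dict element -> weight
    block_of: dict element -> block label (each element is in exactly one block)
    capacity: dict block label -> maximum number of elements taken from that block
    Returns (total_weight, sorted list of chosen elements).
    """
    by_block = defaultdict(list)
    for element, w in weights.items():
        by_block[block_of[element]].append((w, element))

    chosen = []
    for block, items in by_block.items():
        # Blocks are independent of each other, so each one is solved separately:
        # keep the heaviest elements up to the block capacity.
        items.sort(key=lambda pair: pair[0], reverse=True)
        for w, element in items[: capacity[block]]:
            # Non-positive weights never increase the objective, so skip them.
            if w > 0:
                chosen.append(element)

    total = sum(weights[e] for e in chosen)
    return total, sorted(chosen)


if __name__ == "__main__":
    weights = {"a": 5, "b": 3, "c": 4, "d": 1, "e": -2}
    block_of = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1}
    capacity = {0: 2, 1: 2}
    print(max_weight_independent_set(weights, block_of, capacity))
```

Because the capacity constraints act on disjoint blocks, sorting within each block is enough, which gives an O(n log n) procedure in this sketch.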

Neural Networks · Networking · Performer · Linear · Range

December 6, 2024

Hybrid deep additive neural networks

Gyu Min Kim, Jeong Min Jeon

from arxiv, 30 pages, 10 figures

Traditional neural networks (multi-layer perceptrons) have become an important tool in data science due to their success across a wide range of tasks. However, their performance is sometimes unsatisfactory, and they often require a large number of parameters, primarily due to their reliance on the linear combination structure. Meanwhile, additive regression has been a popular alternative to linear regression in statistics. In this work, we introduce novel deep neural networks that incorporate the idea of additive regression. Our neural networks share architectural similarities with Kolmogorov-Arnold networks but are based on simpler yet flexible activation and basis functions. Additionally, we introduce several hybrid neural networks that combine this architecture with that of traditional neural networks. We derive their universal approximation properties and demonstrate their effectiveness through simulation studies and a real-data application. The numerical results indicate that our neural networks generally achieve better performance than traditional neural networks while using fewer parameters.

Estimation/Estimator · Inference · University · Universal approximator · CASE

December 5, 2024

From interpretability to inference: an estimation framework for universal approximators

from arxiv, 37 pages, 5 figures, 3 tables, 1 algorithm

We present a novel framework for estimation and inference with the broad class of universal approximators. Estimation is based on the decomposition of model predictions into Shapley values. Inference relies on analyzing the bias and variance properties of individual Shapley components. We show that Shapley value estimation is asymptotically unbiased, and we introduce Shapley regressions as a tool to uncover the true data generating process from noisy data alone. Linear regression is the well-known special case of our framework in which the model is linear in parameters. We present theoretical, numerical, and empirical results for the estimation of heterogeneous treatment effects as our guiding example.
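To illustrate what decomposing a prediction into Shapley values means, here is a minimal sketch that computes exact Shapley values by enumerating feature subsets. It assumes that a feature outside the coalition is replaced by a fixed baseline value; that choice, and the names `shapley_values` and `linear_model_value`, are illustrative assumptions rather than the estimator defined in the paper. For a linear model the components reduce to coefficient times deviation from the baseline, which matches the abstract's remark about linear regression as a special case.

```python
from itertools import combinations
from math import factorial


def shapley_values(value, n):
    """Exact Shapley values of a set function `value` over players 0..n-1."""
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Probability weight of a coalition of this size preceding player i.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = frozenset(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def linear_model_value(coefs, x, baseline):
    """Value function: model output with features outside the coalition set to baseline."""
    def value(subset):
        return sum(
            c * (x[j] if j in subset else baseline[j]) for j, c in enumerate(coefs)
        )
    return value


if __name__ == "__main__":
    coefs = [2.0, -1.0, 0.5]
    x = [1.0, 3.0, 4.0]
    baseline = [0.0, 1.0, 2.0]
    print(shapley_values(linear_model_value(coefs, x, baseline), len(coefs)))
```

The enumeration visits every subset, so the cost grows exponentially in the number of features; it is meant only for small illustrative cases.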

Conformer · Paper · Computational learning theory · Statistical theory

December 4, 2024

Validity and efficiency of the conformal CUSUM procedure

Vladimir Vovk, Ilia Nouretdinov, Alex Gammerman

from arxiv, 19 pages, 7 figures

In this paper we study the validity and efficiency of a conformal version of the CUSUM procedure for change detection both experimentally and theoretically.

Identifiable · Maximum likelihood estimation · Estimation/Estimator · MoDELS · Periodic

December 4, 2024

Identifiability implies consistency of MLE in partially observed diffusions on a torus

Ibrahim Ekren, Sergey Nadtochiy

In this paper, we consider a general partially observed diffusion model with periodic coefficients and with non-degenerate diffusion component. The coefficients of such a model depend on an unknown (static and deterministic) parameter which needs to be estimated based on the observed component of the diffusion process. We show that, under a minimal assumption of identifiability, and given enough regularity of the diffusion coefficients, a maximum likelihood estimator of the unknown parameter converges to the true parameter value as the sample size grows to infinity.

MoDELS · Language modeling · Learning · Lecture notes · state-of-the-art

December 4, 2024

Automatic detection of diseases in Spanish clinical notes combining medical language models and ontologies

Leon-Paul Schaub Torre, Pelayo Quiros, Helena Garcia Mieres

from arxiv, Translation of SEPLN 2024 es paper

In this paper we present a hybrid method for the automatic detection of dermatological pathologies in medical reports. We use a large language model combined with medical ontologies to predict, given a first appointment or follow-up medical report, the pathology a person may suffer from. The results show that teaching the model to learn the type, severity and location on the body of a dermatological pathology, as well as the order in which it has to learn these three features, significantly increases its accuracy. The article demonstrates state-of-the-art results for the classification of medical texts, with a precision of 0.84 and micro and macro F1-scores of 0.82 and 0.75, and makes both the method and the data set used available to the community.

Microsoft Surface · Neural Networks · Networking · MoDELS · Loss function (machine learning)

May 28, 2021

Incorporating prior financial domain knowledge into neural networks for implied volatility surface prediction

Yu Zheng, Yongxin Yang, Bowei Chen

from arxiv, 8 pages, SIGKDD 2021

In this paper we develop a novel neural network model for predicting the implied volatility surface. Prior financial domain knowledge is taken into account. A new activation function that incorporates the volatility smile is proposed, which is used for the hidden nodes that process the underlying asset price. In addition, financial conditions, such as the absence of arbitrage, the boundaries and the asymptotic slope, are embedded into the loss function. This is one of the very first studies to discuss a methodological framework that incorporates prior financial domain knowledge into neural network architecture design and model training. The proposed model outperforms the benchmark models on option data for the S&P 500 index over 20 years. More importantly, the domain knowledge is satisfied empirically, showing that the model is consistent with the existing financial theories and conditions related to the implied volatility surface.

Neural Networks · Parse · Networking · 粵港澳大灣區數字經濟研究院 · Parse tree

February 25, 2021

How to represent part-whole hierarchies in a neural network

Geoffrey Hinton

from arxiv, 43 pages, 5 figures

This paper does not describe a working system. Instead, it presents a single idea about representation which allows advances made by several different groups to be combined into an imaginary system called GLOM. The advances include transformers, neural fields, contrastive representation learning, distillation and capsules. GLOM answers the question: How can a neural network with a fixed architecture parse an image into a part-whole hierarchy which has a different structure for each image? The idea is simply to use islands of identical vectors to represent the nodes in the parse tree. If GLOM can be made to work, it should significantly improve the interpretability of the representations produced by transformer-like systems when applied to vision or language.

FCN · Fully convolutional network · 3D · Cascade · MoDELS

March 20, 2018

An application of cascaded 3D fully convolutional networks for medical image segmentation

Holger R. Roth, Hirohisa Oda, Xiangrong Zhou, Natsuki Shimizu, Ying Yang, Yuichiro Hayashi, Masahiro Oda, Michitaka Fujiwara, Kazunari Misawa, Kensaku Mori

from arxiv, Preprint accepted for publication in Computerized Medical Imaging and Graphics. Substantial extension of arXiv:1704.06382; Corrected references to figure numbers in this version

Recent advances in 3D fully convolutional networks (FCN) have made it feasible to produce dense voxel-wise predictions of volumetric images. In this work, we show that a multi-class 3D FCN trained on manually labeled CT scans of several anatomical structures (ranging from the large organs to thin vessels) can achieve competitive segmentation results, while avoiding the need for handcrafting features or training class-specific models. To this end, we propose a two-stage, coarse-to-fine approach that will first use a 3D FCN to roughly define a candidate region, which will then be used as input to a second 3D FCN. This reduces the number of voxels the second FCN has to classify to ~10% and allows it to focus on more detailed segmentation of the organs and vessels. We utilize training and validation sets consisting of 331 clinical CT images and test our models on a completely unseen data collection acquired at a different hospital that includes 150 CT scans, targeting three anatomical organs (liver, spleen, and pancreas). In challenging organs such as the pancreas, our cascaded approach improves the mean Dice score from 68.5 to 82.2%, achieving the highest reported average score on this dataset. We compare with a 2D FCN method on a separate dataset of 240 CT scans with 18 classes and achieve a significantly higher performance in small organs and vessels. Furthermore, we explore fine-tuning our models to different datasets. Our experiments illustrate the promise and robustness of current 3D FCN based semantic segmentation of medical images, achieving state-of-the-art results. Our code and trained models are available for download: //github.com/holgerroth/3Dunet_abdomen_cascade.
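The results above are reported as Dice scores. As a reminder of how that overlap metric is computed, here is a minimal sketch for two binary masks given as flat sequences of 0/1 voxel labels. The function name `dice_score` and the convention of returning 1.0 when both masks are empty are assumptions made for illustration; they are not taken from the authors' code.

```python
def dice_score(pred, truth):
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two binary masks.

    pred, truth: equally long sequences of 0/1 (or False/True) voxel labels.
    """
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of voxels")
    # Voxels labelled foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, truth) if p and t)
    # Total foreground voxels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    # Assumed convention: two empty masks agree perfectly.
    return 1.0 if total == 0 else 2.0 * intersection / total


if __name__ == "__main__":
    pred = [1, 1, 0, 0, 1]
    truth = [1, 0, 0, 1, 1]
    print(f"Dice: {dice_score(pred, truth):.3f}")
```

A 3D segmentation volume can be flattened into such a sequence per class, and the per-class scores averaged to obtain a mean Dice score.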
