
Finding similar patients is a common objective in precision medicine, facilitating treatment outcome assessment and clinical decision support. Choosing widely available patient features and appropriate mathematical methods for similarity calculations is crucial. International Statistical Classification of Diseases and Related Health Problems (ICD) codes are used worldwide to encode diseases and are available for nearly all patients. Aggregated as sets consisting of primary and secondary diagnoses, they can display a degree of comorbidity and reveal comorbidity patterns. It is possible to compute the similarity of patients based on their ICD codes by using semantic similarity algorithms. These algorithms have traditionally been evaluated using a single-term, expert-rated data set. However, real-world patient data often display varying degrees of documented comorbidities that might impair algorithm performance. To account for this, we present a scale term that considers documented comorbidity variance. In this work, we compared the performance of 80 combinations of established algorithms in terms of semantic similarity based on ICD-code sets. The sets were extracted from patients with a C25.X (pancreatic cancer) primary diagnosis and provide a variety of different combinations of ICD codes. Using our scale term, we obtained the best results with a combination of level-based information content, Leacock & Chodorow concept similarity, and bipartite graph matching for the set similarities, reaching a correlation of 0.75 with our expert's ground truth. Our results highlight the importance of accounting for comorbidity variance while demonstrating how well current semantic similarity algorithms perform.
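To make the winning combination concrete, the sketch below (Python, assuming networkx and scipy are available) scores two ICD-code sets with Leacock & Chodorow concept similarity over a toy taxonomy and aggregates the pairwise scores by maximum-weight bipartite matching. The toy hierarchy, the normalization by the smaller set, and all helper names are illustrative assumptions rather than the authors' implementation, and the level-based information-content weighting is omitted.

```python
# Hedged sketch: ICD-set similarity via Leacock & Chodorow + bipartite matching.
import math
import networkx as nx
import numpy as np
from scipy.optimize import linear_sum_assignment

# Toy ICD-like taxonomy: edges point from parent to child (illustrative only).
taxonomy = nx.DiGraph([
    ("ROOT", "C00-C97"), ("C00-C97", "C25"), ("C25", "C25.0"), ("C25", "C25.9"),
    ("ROOT", "E00-E90"), ("E00-E90", "E11"), ("E11", "E11.9"),
    ("ROOT", "I00-I99"), ("I00-I99", "I10"),
])
undirected = taxonomy.to_undirected()
max_depth = max(nx.shortest_path_length(taxonomy, "ROOT").values())

def leacock_chodorow(c1, c2):
    """sim_LC = -log(path_length / (2 * max_depth)); path length counted in nodes."""
    path_len = nx.shortest_path_length(undirected, c1, c2) + 1
    return -math.log(path_len / (2.0 * max_depth))

def set_similarity(codes_a, codes_b):
    """Maximum-weight bipartite matching over pairwise concept similarities,
    normalized by the size of the smaller code set (assumed normalization)."""
    sims = np.array([[leacock_chodorow(a, b) for b in codes_b] for a in codes_a])
    rows, cols = linear_sum_assignment(sims, maximize=True)
    return sims[rows, cols].sum() / min(len(codes_a), len(codes_b))

patient_1 = ["C25.0", "E11.9", "I10"]   # primary + secondary diagnoses
patient_2 = ["C25.9", "E11.9"]
print(f"set similarity: {set_similarity(patient_1, patient_2):.3f}")
```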

Related content

One major challenge of disentanglement learning with variational autoencoders is the trade-off between disentanglement and reconstruction fidelity. Previous approaches that increase the information bottleneck during training tend to lose the disentanglement constraint, leading to the information diffusion problem. In this paper, we present a novel framework for disentangled representation learning, DeVAE, which utilizes hierarchical latent spaces with decreasing information bottlenecks across these spaces. The key innovation of our approach lies in connecting the hierarchical latent spaces through disentanglement-invariant transformations, allowing disentanglement properties to be shared among spaces while maintaining an acceptable level of reconstruction performance. We demonstrate the effectiveness of DeVAE in achieving a balance between disentanglement and reconstruction through a series of experiments and ablation studies on the dSprites and Shapes3D datasets. Code is available at //github.com/erow/disentanglement_lib/tree/pytorch#devae.
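The toy PyTorch sketch below illustrates only the general idea of hierarchical latent spaces with a decreasing bottleneck: two KL terms with beta1 > beta2 and a dimension-wise affine map between the spaces, which is assumed here as one possible disentanglement-invariant (axis-preserving) transformation. It is not the DeVAE architecture or loss from the paper.

```python
# Toy two-level hierarchical VAE: tighter bottleneck (larger beta) on the first
# latent space, looser downstream, connected by a per-dimension affine map.
import torch
import torch.nn as nn

class ToyHierarchicalVAE(nn.Module):
    def __init__(self, x_dim=64, z_dim=6):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(x_dim, 128), nn.ReLU(), nn.Linear(128, 2 * z_dim))
        self.scale = nn.Parameter(torch.ones(z_dim))   # dimension-wise map z1 -> z2
        self.shift = nn.Parameter(torch.zeros(z_dim))
        self.dec = nn.Sequential(nn.Linear(z_dim, 128), nn.ReLU(), nn.Linear(128, x_dim))

    @staticmethod
    def kl(mu, logvar):
        return -0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(-1).mean()

    def forward(self, x, beta1=8.0, beta2=1.0):        # beta1 > beta2: bottleneck relaxes
        mu1, logvar1 = self.enc(x).chunk(2, dim=-1)
        z1 = mu1 + torch.randn_like(mu1) * (0.5 * logvar1).exp()
        z2 = self.scale * z1 + self.shift              # axis-preserving (assumed invariant)
        mu2 = self.scale * mu1 + self.shift
        logvar2 = logvar1 + 2 * self.scale.abs().clamp_min(1e-6).log()
        rec = ((self.dec(z2) - x) ** 2).sum(-1).mean()
        return rec + beta1 * self.kl(mu1, logvar1) + beta2 * self.kl(mu2, logvar2)

loss = ToyHierarchicalVAE()(torch.randn(32, 64))
loss.backward()
```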

Recently, advancements in deep learning-based superpixel segmentation methods have brought about improvements in both the efficiency and the performance of segmentation. However, a significant challenge remains in generating superpixels that strictly adhere to object boundaries while conveying rich visual significance, especially when color correlations across surfaces interfere with object delineation. Drawing inspiration from neural structure and visual mechanisms, we propose a biologically inspired network architecture comprising an Enhanced Screening Module (ESM) and a novel Boundary-Aware Label (BAL) for superpixel segmentation. The ESM enhances semantic information by simulating the interactive projection mechanisms of the visual cortex. Additionally, the BAL emulates the spatial frequency characteristics of visual cortical cells to facilitate the generation of superpixels with strong boundary adherence. We demonstrate the effectiveness of our approach through evaluations on both the BSDS500 and NYUv2 datasets.

Recent advancements in robotics have paved the way for robots to replace humans in perilous situations, such as searching for victims in blazing buildings, earthquake-damaged structures, uncharted caves, traversing minefields, or patrolling crime-ridden streets. These challenges can be generalized as problems where agents need to explore unknown mazes. Although various algorithms for single-agent maze exploration exist, extending them to multi-agent systems poses complexities. We propose a solution: a cooperative multi-agent system of automated mobile agents for exploring unknown mazes and locating stationary targets. Our algorithm employs a potential field governing maze exploration, integrating cooperative agent behaviors like collision avoidance, coverage coordination, and path planning. This approach builds upon the Heat Equation Driven Area Coverage (HEDAC) method by Ivić, Crnković, and Mezić. Unlike previous continuous domain applications, we adapt HEDAC for discrete domains, specifically mazes divided into nodes. Our algorithm is versatile, easily modified for anti-collision requirements, and adaptable to expanding mazes and numerical meshes over time. Comparative evaluations against alternative maze-solving methods illustrate our algorithm's superiority. The results highlight significant enhancements, showcasing its applicability across diverse mazes. Numerical simulations affirm its robustness, adaptability, scalability, and simplicity, enabling centralized parallel computation in autonomous systems of basic agents/robots.
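As a rough illustration of the discrete-domain adaptation, the sketch below (plain numpy, all parameters chosen for illustration) treats unexplored free cells of a small grid maze as heat sources, relaxes a diffusion-like potential field, and lets each agent greedily step to the free neighboring cell with the highest potential while avoiding cells already claimed in the same step. It is not the authors' exact HEDAC formulation.

```python
# Hedged sketch of heat-equation-driven coverage on a discrete maze grid.
import numpy as np

maze = np.array([            # 1 = wall, 0 = free
    [0, 0, 0, 1, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
])
free = maze == 0
visited = np.zeros_like(maze, dtype=bool)
agents = [(0, 0), (4, 4)]
moves = [(-1, 0), (1, 0), (0, -1), (0, 1), (0, 0)]

def potential(visited, n_iter=200, alpha=0.25):
    """Jacobi-style diffusion with unexplored free cells acting as constant sources."""
    u = np.zeros(maze.shape)
    source = free & ~visited
    for _ in range(n_iter):
        padded = np.pad(u, 1)
        neighbors = (padded[:-2, 1:-1] + padded[2:, 1:-1] +
                     padded[1:-1, :-2] + padded[1:-1, 2:])
        u = np.where(free, alpha * neighbors + source, 0.0)
    return u

for step in range(40):
    for r, c in agents:
        visited[r, c] = True
    if not (free & ~visited).any():
        print(f"maze fully explored after {step} steps")
        break
    u = potential(visited)
    new_agents = []
    for r, c in agents:
        candidates = [(r + dr, c + dc) for dr, dc in moves
                      if 0 <= r + dr < maze.shape[0] and 0 <= c + dc < maze.shape[1]
                      and free[r + dr, c + dc] and (r + dr, c + dc) not in new_agents]
        if not candidates:           # anti-collision fallback: stay in place
            candidates = [(r, c)]
        new_agents.append(max(candidates, key=lambda p: u[p]))
    agents = new_agents
```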

Compositionality is an important feature of discrete symbolic systems, such as language and programs, as it enables them to have infinite capacity despite a finite symbol set. It serves as a useful abstraction for reasoning in both cognitive science and AI, yet the interface between continuous and symbolic processing is often imposed by fiat at the algorithmic level, such as by means of quantization or a softmax sampling step. In this work, we explore how discretization could be implemented in a more neurally plausible manner through the modeling of attractor dynamics that partition the continuous representation space into basins corresponding to sequences of symbols. Building on established work in attractor networks and introducing novel training methods, we show that imposing structure in the symbolic space can produce compositionality in the attractor-supported representation space of rich sensory inputs. Lastly, we argue that our model exhibits the process of an information bottleneck that is thought to play a role in conscious experience, decomposing the rich information of a sensory input into stable components encoding symbolic information.
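A minimal way to picture attractor-based discretization is a modern-Hopfield-style update that pulls a continuous encoding into the basin of its nearest symbol prototype, as in the sketch below. The prototypes, update rule, and temperature are illustrative assumptions and do not reproduce the paper's attractor networks or training methods.

```python
# Hedged sketch: an attractor update that snaps a continuous vector to a symbol.
import numpy as np

rng = np.random.default_rng(0)
symbols = ["A", "B", "C", "D"]
prototypes = rng.normal(size=(4, 16))            # one attractor per symbol
prototypes /= np.linalg.norm(prototypes, axis=1, keepdims=True)

def attract(x, beta=8.0, steps=20):
    """Iterate x <- P^T softmax(beta * P x); for large beta the fixed point
    lies (close to) one stored prototype, i.e. a discrete symbol."""
    for _ in range(steps):
        logits = beta * prototypes @ x
        weights = np.exp(logits - logits.max())
        weights /= weights.sum()
        x = prototypes.T @ weights
    return x

# A noisy continuous encoding near prototype "B" falls into B's basin.
query = prototypes[1] + 0.3 * rng.normal(size=16)
fixed_point = attract(query)
print("decoded symbol:", symbols[int(np.argmax(prototypes @ fixed_point))])
```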

Generalized cross-validation (GCV) is a widely used method for estimating the squared out-of-sample prediction risk that employs a scalar degrees-of-freedom adjustment (in a multiplicative sense) to the squared training error. In this paper, we examine the consistency of GCV for estimating the prediction risk of arbitrary ensembles of penalized least squares estimators. We show that GCV is inconsistent for any finite ensemble of size greater than one. Towards repairing this shortcoming, we identify a correction that involves an additional scalar correction (in an additive sense) based on degrees-of-freedom-adjusted training errors from each ensemble component. The proposed estimator (termed CGCV) maintains the computational advantages of GCV and requires neither sample splitting, model refitting, nor out-of-bag risk estimation. The estimator stems from a finer inspection of the ensemble risk decomposition and two intermediate risk estimators for the components in this decomposition. We provide a non-asymptotic analysis of CGCV and the two intermediate risk estimators for ensembles of convex penalized estimators under Gaussian features and a linear response model. In the special case of ridge regression, we extend the analysis to general feature and response distributions using random matrix theory, which establishes model-free uniform consistency of CGCV.
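For background, the sketch below computes plain GCV for a single ridge estimator, i.e. the training error inflated by the multiplicative degrees-of-freedom factor $(1 - \mathrm{tr}(S)/n)^{-2}$; this is the baseline quantity being corrected, and the additive CGCV correction for ensembles is not reproduced here.

```python
# Background sketch: plain GCV risk estimate for ridge regression.
import numpy as np

rng = np.random.default_rng(1)
n, d = 200, 50
X = rng.normal(size=(n, d))
beta = rng.normal(size=d) / np.sqrt(d)
y = X @ beta + rng.normal(size=n)

def ridge_gcv(X, y, lam):
    n = len(y)
    S = X @ np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T)   # hat matrix
    resid = y - S @ y
    df = np.trace(S)                                   # effective degrees of freedom
    return (resid @ resid / n) / (1.0 - df / n) ** 2   # GCV estimate of prediction risk

for lam in (0.1, 1.0, 10.0, 100.0):
    print(f"lambda={lam:>6}: GCV risk estimate = {ridge_gcv(X, y, lam):.3f}")
```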

One of the challenges of studying common neurological disorders is disease heterogeneity, including differences in causes, neuroimaging characteristics, comorbidities, and genetic variation. Normative modelling has become a popular method for studying such cohorts, where the 'normal' behaviour of a physiological system is modelled and can be used at the subject level to detect deviations relating to disease pathology. For many heterogeneous diseases, we expect to observe abnormalities across a range of neuroimaging and biological variables. However, thus far, normative models have largely been developed for studying a single imaging modality. We aim to develop a multi-modal normative modelling framework in which abnormality is aggregated across variables of multiple modalities and which is better able to detect deviations than uni-modal baselines. We propose two multi-modal VAE normative models to detect subject-level deviations across T1 and DTI data. Our proposed models were better able to detect diseased individuals, capture disease severity, and correlate with patient cognition than baseline approaches. We also propose a multivariate latent deviation metric, measuring deviations from the joint latent space, which outperformed feature-based metrics.
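As an illustration of a multivariate latent deviation score, the sketch below rates a subject by the Mahalanobis distance of its joint latent code from the healthy-control latent distribution. The stand-in latents are random, and the multi-modal VAE encoder over T1 and DTI features is not reproduced; this is an assumed scoring rule, not necessarily the paper's exact metric.

```python
# Hedged sketch: Mahalanobis-style deviation in a joint latent space.
import numpy as np

rng = np.random.default_rng(2)
latent_dim = 10
control_latents = rng.normal(size=(500, latent_dim))        # healthy training cohort
mu = control_latents.mean(axis=0)
cov_inv = np.linalg.inv(np.cov(control_latents, rowvar=False))

def latent_deviation(z):
    """Mahalanobis distance of a subject's joint latent code from controls."""
    diff = z - mu
    return float(np.sqrt(diff @ cov_inv @ diff))

healthy_like = rng.normal(size=latent_dim)
patient_like = rng.normal(size=latent_dim) + 2.0             # shifted, i.e. abnormal
print("control-like deviation:", round(latent_deviation(healthy_like), 2))
print("patient-like deviation:", round(latent_deviation(patient_like), 2))
```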

Active Flux is an extension of the Finite Volume method that additionally incorporates point values located at cell boundaries. This gives rise to a globally continuous approximation of the solution. The method is third-order accurate. We demonstrate that a new semi-discrete Active Flux method (first described for one space dimension in Abgrall & Barsukow, 2023) can easily be used to solve nonlinear hyperbolic systems in multiple dimensions, such as the compressible Euler equations of inviscid hydrodynamics. Originally, the Active Flux method emerged as a fully discrete method and required an exact or approximate evolution operator for the point value update. For nonlinear problems such an operator is often difficult to obtain, in particular in multiple spatial dimensions. With the new approach it becomes possible to leave these difficulties behind. We introduce a multi-dimensional limiting strategy and demonstrate the performance of the new method on both Riemann problems and subsonic flows.
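To make the semi-discrete idea concrete in the simplest possible setting, the sketch below advects a smooth profile in 1D with a scheme that carries cell averages and interface point values, updating the point values with the derivative of the upwind cell's parabolic reconstruction and integrating in time with SSP-RK3. This particular reconstruction choice and integrator are assumptions made here for illustration, not necessarily the paper's exact (multi-dimensional, nonlinear) formulation.

```python
# Hedged 1D sketch in the spirit of semi-discrete Active Flux for u_t + a u_x = 0, a > 0.
import numpy as np

N, a, cfl = 100, 1.0, 0.2
dx = 1.0 / N
x_cell = (np.arange(N) + 0.5) * dx            # cell centers
x_face = (np.arange(N) + 1.0) * dx            # right interfaces

def rhs(state):
    ubar, upt = state                          # cell averages, interface point values
    upt_left = np.roll(upt, 1)                 # point value at the left interface (periodic)
    dubar = -(a / dx) * (upt - upt_left)       # exact interface fluxes for linear advection
    # d/dx of the parabola in the upwind cell, evaluated at its right interface:
    dupt = -(a / dx) * (2.0 * upt_left + 4.0 * upt - 6.0 * ubar)
    return np.array([dubar, dupt])

# Smooth periodic initial data (cell averages approximated by midpoint values).
state = np.array([np.sin(2 * np.pi * x_cell), np.sin(2 * np.pi * x_face)])
dt, t_end, t = cfl * dx / a, 1.0, 0.0
while t < t_end:
    dt = min(dt, t_end - t)
    k1 = state + dt * rhs(state)                              # SSP-RK3 stages
    k2 = 0.75 * state + 0.25 * (k1 + dt * rhs(k1))
    state = state / 3.0 + 2.0 / 3.0 * (k2 + dt * rhs(k2))
    t += dt

exact = np.sin(2 * np.pi * (x_cell - a * t_end))
print("max error in cell averages:", np.abs(state[0] - exact).max())
```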

Complexity is a fundamental concept underlying statistical learning theory that aims to inform generalization performance. Parameter count, while successful in low-dimensional settings, is not well-justified for overparameterized settings where the number of parameters exceeds the number of training samples. We revisit complexity measures based on Rissanen's principle of minimum description length (MDL) and define a novel MDL-based complexity (MDL-COMP) that remains valid for overparameterized models. MDL-COMP is defined via an optimality criterion over the encodings induced by a good Ridge estimator class. We provide an extensive theoretical characterization of MDL-COMP for linear models and kernel methods and show that it is not just a function of parameter count, but rather a function of the singular values of the design or the kernel matrix and the signal-to-noise ratio. For a linear model with $n$ observations, $d$ parameters, and i.i.d. Gaussian predictors, MDL-COMP scales linearly with $d$ when $d<n$, but the scaling is exponentially smaller, $\log d$, for $d>n$. For kernel methods, we show that MDL-COMP informs the minimax in-sample error and can decrease as the dimensionality of the input increases. We also prove that MDL-COMP upper bounds the in-sample mean squared error (MSE). Via an array of simulations and real-data experiments, we show that a data-driven Prac-MDL-COMP informs hyper-parameter tuning for optimizing test MSE with ridge regression in limited-data settings, sometimes improving upon cross-validation and (always) saving computational costs. Finally, our findings also suggest that the recently observed double descent phenomenon in overparameterized models might be a consequence of the choice of non-ideal estimators.
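Purely as an illustration of MDL-flavored hyper-parameter tuning, the sketch below selects the ridge penalty without a validation split by minimizing a generic two-part codelength: a penalized data-fit term plus a complexity term built from the singular values of the design. This objective is a stand-in chosen here for illustration and is not the paper's MDL-COMP or Prac-MDL-COMP definition.

```python
# Illustrative MDL-style selection of the ridge penalty (not the paper's formula).
import numpy as np

rng = np.random.default_rng(3)
n, d, sigma = 50, 200, 1.0                       # overparameterized: d > n
X = rng.normal(size=(n, d)) / np.sqrt(d)
beta = rng.normal(size=d)
y = X @ beta + sigma * rng.normal(size=n)
X_test = rng.normal(size=(1000, d)) / np.sqrt(d)
y_test = X_test @ beta + sigma * rng.normal(size=1000)

s = np.linalg.svd(X, compute_uv=False)           # singular values of the design

def ridge_fit(lam):
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def codelength(lam):
    b = ridge_fit(lam)
    resid = y - X @ b
    fit_term = (resid @ resid + lam * b @ b) / (2 * sigma**2)   # penalized data fit
    complexity_term = 0.5 * np.sum(np.log1p(s**2 / lam))        # assumed complexity term
    return fit_term + complexity_term

lams = np.logspace(-3, 2, 40)
best = lams[int(np.argmin([codelength(l) for l in lams]))]
test_mse = np.mean((y_test - X_test @ ridge_fit(best)) ** 2)
print(f"lambda chosen by MDL-style criterion: {best:.3g}, test MSE: {test_mse:.3f}")
```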

Active surveillance (AS) is a suitable management option for newly-diagnosed prostate cancer (PCa), which usually presents low to intermediate clinical risk. Patients enrolled in AS have their tumor closely monitored via longitudinal multiparametric magnetic resonance imaging (mpMRI), serum prostate-specific antigen tests, and biopsies. Hence, the patient is prescribed treatment when these tests identify progression to higher-risk PCa. However, current AS protocols rely on detecting tumor progression through direct observation according to standardized monitoring strategies. This approach limits the design of patient-specific AS plans and may lead to the late detection and treatment of tumor progression. Here, we propose to address these issues by leveraging personalized computational predictions of PCa growth. Our forecasts are obtained with a spatiotemporal biomechanistic model informed by patient-specific longitudinal mpMRI data. Our results show that our predictive technology can represent and forecast the global tumor burden for individual patients, achieving concordance correlation coefficients ranging from 0.93 to 0.99 across our cohort (n=7). Additionally, we identify a model-based biomarker of higher-risk PCa: the mean proliferation activity of the tumor (p=0.041). Using logistic regression, we construct a PCa risk classifier based on this biomarker that achieves an area under the receiver operating characteristic curve of 0.83. We further show that coupling our tumor forecasts with this PCa risk classifier enables the early identification of PCa progression to higher-risk disease by more than one year. Thus, we posit that our predictive technology constitutes a promising clinical decision-making tool to design personalized AS plans for PCa patients.
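The final classification step can be pictured with the short sketch below: a logistic-regression risk classifier fitted on a single biomarker (mean proliferation activity) and evaluated by the area under the ROC curve. The biomarker values are synthetic placeholders, and the patient-specific biomechanistic forecasting model is not reproduced here.

```python
# Hedged sketch: single-biomarker logistic-regression risk classifier with ROC AUC.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(4)
# Synthetic stand-in: higher mean proliferation activity for higher-risk cases.
proliferation = np.concatenate([rng.normal(0.02, 0.010, 40),    # lower-risk PCa
                                rng.normal(0.05, 0.015, 20)])   # higher-risk PCa
label = np.concatenate([np.zeros(40), np.ones(20)])             # 1 = progressed to higher risk

clf = LogisticRegression().fit(proliferation.reshape(-1, 1), label)
risk = clf.predict_proba(proliferation.reshape(-1, 1))[:, 1]
print("ROC AUC of the single-biomarker classifier:", round(roc_auc_score(label, risk), 3))
```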

The aim of this study is to analyze the effect of serum metabolites on diabetic nephropathy (DN) and to predict the prevalence of DN through a machine learning approach. The dataset consists of 548 patients treated from April 2018 to April 2019 at the Second Affiliated Hospital of Dalian Medical University (SAHDMU). We select the optimal 38 features through a least absolute shrinkage and selection operator (LASSO) regression model and 10-fold cross-validation. We compare four machine learning algorithms, including eXtreme Gradient Boosting (XGB), random forest, decision tree, and logistic regression, by AUC-ROC curves, decision curves, and calibration curves. We quantify feature importance and interaction effects in the optimal predictive model by the Shapley Additive exPlanations (SHAP) method. The XGB model has the best performance to screen for DN, with the highest AUC value of 0.966. The XGB model also gains more clinical net benefit than the others and shows better calibration. In addition, there are significant interactions between serum metabolites and duration of diabetes. We develop a predictive model with the XGB algorithm to screen for DN. C2, C5DC, Tyr, Ser, Met, C24, C4DC, and Cys contribute substantially to the model and can possibly serve as biomarkers for DN.
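A compact sketch of this kind of pipeline, using synthetic data and assuming the usual scikit-learn, xgboost, and shap packages, is given below: LASSO keeps features with non-zero coefficients, XGBoost classifies on the selected features, and SHAP ranks feature contributions. The data generator and hyperparameters are placeholders, not the tuned clinical model or the SAHDMU cohort.

```python
# Hedged sketch: LASSO feature selection -> XGBoost classifier -> SHAP ranking.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LassoCV
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score
from xgboost import XGBClassifier
import shap

X, y = make_classification(n_samples=548, n_features=80, n_informative=15, random_state=0)

# 1) LASSO with cross-validation keeps features with non-zero coefficients.
lasso = LassoCV(cv=10, random_state=0).fit(X, y)
selected = np.flatnonzero(lasso.coef_)
X_sel = X[:, selected]

# 2) Gradient-boosted trees on the selected features.
X_tr, X_te, y_tr, y_te = train_test_split(X_sel, y, test_size=0.3, random_state=0)
model = XGBClassifier(n_estimators=200, max_depth=3, learning_rate=0.1,
                      eval_metric="logloss").fit(X_tr, y_tr)
print("AUC:", round(roc_auc_score(y_te, model.predict_proba(X_te)[:, 1]), 3))

# 3) SHAP values quantify each feature's contribution to individual predictions.
shap_values = shap.TreeExplainer(model).shap_values(X_te)
ranking = np.argsort(np.abs(shap_values).mean(axis=0))[::-1]
print("top features (indices into the selected set):", ranking[:8])
```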
