欧美成人性色XXⅩXXA片在线_五月天婷婷丁香基地综合_久草精品视频在线观看_国产精品99久久久久久一二区_国产日韩专区视频在线一区二区_日日丝袜美女一区二区三区_9999久久精品国产

We propose an easy-to-use adjustment estimator for the effect of a treatment based on observational data from a single (social) network of units. The approach allows for interactions among units within the network, called interference, and for observed confounding. We define a simplified causal graph that does not differentiate between units, called generic graph. Using valid adjustment sets determined in the generic graph, we can identify the treatment effect and build a corresponding estimator. We establish the estimator's consistency and its convergence to a Gaussian limiting distribution at the parametric rate under certain regularity conditions that restrict the growth of dependencies among units. We empirically verify the theoretical properties of our estimator through a simulation study and apply it to estimate the effect of a strict facial-mask policy on the spread of COVID-19 in Switzerland.

相關內容

估計/估計量(liang)

關注 3

圖 · 估計/估計量 · 可辨認的 · 有向非循環圖 · DAG ·

2024 年 1 月 26 日

Linear-Time Algorithms for Front-Door Adjustment in Causal Graphs

Marcel Wien?bst,Benito van der Zander,Maciej Li?kiewicz

from arxiv, Extended version of paper accepted to the Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI-2024)

Causal effect estimation from observational data is a fundamental task in empirical sciences. It becomes particularly challenging when unobserved confounders are involved in a system. This paper focuses on front-door adjustment -- a classic technique which, using observed mediators allows to identify causal effects even in the presence of unobserved confounding. While the statistical properties of the front-door estimation are quite well understood, its algorithmic aspects remained unexplored for a long time. In 2022, Jeong, Tian, and Bareinboim presented the first polynomial-time algorithm for finding sets satisfying the front-door criterion in a given directed acyclic graph (DAG), with an $O(n^3(n+m))$ run time, where $n$ denotes the number of variables and $m$ the number of edges of the causal graph. In our work, we give the first linear-time, i.e., $O(n+m)$, algorithm for this task, which thus reaches the asymptotically optimal time complexity. This result implies an $O(n(n+m))$ delay enumeration algorithm of all front-door adjustment sets, again improving previous work by a factor of $n^3$. Moreover, we provide the first linear-time algorithm for finding a minimal front-door adjustment set. We offer implementations of our algorithms in multiple programming languages to facilitate practical usage and empirically validate their feasibility, even for large graphs.

生成式人工智能 · Automator · AI · MoDELS · Facebook AI Research ·

2024 年 1 月 26 日

Leveraging Generative AI for Clinical Evidence Summarization Needs to Ensure Trustworthiness

Gongbo Zhang,Qiao Jin,Denis Jered McInerney,Yong Chen,Fei Wang,Curtis L. Cole,Qian Yang,Yanshan Wang,Bradley A. Malin,Mor Peleg,Byron C. Wallace,Zhiyong Lu,Chunhua Weng,Yifan Peng

Evidence-based medicine promises to improve the quality of healthcare by empowering medical decisions and practices with the best available evidence. The rapid growth of medical evidence, which can be obtained from various sources, poses a challenge in collecting, appraising, and synthesizing the evidential information. Recent advancements in generative AI, exemplified by large language models, hold promise in facilitating the arduous task. However, developing accountable, fair, and inclusive models remains a complicated undertaking. In this perspective, we discuss the trustworthiness of generative AI in the context of automated summarization of medical evidence.

INFORMS · 模型評估 · MoDELS · Processing（編程語言） · 自動問答 ·

2024 年 1 月 25 日

LongHealth: A Question Answering Benchmark with Long Clinical Documents

Lisa Adams,Felix Busch,Tianyu Han,Jean-Baptiste Excoffier,Matthieu Ortala,Alexander L?ser,Hugo JWL. Aerts,Jakob Nikolas Kather,Daniel Truhn,Keno Bressem

from arxiv, 11 pages, 3 figures, 5 tables

Background: Recent advancements in large language models (LLMs) offer potential benefits in healthcare, particularly in processing extensive patient records. However, existing benchmarks do not fully assess LLMs' capability in handling real-world, lengthy clinical data. Methods: We present the LongHealth benchmark, comprising 20 detailed fictional patient cases across various diseases, with each case containing 5,090 to 6,754 words. The benchmark challenges LLMs with 400 multiple-choice questions in three categories: information extraction, negation, and sorting, challenging LLMs to extract and interpret information from large clinical documents. Results: We evaluated nine open-source LLMs with a minimum of 16,000 tokens and also included OpenAI's proprietary and cost-efficient GPT-3.5 Turbo for comparison. The highest accuracy was observed for Mixtral-8x7B-Instruct-v0.1, particularly in tasks focused on information retrieval from single and multiple patient documents. However, all models struggled significantly in tasks requiring the identification of missing information, highlighting a critical area for improvement in clinical data interpretation. Conclusion: While LLMs show considerable potential for processing long clinical documents, their current accuracy levels are insufficient for reliable clinical use, especially in scenarios requiring the identification of missing information. The LongHealth benchmark provides a more realistic assessment of LLMs in a healthcare setting and highlights the need for further model refinement for safe and effective clinical application. We make the benchmark and evaluation code publicly available.

專利 · Learning · 推斷 · 可辨認的 · Continuity ·

2024 年 1 月 25 日

Improving Antibody Humanness Prediction using Patent Data

Talip Ucar,Aubin Ramon,Dino Oglic,Rebecca Croasdale-Wood,Tom Diethe,Pietro Sormanni

from arxiv, 13 pages, 6 figures, Code: //github.com/AstraZeneca/SelfPAD

We investigate the potential of patent data for improving the antibody humanness prediction using a multi-stage, multi-loss training process. Humanness serves as a proxy for the immunogenic response to antibody therapeutics, one of the major causes of attrition in drug discovery and a challenging obstacle for their use in clinical settings. We pose the initial learning stage as a weakly-supervised contrastive-learning problem, where each antibody sequence is associated with possibly multiple identifiers of function and the objective is to learn an encoder that groups them according to their patented properties. We then freeze a part of the contrastive encoder and continue training it on the patent data using the cross-entropy loss to predict the humanness score of a given antibody sequence. We illustrate the utility of the patent data and our approach by performing inference on three different immunogenicity datasets, unseen during training. Our empirical results demonstrate that the learned model consistently outperforms the alternative baselines and establishes new state-of-the-art on five out of six inference tasks, irrespective of the used metric.

Networking · 查準率/準確率 · 模型評估 · 推斷 · 可約的 ·

2024 年 1 月 25 日

Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators

Yaniv Blumenfeld,Itay Hubara,Daniel Soudry

The majority of the research on the quantization of Deep Neural Networks (DNNs) is focused on reducing the precision of tensors visible by high-level frameworks (e.g., weights, activations, and gradients). However, current hardware still relies on high-accuracy core operations. Most significant is the operation of accumulating products. This high-precision accumulation operation is gradually becoming the main computational bottleneck. This is because, so far, the usage of low-precision accumulators led to a significant degradation in performance. In this work, we present a simple method to train and fine-tune high-end DNNs, to allow, for the first time, utilization of cheaper, $12$-bits accumulators, with no significant degradation in accuracy. Lastly, we show that as we decrease the accumulation precision further, using fine-grained gradient approximations can improve the DNN accuracy.

Lipschitz · Networking · Neural Networks · 約束 · 激活函數 ·

2024 年 1 月 25 日

Novel Quadratic Constraints for Extending LipSDP beyond Slope-Restricted Activations

Patricia Pauli,Aaron Havens,Alexandre Araujo,Siddharth Garg,Farshad Khorrami,Frank Allg?wer,Bin Hu

from arxiv, accepted as a conference paper at ICLR 2024

Recently, semidefinite programming (SDP) techniques have shown great promise in providing accurate Lipschitz bounds for neural networks. Specifically, the LipSDP approach (Fazlyab et al., 2019) has received much attention and provides the least conservative Lipschitz upper bounds that can be computed with polynomial time guarantees. However, one main restriction of LipSDP is that its formulation requires the activation functions to be slope-restricted on $[0,1]$, preventing its further use for more general activation functions such as GroupSort, MaxMin, and Householder. One can rewrite MaxMin activations for example as residual ReLU networks. However, a direct application of LipSDP to the resultant residual ReLU networks is conservative and even fails in recovering the well-known fact that the MaxMin activation is 1-Lipschitz. Our paper bridges this gap and extends LipSDP beyond slope-restricted activation functions. To this end, we provide novel quadratic constraints for GroupSort, MaxMin, and Householder activations via leveraging their underlying properties such as sum preservation. Our proposed analysis is general and provides a unified approach for estimating $\ell_2$ and $\ell_\infty$ Lipschitz bounds for a rich class of neural network architectures, including non-residual and residual neural networks and implicit models, with GroupSort, MaxMin, and Householder activations. Finally, we illustrate the utility of our approach with a variety of experiments and show that our proposed SDPs generate less conservative Lipschitz bounds in comparison to existing approaches.

Extensibility · Agent · Continuity · Performer · Integration ·

2024 年 1 月 25 日

An Extensible Framework for Open Heterogeneous Collaborative Perception

Yifan Lu,Yue Hu,Yiqi Zhong,Dequan Wang,Siheng Chen,Yanfeng Wang

from arxiv, Accepted by ICLR 2024. The code and data are open-sourced at //github.com/yifanlu0227/HEAL

Collaborative perception aims to mitigate the limitations of single-agent perception, such as occlusions, by facilitating data exchange among multiple agents. However, most current works consider a homogeneous scenario where all agents use identity sensors and perception models. In reality, heterogeneous agent types may continually emerge and inevitably face a domain gap when collaborating with existing agents. In this paper, we introduce a new open heterogeneous problem: how to accommodate continually emerging new heterogeneous agent types into collaborative perception, while ensuring high perception performance and low integration cost? To address this problem, we propose HEterogeneous ALliance (HEAL), a novel extensible collaborative perception framework. HEAL first establishes a unified feature space with initial agents via a novel multi-scale foreground-aware Pyramid Fusion network. When heterogeneous new agents emerge with previously unseen modalities or models, we align them to the established unified space with an innovative backward alignment. This step only involves individual training on the new agent type, thus presenting extremely low training costs and high extensibility. It also protects new agents' model details from disclosure since the training can be conducted by the agent owner locally. To enrich agents' data heterogeneity, we bring OPV2V-H, a new large-scale dataset with more diverse sensor types. Extensive experiments on OPV2V-H and DAIR-V2X datasets show that HEAL surpasses SOTA methods in performance while reducing the training parameters by 91.5% when integrating 3 new agent types. Code and data are available at: //github.com/yifanlu0227/HEAL.

圖形處理器 · 圖 · Networking · Neural Networks · 相似度 ·

2024 年 1 月 23 日

Probabilistic Demand Forecasting with Graph Neural Networks

Nikita Kozodoi,Elizaveta Zinovyeva,Simon Valentin,Jo?o Pereira,Rodrigo Agundez

from arxiv, Preprint of the paper accepted to ECML PKDD 2023 ML4ITS Workshop

Demand forecasting is a prominent business use case that allows retailers to optimize inventory planning, logistics, and core business decisions. One of the key challenges in demand forecasting is accounting for relationships and interactions between articles. Most modern forecasting approaches provide independent article-level predictions that do not consider the impact of related articles. Recent research has attempted addressing this challenge using Graph Neural Networks (GNNs) and showed promising results. This paper builds on previous research on GNNs and makes two contributions. First, we integrate a GNN encoder into a state-of-the-art DeepAR model. The combined model produces probabilistic forecasts, which are crucial for decision-making under uncertainty. Second, we propose to build graphs using article attribute similarity, which avoids reliance on a pre-defined graph structure. Experiments on three real-world datasets show that the proposed approach consistently outperforms non-graph benchmarks. We also show that our approach produces article embeddings that encode article similarity and demand dynamics and are useful for other downstream business tasks beyond forecasting.

社區發現 · Networks · Networking · 學成 · Neural Networks ·

2021 年 5 月 26 日

A Comprehensive Survey on Community Detection with Deep Learning

Xing Su,Shan Xue,Fanzhen Liu,Jia Wu,Jian Yang,Chuan Zhou,Wenbin Hu,Cecile Paris,Surya Nepal,Di Jin,Quan Z. Sheng,Philip S. Yu

A community reveals the features and connections of its members that are different from those in other communities in a network. Detecting communities is of great significance in network analysis. Despite the classical spectral clustering and statistical inference methods, we notice a significant development of deep learning techniques for community detection in recent years with their advantages in handling high dimensional network data. Hence, a comprehensive overview of community detection's latest progress through deep learning is timely to both academics and practitioners. This survey devises and proposes a new taxonomy covering different categories of the state-of-the-art methods, including deep learning-based models upon deep neural networks, deep nonnegative matrix factorization and deep sparse filtering. The main category, i.e., deep neural networks, is further divided into convolutional networks, graph attention networks, generative adversarial networks and autoencoders. The survey also summarizes the popular benchmark data sets, model evaluation metrics, and open-source implementations to address experimentation settings. We then discuss the practical applications of community detection in various domains and point to implementation scenarios. Finally, we outline future directions by suggesting challenging topics in this fast-growing deep learning field.

塑造 · 可辨認的 · Better · 目標檢測 · state-of-the-art ·

2018 年 1 月 10 日

From Superpixel to Human Shape Modelling for Carried Object Detection

Farnoosh Ghadiri,Robert Bergevin,Guillaume-Alexandre Bilodeau

Detecting carried objects is one of the requirements for developing systems to reason about activities involving people and objects. We present an approach to detect carried objects from a single video frame with a novel method that incorporates features from multiple scales. Initially, a foreground mask in a video frame is segmented into multi-scale superpixels. Then the human-like regions in the segmented area are identified by matching a set of extracted features from superpixels against learned features in a codebook. A carried object probability map is generated using the complement of the matching probabilities of superpixels to human-like regions and background information. A group of superpixels with high carried object probability and strong edge support is then merged to obtain the shape of the carried object. We applied our method to two challenging datasets, and results show that our method is competitive with or better than the state-of-the-art.