午夜剧场成年免费视_碰碰女人公开免费视频_性国产精品三級片在线免費版_激情的少妇大片免费视频观看_国产日韩精品综合无码一区二区_免费观看一级欧美视频在线_玖玖精品视频导航

Microbiome research is now moving beyond the compositional analysis of microbial taxa in a sample. Increasing evidence from large human microbiome studies suggests that functional consequences of changes in the intestinal microbiome may provide more power for studying their impact on inflammation and immune responses. Although 16S rRNA analysis is one of the most popular and a cost-effective method to profile the microbial compositions, marker-gene sequencing cannot provide direct information about the functional genes that are present in the genomes of community members. Bioinformatic tools have been developed to predict microbiome function with 16S rRNA gene data. Among them, PICRUSt2 has become one of the most popular functional profile prediction tools, which generates community-wide pathway abundances. However, no state-of-art inference tools are available to test the differences in pathway abundances between comparison groups. We have developed ggpicrust2, an R package, to do extensive differential abundance(DA) analyses and provide publishable visualization to highlight the signals.

相關內容

泛函

關注 0

Storage · 可約的 · 覆蓋 · INFORMS · 代價 ·

2023 年 5 月 9 日

Cover Your Bases: How to Minimize the Sequencing Coverage in DNA Storage Systems

Daniella Bar-Lev,Omer Sabary,Ryan Gabrys,Eitan Yaakobi

Although the expenses associated with DNA sequencing have been rapidly decreasing, the current cost stands at roughly \$1.3K/TB, which is dramatically more expensive than reading from existing archival storage solutions today. In this work, we aim to reduce not only the cost but also the latency of DNA storage by studying the DNA coverage depth problem, which aims to reduce the required number of reads to retrieve information from the storage system. Under this framework, our main goal is to understand how to optimally pair an error-correcting code with a given retrieval algorithm to minimize the sequencing coverage depth, while guaranteeing retrieval of the information with high probability. Additionally, we study the DNA coverage depth problem under the random-access setup.

假陰性 · contrastive · 圖像字幕 · 多峰值 · 最優化 ·

2023 年 5 月 9 日

Exploiting Pseudo Image Captions for Multimodal Summarization

Chaoya Jiang,Rui Xie,Wei Ye,Jinan Sun,Shikun Zhang

from arxiv, Accepted at ACL2023 Findings

Cross-modal contrastive learning in vision language pretraining (VLP) faces the challenge of (partial) false negatives. In this paper, we study this problem from the perspective of Mutual Information (MI) optimization. It is common sense that InfoNCE loss used in contrastive learning will maximize the lower bound of MI between anchors and their positives, while we theoretically prove that MI involving negatives also matters when noises commonly exist. Guided by a more general lower bound form for optimization, we propose a contrastive learning strategy regulated by progressively refined cross-modal similarity, to more accurately optimize MI between an image/text anchor and its negative texts/images instead of improperly minimizing it. Our method performs competitively on four downstream cross-modal tasks and systematically balances the beneficial and harmful effects of (partial) false negative samples under theoretical guidance.

Processing（編程語言） · 統計量 · 估計/估計量 · 語言模型化 · INTERACT ·

2023 年 5 月 9 日

Vrunda Gadesha,Keyur D Joshi,Shefali Naik

from arxiv, ICT Analysis and Applications: Proceedings of ICT4SD 2022 pp 627-638

'Mahabharata' is the most popular among many Indian pieces of literature referred to in many domains for completely different purposes. This text itself is having various dimension and aspects which is useful for the human being in their personal life and professional life. This Indian Epic is originally written in the Sanskrit Language. Now in the era of Natural Language Processing, Artificial Intelligence, Machine Learning, and Human-Computer interaction this text can be processed according to the domain requirement. It is interesting to process this text and get useful insights from Mahabharata. The limitation of the humans while analyzing Mahabharata is that they always have a sentiment aspect towards the story narrated by the author. Apart from that, the human cannot memorize statistical or computational details, like which two words are frequently coming in one sentence? What is the average length of the sentences across the whole literature? Which word is the most popular word across the text, what are the lemmas of the words used across the sentences? Thus, in this paper, we propose an NLP pipeline to get some statistical and computational insights along with the most relevant word searching method from the largest epic 'Mahabharata'. We stacked the different text-processing approaches to articulate the best results which can be further used in the various domain where Mahabharata needs to be referred.

INFORMS · Processing（編程語言） · Learning · 可約的 · 隨機采樣 ·

2023 年 5 月 9 日

Medical Image Deidentification, Cleaning and Compression Using Pylogik

Adrienne Kline,Vinesh Appadurai,Yuan Luo,Sanjiv Shah

from arxiv, updates needed to manuscript

Leveraging medical record information in the era of big data and machine learning comes with the caveat that data must be cleaned and deidentified. Facilitating data sharing and harmonization for multi-center collaborations are particularly difficult when protected health information (PHI) is contained or embedded in image meta-data. We propose a novel library in the Python framework, called PyLogik, to help alleviate this issue for ultrasound images, which are particularly challenging because of the frequent inclusion of PHI directly on the images. PyLogik processes the image volumes through a series of text detection/extraction, filtering, thresholding, morphological and contour comparisons. This methodology deidentifies the images, reduces file sizes, and prepares image volumes for applications in deep learning and data sharing. To evaluate its effectiveness in the identification of regions of interest (ROI), a random sample of 50 cardiac ultrasounds (echocardiograms) were processed through PyLogik, and the outputs were compared with the manual segmentations by an expert user. The Dice coefficient of the two approaches achieved an average value of 0.976. Next, an investigation was conducted to ascertain the degree of information compression achieved using the algorithm. Resultant data was found to be on average approximately 72% smaller after processing by PyLogik. Our results suggest that PyLogik is a viable methodology for ultrasound data cleaning and deidentification, determining ROI, and file compression which will facilitate efficient storage, use, and dissemination of ultrasound data.

MoDELS · 線性的 · 線性模型 · 損失 · SOTA ·

2023 年 5 月 8 日

Mlinear: Rethink the Linear Model for Time-series Forecasting

Jianing Chen,Chuhao Chen,Xiangxu Meng

from arxiv, 8 pages,1 figure,4 tables

Recently, significant advancements have been made in time-series forecasting research, with an increasing focus on analyzing the inherent characteristics of time-series data, rather than solely focusing on designing forecasting models.In this paper, we follow this trend and carefully examine previous work to propose an efficient time series forecasting model based on linear models. The model consists of two important core components: (1) the integration of different semantics brought by single-channel and multi-channel data for joint forecasting; (2) the use of a novel loss function that replaces the traditional MSE loss and MAE loss to achieve higher forecasting accuracy.On widely-used benchmark time series datasets, our model not only outperforms the current SOTA, but is also 10 $\times$ speedup and has fewer parameters than the latest SOTA model.

state-of-the-art · Networking · Automator · 標注 · 假陰性 ·

2023 年 5 月 8 日

SILOP: An Automated Framework for Semantic Segmentation Using Image Labels Based on Object Perimeters

Erik Ostrowski,Bharath Srinivas Prabakaran,Muhammad Shafique

from arxiv, Accepted for Publication at the International Joint Conference on Neural Networks (IJCNN), July 2023, Gold Coast, Queensland, Australia

Achieving high-quality semantic segmentation predictions using only image-level labels enables a new level of real-world applicability. Although state-of-the-art networks deliver reliable predictions, the amount of handcrafted pixel-wise annotations to enable these results are not feasible in many real-world applications. Hence, several works have already targeted this bottleneck, using classifier-based networks like Class Activation Maps~\cite{CAM} (CAMs) as a base. Addressing CAM's weaknesses of fuzzy borders and incomplete predictions, state-of-the-art approaches rely only on adding regulations to the classifier loss or using pixel-similarity-based refinement after the fact. We propose a framework that introduces an additional module using object perimeters for improved saliency. We define object perimeter information as the line separating the object and background. Our new PerimeterFit module will be applied to pre-refine the CAM predictions before using the pixel-similarity-based network. In this way, our PerimeterFit increases the quality of the CAM prediction while simultaneously improving the false negative rate. We investigated a wide range of state-of-the-art unsupervised semantic segmentation networks and edge detection techniques to create useful perimeter maps, which enable our framework to predict object locations with sharper perimeters. We achieved up to 1.5% improvement over frameworks without our PerimeterFit module. We conduct an exhaustive analysis to illustrate that SILOP enhances existing state-of-the-art frameworks for image-level-based semantic segmentation. The framework is open-source and accessible online at //github.com/ErikOstrowski/SILOP.

INTERACT · Automator · ONCE · 可理解性 · UniFormer ·

2023 年 5 月 8 日

SmartState: A Protocol-Driven Human Interface

Samuel E. Armstrong,Aaron D. Mullen,V. K. Cody Bumgardner

from arxiv, 12 pages, 23 figures, submitted to AMIA 2023 Annual Symposium

Since the inception of human research studies, researchers often need to interact with participants on a set schedule to collect data. While some human research is automated, most is not; which costs researchers both time and money. Usually, user-provided data collection consists of surveys administered via telephone or email. While these methods are simplest, they are tedious for the survey administrators, which could incur fatigue and potentially lead to collection mistakes. A solution to this was the creation of "chatbots". Early developments relied on mostly rule-based tactics (e.g. ELIZA), which were suitable for uniform input. However, as the complexity of interactions increases, rule-based systems begin breaking down since there exist a variety of ways for a user to express the same intention. This is especially true when tracking states within a research study (or protocol). Recently, natural language processing (NLP) models and, subsequently, virtual assistants have become increasingly more sophisticated when communicating with users. Examples of these efforts range from research studies to commercial health products. This project leverages recent advancements in conversational artificial intelligence (AI), speech-to-text, natural language understanding (NLU), and finite-state machines to automate protocols, specifically in research settings. This application must be generalized, fully customizable, and irrespective of any research study. These parameters allow new research protocols to be created quickly once envisioned. With this in mind, I present SmartState, a fully-customizable, state-driven protocol manager combined with supporting AI components to autonomously manage user data and intelligently determine the intention of users through chat and end device interactions to drive protocols.

cancer · 數據集 · AIM · 查準率/準確率 · 分解的 ·

2023 年 5 月 5 日

Breast Cancer Immunohistochemical Image Generation: a Benchmark Dataset and Challenge Review

Chuang Zhu,Shengjie Liu,Feng Xu,Zekuan Yu,Arpit Aggarwal,Germán Corredor,Anant Madabhushi,Qixun Qu,Hongwei Fan,Fangda Li,Yueheng Li,Xianchao Guan,Yongbing Zhang,Vivek Kumar Singh,Farhan Akram,Md. Mostafa Kamal Sarker,Zhongyue Shi,Mulan Jin

from arxiv, 13 pages, 11 figures, 2tables

For invasive breast cancer, immunohistochemical (IHC) techniques are often used to detect the expression level of human epidermal growth factor receptor-2 (HER2) in breast tissue to formulate a precise treatment plan. From the perspective of saving manpower, material and time costs, directly generating IHC-stained images from hematoxylin and eosin (H&E) stained images is a valuable research direction. Therefore, we held the breast cancer immunohistochemical image generation challenge, aiming to explore novel ideas of deep learning technology in pathological image generation and promote research in this field. The challenge provided registered H&E and IHC-stained image pairs, and participants were required to use these images to train a model that can directly generate IHC-stained images from corresponding H&E-stained images. We selected and reviewed the five highest-ranking methods based on their PSNR and SSIM metrics, while also providing overviews of the corresponding pipelines and implementations. In this paper, we further analyze the current limitations in the field of breast cancer immunohistochemical image generation and forecast the future development of this field. We hope that the released dataset and the challenge will inspire more scholars to jointly study higher-quality IHC-stained image generation.

圖像字幕 · Extensibility · 圖 · 圖卷積神經網絡/圖卷積網絡 · Performer ·

2018 年 9 月 19 日

Exploring Visual Relationship for Image Captioning

Ting Yao,Yingwei Pan,Yehao Li,Tao Mei

from arxiv, ECCV 2018

It is always well believed that modeling relationships between objects would be helpful for representing and eventually describing an image. Nevertheless, there has not been evidence in support of the idea on image description generation. In this paper, we introduce a new design to explore the connections between objects for image captioning under the umbrella of attention-based encoder-decoder framework. Specifically, we present Graph Convolutional Networks plus Long Short-Term Memory (dubbed as GCN-LSTM) architecture that novelly integrates both semantic and spatial object relationships into image encoder. Technically, we build graphs over the detected objects in an image based on their spatial and semantic connections. The representations of each region proposed on objects are then refined by leveraging graph structure through GCN. With the learnt region-level features, our GCN-LSTM capitalizes on LSTM-based captioning framework with attention mechanism for sentence generation. Extensive experiments are conducted on COCO image captioning dataset, and superior results are reported when comparing to state-of-the-art approaches. More remarkably, GCN-LSTM increases CIDEr-D performance from 120.1% to 128.7% on COCO testing set.

圖像字幕 · 模型評估 · MoDELS · 學成 · 標注 ·

2018 年 5 月 13 日

Image Captioning

Vikram Mullachery,Vishal Motwani

from arxiv, arXiv admin note: text overlap with arXiv:1609.06647 by other authors

This paper discusses and demonstrates the outcomes from our experimentation on Image Captioning. Image captioning is a much more involved task than image recognition or classification, because of the additional challenge of recognizing the interdependence between the objects/concepts in the image and the creation of a succinct sentential narration. Experiments on several labeled datasets show the accuracy of the model and the fluency of the language it learns solely from image descriptions. As a toy application, we apply image captioning to create video captions, and we advance a few hypotheses on the challenges we encountered.