国产又色又爽又黄又免费软件_美女让男生痛桶白浆动态视频_欧美一区二区三区在观看_美女视频黄频大全视频网址_精品毛片一区二区_国产精品竹菊久久AV蜜桃免费_欧美性爱在线观看

Multi-modal image registration is a crucial pre-processing step in many medical applications. However, it is a challenging task due to the complex intensity relationships between different imaging modalities, which can result in large discrepancy in image appearance. The success of multi-modal image registration, whether it is conventional or learning based, is predicated upon the choice of an appropriate distance (or similarity) measure. Particularly, deep learning registration algorithms lack in accuracy or even fail completely when attempting to register data from an "unseen" modality. In this work, we present Modality Agnostic Distance (MAD), a deep image distance}] measure that utilises random convolutions to learn the inherent geometry of the images while being robust to large appearance changes. Random convolutions are geometry-preserving modules which we use to simulate an infinite number of synthetic modalities alleviating the need for aligned paired data during training. We can therefore train MAD on a mono-modal dataset and successfully apply it to a multi-modal dataset. We demonstrate that not only can MAD affinely register multi-modal images successfully, but it has also a larger capture range than traditional measures such as Mutual Information and Normalised Gradient Fields.

相關內容

圖(tu)像配準

關注 810

圖(tu)(tu)(tu)像(xiang)(xiang)配準是圖(tu)(tu)(tu)像(xiang)(xiang)處(chu)理研究領域中的(de)(de)(de)(de)一(yi)(yi)個(ge)典型問(wen)(wen)題(ti)(ti)和技術難點，其目(mu)的(de)(de)(de)(de)在于比(bi)較或(huo)融(rong)合針對同一(yi)(yi)對象在不(bu)(bu)同條件下獲取的(de)(de)(de)(de)圖(tu)(tu)(tu)像(xiang)(xiang)，例如圖(tu)(tu)(tu)像(xiang)(xiang)會來自不(bu)(bu)同的(de)(de)(de)(de)采集設(she)備(bei)，取自不(bu)(bu)同的(de)(de)(de)(de)時(shi)間(jian)，不(bu)(bu)同的(de)(de)(de)(de)拍攝(she)視(shi)(shi)角等等，有時(shi)也需要用到針對不(bu)(bu)同對象的(de)(de)(de)(de)圖(tu)(tu)(tu)像(xiang)(xiang)配準問(wen)(wen)題(ti)(ti)。具體(ti)(ti)地(di)說，對于一(yi)(yi)組(zu)圖(tu)(tu)(tu)像(xiang)(xiang)數(shu)據集中的(de)(de)(de)(de)兩(liang)幅(fu)(fu)圖(tu)(tu)(tu)像(xiang)(xiang)，通(tong)過尋(xun)找(zhao)一(yi)(yi)種空間(jian)變(bian)換把一(yi)(yi)幅(fu)(fu)圖(tu)(tu)(tu)像(xiang)(xiang)映射到另一(yi)(yi)幅(fu)(fu)圖(tu)(tu)(tu)像(xiang)(xiang)，使(shi)得兩(liang)圖(tu)(tu)(tu)中對應(ying)于空間(jian)同一(yi)(yi)位置的(de)(de)(de)(de)點一(yi)(yi)一(yi)(yi)對應(ying)起來，從而達到信息融(rong)合的(de)(de)(de)(de)目(mu)的(de)(de)(de)(de)。該技術在計(ji)算機視(shi)(shi)覺、醫學(xue)(xue)圖(tu)(tu)(tu)像(xiang)(xiang)處(chu)理以及材料力(li)學(xue)(xue)等領域都具有廣泛的(de)(de)(de)(de)應(ying)用。根據具體(ti)(ti)應(ying)用的(de)(de)(de)(de)不(bu)(bu)同，有的(de)(de)(de)(de)側重(zhong)于通(tong)過變(bian)換結果融(rong)合兩(liang)幅(fu)(fu)圖(tu)(tu)(tu)像(xiang)(xiang)，有的(de)(de)(de)(de)側重(zhong)于研究變(bian)換本身以獲得對象的(de)(de)(de)(de)一(yi)(yi)些力(li)學(xue)(xue)屬性。

Learning · Extensibility · state-of-the-art · 哈爾濱工業大學（HIT） · 可約的 ·

2023 年 10 月 23 日

SeLeP: Learning Based Semantic Prefetching for Exploratory Database Workloads

Farzaneh Zirak,Farhana Choudhury,Renata Borovica-Gajic

Prefetching is a crucial technique employed in traditional databases to enhance interactivity, particularly in the context of data exploitation. Data exploration is a query processing paradigm in which users search for insights buried in the data, often not knowing what exactly they are looking for. Data exploratory tools deal with multiple challenges such as the need for interactivity with no a priori knowledge being present to help with the system tuning. The state-of-the-art prefetchers are specifically designed for navigational workloads only, where the number of possible actions is limited. The prefetchers that work with SQL-based workloads, on the other hand, mainly rely on data logical addresses rather than the data semantics. They fail to predict complex access patterns in cases where the database size is substantial, resulting in an extensive address space, or when there is frequent co-accessing of data. In this paper, we propose SeLeP, a semantic prefetcher that makes prefetching decisions for both types of workloads, based on the encoding of the data values contained inside the accessed blocks. Following the popular path of using machine learning approaches to automatically learn the hidden patterns, we formulate the prefetching task as a time-series forecasting problem and use an encoder-decoder LSTM architecture to learn the data access pattern. Our extensive experiments, across real-life exploratory workloads, demonstrate that SeLeP improves the hit ratio up to 40% and reduces I/O time up to 45% compared to the state-of-the-art, attaining impressive 95% hit ratio and 80% I/O reduction on average.

語言模型化 · MoDELS · state-of-the-art · Performer · HTTPS ·

2023 年 10 月 20 日

ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction

Yaorui Shi,An Zhang,Enzhi Zhang,Zhiyuan Liu,Xiang Wang

Predicting chemical reactions, a fundamental challenge in chemistry, involves forecasting the resulting products from a given reaction process. Conventional techniques, notably those employing Graph Neural Networks (GNNs), are often limited by insufficient training data and their inability to utilize textual information, undermining their applicability in real-world applications. In this work, we propose ReLM, a novel framework that leverages the chemical knowledge encoded in language models (LMs) to assist GNNs, thereby enhancing the accuracy of real-world chemical reaction predictions. To further enhance the model's robustness and interpretability, we incorporate the confidence score strategy, enabling the LMs to self-assess the reliability of their predictions. Our experimental results demonstrate that ReLM improves the performance of state-of-the-art GNN-based methods across various chemical reaction datasets, especially in out-of-distribution settings. Codes are available at //github.com/syr-cn/ReLM.

估計/估計量 · 穩健性 · 稀疏 · Performer · MoDELS ·

2023 年 10 月 19 日

RGM: A Robust Generalist Matching Model

Songyan Zhang,Xinyu Sun,Hao Chen,Bo Li,Chunhua Shen

from arxiv, 17 pages. Code is available at: //github.com/aim-uofa/RGM

Finding corresponding pixels within a pair of images is a fundamental computer vision task with various applications. Due to the specific requirements of different tasks like optical flow estimation and local feature matching, previous works are primarily categorized into dense matching and sparse feature matching focusing on specialized architectures along with task-specific datasets, which may somewhat hinder the generalization performance of specialized models. In this paper, we propose a deep model for sparse and dense matching, termed RGM (Robust Generalist Matching). In particular, we elaborately design a cascaded GRU module for refinement by exploring the geometric similarity iteratively at multiple scales following an additional uncertainty estimation module for sparsification. To narrow the gap between synthetic training samples and real-world scenarios, we build a new, large-scale dataset with sparse correspondence ground truth by generating optical flow supervision with greater intervals. As such, we are able to mix up various dense and sparse matching datasets, significantly improving the training diversity. The generalization capacity of our proposed RGM is greatly improved by learning the matching and uncertainty estimation in a two-stage manner on the large, mixed data. Superior performance is achieved for zero-shot matching and downstream geometry estimation across multiple datasets, outperforming the previous methods by a large margin.

Pair · 數據集 · 得分 · Seven · 圖像配準 ·

2023 年 10 月 19 日

TRUSTED: The Paired 3D Transabdominal Ultrasound and CT Human Data for Kidney Segmentation and Registration Research

William Ndzimbong,Cyril Fourniol,Loic Themyr,Nicolas Thome,Yvonne Keeza,Beniot Sauer,Pierre-Thierry Piechaud,Arnaud Mejean,Jacques Marescaux,Daniel George,Didier Mutter,Alexandre Hostettler,Toby Collins

from arxiv, Alexandre Hostettler, and Toby Collins share last authorship

Inter-modal image registration (IMIR) and image segmentation with abdominal Ultrasound (US) data has many important clinical applications, including image-guided surgery, automatic organ measurement and robotic navigation. However, research is severely limited by the lack of public datasets. We propose TRUSTED (the Tridimensional Renal Ultra Sound TomodEnsitometrie Dataset), comprising paired transabdominal 3DUS and CT kidney images from 48 human patients (96 kidneys), including segmentation, and anatomical landmark annotations by two experienced radiographers. Inter-rater segmentation agreement was over 94 (Dice score), and gold-standard segmentations were generated using the STAPLE algorithm. Seven anatomical landmarks were annotated, important for IMIR systems development and evaluation. To validate the dataset's utility, 5 competitive Deep Learning models for automatic kidney segmentation were benchmarked, yielding average DICE scores from 83.2% to 89.1% for CT, and 61.9% to 79.4% for US images. Three IMIR methods were benchmarked, and Coherent Point Drift performed best with an average Target Registration Error of 4.53mm. The TRUSTED dataset may be used freely researchers to develop and validate new segmentation and IMIR methods.

garment modeling · 設計 · Automator · MoDELS · Principle ·

2023 年 10 月 19 日

GarmentCode: Programming Parametric Sewing Patterns

Maria Korosteleva,Olga Sorkine-Hornung

from arxiv, Presented at SIGGRAPH Asia 2023

Garment modeling is an essential task of the global apparel industry and a core part of digital human modeling. Realistic representation of garments with valid sewing patterns is key to their accurate digital simulation and eventual fabrication. However, little-to-no computational tools provide support for bridging the gap between high-level construction goals and low-level editing of pattern geometry, e.g., combining or switching garment elements, semantic editing, or design exploration that maintains the validity of a sewing pattern. We suggest the first DSL for garment modeling -- GarmentCode -- that applies principles of object-oriented programming to garment construction and allows designing sewing patterns in a hierarchical, component-oriented manner. The programming-based paradigm naturally provides unique advantages of component abstraction, algorithmic manipulation, and free-form design parametrization. We additionally support the construction process by automating typical low-level tasks like placing a dart at a desired location. In our prototype garment configurator, users can manipulate meaningful design parameters and body measurements, while the construction of pattern geometry is handled by garment programs implemented with GarmentCode. Our configurator enables the free exploration of rich design spaces and the creation of garments using interchangeable, parameterized components. We showcase our approach by producing a variety of garment designs and retargeting them to different body shapes using our configurator. Project page: //igl.ethz.ch/projects/garmentcode/

Extensibility · 學成 · SSL · 目標檢測 · 3D ·

2020 年 3 月 20 日

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Dong Wang,Yuan Zhang,Kexin Zhang,Liwei Wang

from arxiv, Accepted by CVPR 2020

Applying artificial intelligence techniques in medical imaging is one of the most promising areas in medicine. However, most of the recent success in this area highly relies on large amounts of carefully annotated data, whereas annotating medical images is a costly process. In this paper, we propose a novel method, called FocalMix, which, to the best of our knowledge, is the first to leverage recent advances in semi-supervised learning (SSL) for 3D medical image detection. We conducted extensive experiments on two widely used datasets for lung nodule detection, LUNA16 and NLST. Results show that our proposed SSL methods can achieve a substantial improvement of up to 17.3% over state-of-the-art supervised learning approaches with 400 unlabeled CT scans.

圖像分割 · MoDELS · 深度學習 · Vision · 學成 ·

2020 年 1 月 15 日

Image Segmentation Using Deep Learning: A Survey

Shervin Minaee,Yuri Boykov,Fatih Porikli,Antonio Plaza,Nasser Kehtarnavaz,Demetri Terzopoulos

Image segmentation is a key topic in image processing and computer vision with applications such as scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality, and image compression, among many others. Various algorithms for image segmentation have been developed in the literature. Recently, due to the success of deep learning models in a wide range of vision applications, there has been a substantial amount of works aimed at developing image segmentation approaches using deep learning models. In this survey, we provide a comprehensive review of the literature at the time of this writing, covering a broad spectrum of pioneering works for semantic and instance-level segmentation, including fully convolutional pixel-labeling networks, encoder-decoder architectures, multi-scale and pyramid based approaches, recurrent networks, visual attention models, and generative models in adversarial settings. We investigate the similarity, strengths and challenges of these deep learning models, examine the most widely used datasets, report performances, and discuss promising future research directions in this area.

蒸餾 · BERT · 語言模型化 · Performer · 可理解性 ·

2019 年 9 月 23 日

TinyBERT: Distilling BERT for Natural Language Understanding

Xiaoqi Jiao,Yichun Yin,Lifeng Shang,Xin Jiang,Xiao Chen,Linlin Li,Fang Wang,Qun Liu

from arxiv, 13 pages, 2 figures, 9 tables

Language model pre-training, such as BERT, has significantly improved the performances of many natural language processing tasks. However, pre-trained language models are usually computationally expensive and memory intensive, so it is difficult to effectively execute them on some resource-restricted devices. To accelerate inference and reduce model size while maintaining accuracy, we firstly propose a novel transformer distillation method that is a specially designed knowledge distillation (KD) method for transformer-based models. By leveraging this new KD method, the plenty of knowledge encoded in a large teacher BERT can be well transferred to a small student TinyBERT. Moreover, we introduce a new two-stage learning framework for TinyBERT, which performs transformer distillation at both the pre-training and task-specific learning stages. This framework ensures that TinyBERT can capture both the general-domain and task-specific knowledge of the teacher BERT. TinyBERT is empirically effective and achieves comparable results with BERT in GLUE datasets, while being 7.5x smaller and 9.4x faster on inference. TinyBERT is also significantly better than state-of-the-art baselines, even with only about 28% parameters and 31% inference time of baselines.

state-of-the-art · 可理解性 · BERT · 去噪自編碼器 · Performer ·

2019 年 6 月 19 日

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang,Zihang Dai,Yiming Yang,Jaime Carbonell,Ruslan Salakhutdinov,Quoc V. Le

from arxiv, Pretrained models and code are available at //github.com/zihangdai/xlnet

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling. However, relying on corrupting the input with masks, BERT neglects dependency between the masked positions and suffers from a pretrain-finetune discrepancy. In light of these pros and cons, we propose XLNet, a generalized autoregressive pretraining method that (1) enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and (2) overcomes the limitations of BERT thanks to its autoregressive formulation. Furthermore, XLNet integrates ideas from Transformer-XL, the state-of-the-art autoregressive model, into pretraining. Empirically, XLNet outperforms BERT on 20 tasks, often by a large margin, and achieves state-of-the-art results on 18 tasks including question answering, natural language inference, sentiment analysis, and document ranking.

Performer · 判別器 · 正例 · 假陽性 · 監督 ·

2018 年 5 月 24 日

DSGAN: Generative Adversarial Training for Distant Supervision Relation Extraction

Pengda Qin,Weiran Xu,William Yang Wang

Distant supervision can effectively label data for relation extraction, but suffers from the noise labeling problem. Recent works mainly perform soft bag-level noise reduction strategies to find the relatively better samples in a sentence bag, which is suboptimal compared with making a hard decision of false positive samples in sentence level. In this paper, we introduce an adversarial learning framework, which we named DSGAN, to learn a sentence-level true-positive generator. Inspired by Generative Adversarial Networks, we regard the positive samples generated by the generator as the negative samples to train the discriminator. The optimal generator is obtained until the discrimination ability of the discriminator has the greatest decline. We adopt the generator to filter distant supervision training dataset and redistribute the false positive instances into the negative set, in which way to provide a cleaned dataset for relation classification. The experimental results show that the proposed strategy significantly improves the performance of distant supervision relation extraction comparing to state-of-the-art systems.