
This paper reports a new real-time robot-centered 3D-2D vascular image alignment algorithm that is robust to outliers and can align nonrigid shapes. Few works have managed to achieve both real-time and accurate performance for vascular intervention robots. This work bridges high-accuracy 3D-2D registration techniques and the computational-efficiency requirements of intervention robot applications. We cast centerline-based vascular 3D-2D image registration as an iterative Perspective-n-Point (PnP) problem and propose to solve it with the Levenberg-Marquardt solver on the Lie manifold. Then, the recently developed Reproducing Kernel Hilbert Space (RKHS) algorithm is introduced to overcome the ``big-to-small'' problem in typical robotic scenarios. Finally, an iterative reweighted least-squares scheme is applied to solve the RKHS-based formulation efficiently. Experiments indicate that the proposed algorithm performs registration at over 50 Hz (rigid) and 20 Hz (nonrigid) and achieves registration accuracy competitive with other works. These results indicate that our iterative PnP is suitable for future vascular intervention robot applications.
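To make the optimisation step concrete, here is a minimal sketch of a Levenberg-Marquardt PnP loop with a rotation-manifold retraction. All names, the finite-difference Jacobian, and the fixed damping are illustrative assumptions; the paper's RKHS robustification and reweighting are not reproduced.

```python
# Minimal LM-on-SE(3) sketch for PnP (illustrative, not the paper's code).
import numpy as np

def hat(w):
    return np.array([[0, -w[2], w[1]], [w[2], 0, -w[0]], [-w[1], w[0], 0]])

def exp_so3(w):
    w = np.asarray(w, float)
    th = np.linalg.norm(w)
    if th < 1e-12:
        return np.eye(3) + hat(w)
    W = hat(w / th)
    return np.eye(3) + np.sin(th) * W + (1 - np.cos(th)) * W @ W

def project(K, R, t, X):
    Xc = X @ R.T + t              # world -> camera
    uv = Xc[:, :2] / Xc[:, 2:3]   # perspective division
    return uv @ K[:2, :2].T + K[:2, 2]

def residuals(K, R, t, X, u_obs):
    return (project(K, R, t, X) - u_obs).ravel()

def lm_pnp(K, X, u_obs, R, t, iters=20, lam=1e-3):
    for _ in range(iters):
        r = residuals(K, R, t, X, u_obs)
        J = np.zeros((r.size, 6))           # numeric Jacobian wrt (dw, dt)
        eps = 1e-6
        for i in range(6):
            d = np.zeros(6); d[i] = eps
            rp = residuals(K, exp_so3(d[:3]) @ R, t + d[3:], X, u_obs)
            J[:, i] = (rp - r) / eps
        dx = np.linalg.solve(J.T @ J + lam * np.eye(6), -J.T @ r)
        R, t = exp_so3(dx[:3]) @ R, t + dx[3:]   # manifold retraction
    return R, t

rng = np.random.default_rng(0)
K = np.array([[500., 0, 320], [0, 500., 240], [0, 0, 1]])
X = rng.uniform(-1, 1, (30, 3)) + [0, 0, 5]     # points in front of camera
u = project(K, np.eye(3), np.zeros(3), X)        # ground-truth pose: identity
R, t = lm_pnp(K, X, u, exp_so3(np.array([0.05, 0, 0])), np.array([0.1, 0, 0]))
print(np.linalg.norm(t))                         # ~0 after convergence
```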

Related Content

Image registration is a classic problem and technical challenge in image processing research. Its goal is to compare or fuse images of the same object acquired under different conditions, for example images from different acquisition devices, taken at different times, or from different viewpoints; registration across different objects is sometimes needed as well. Concretely, given two images from a dataset, one seeks a spatial transformation that maps one image onto the other so that points corresponding to the same spatial location are brought into one-to-one correspondence, thereby achieving information fusion. The technique is widely used in computer vision, medical image processing, and materials mechanics. Depending on the application, some uses focus on fusing the two images through the resulting transformation, while others study the transformation itself to recover mechanical properties of the object.
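As a minimal illustration of the core idea, the sketch below resamples a "moving" image under a spatial transform so corresponding points line up; a hand-picked affine warp stands in for a transform that would normally be estimated from the data.

```python
# Apply a known affine transform to align an image (illustrative only).
import numpy as np
from scipy.ndimage import affine_transform

moving = np.random.rand(64, 64)              # stand-in for the image to align
A = np.array([[np.cos(0.1), -np.sin(0.1)],
              [np.sin(0.1),  np.cos(0.1)]])  # small rotation
shift = np.array([2.0, -3.0])                # small translation
# Resample `moving` so that registered(y) = moving(A @ y + shift).
registered = affine_transform(moving, A, offset=shift, order=1)
```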

Implementation of many statistical methods for large, multivariate data sets requires solving a linear system whose dimension, depending on the method, is that of the number of observations or of each individual data vector. This is often the limiting factor in scaling the method with data size and complexity. In this paper we illustrate the use of Krylov subspace methods to address this issue in a statistical solution to a source separation problem in cosmology, where the data size is prohibitively large for direct solution of the required system. Two distinct approaches, adapted from techniques in the literature, are described: one that applies the method of conjugate gradients directly to the Kronecker-structured problem and another that reformulates the system as a Sylvester matrix equation. We show that both approaches produce an accurate solution within an acceptable computation time and with practical memory requirements for the data sizes currently available.
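A minimal sketch of the first approach, under assumed problem sizes: conjugate gradients applied matrix-free to a Kronecker-structured system (A ⊗ B) x = b, using the identity that, for a row-major vectorisation of X, (A ⊗ B) vec(X) = vec(A X B^T), so the Kronecker product is never formed.

```python
# Matrix-free CG on a Kronecker-structured SPD system (illustrative sizes).
import numpy as np
from scipy.sparse.linalg import LinearOperator, cg

rng = np.random.default_rng(0)
m, n = 40, 30
A = np.eye(m) + 0.1 * rng.standard_normal((m, m)); A = A @ A.T  # SPD factor
B = np.eye(n) + 0.1 * rng.standard_normal((n, n)); B = B @ B.T  # SPD factor

def kron_matvec(x):
    X = x.reshape(m, n)            # row-major reshape, kept consistent below
    return (A @ X @ B.T).ravel()   # equals kron(A, B) @ x, without kron

op = LinearOperator((m * n, m * n), matvec=kron_matvec)
b = rng.standard_normal(m * n)
x, info = cg(op, b)
print(info)                        # 0 on successful convergence
```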

This paper deals with surrogate modelling of a computer code output in a hierarchical multi-fidelity context, i.e., when the output can be evaluated at different levels of accuracy and computational cost. Using observations of the output at low- and high-fidelity levels, we propose a method, called GPBNN, that combines Gaussian process (GP) regression and a Bayesian neural network (BNN). The low-fidelity output is treated as a single-fidelity code using classical GP regression. The high-fidelity output is approximated by a BNN that incorporates, in addition to the high-fidelity observations, well-chosen realisations of the low-fidelity output emulator. The predictive uncertainty of the final surrogate model is then quantified through a complete characterisation of the uncertainties of the different models and their interaction. GPBNN is compared with most multi-fidelity regression methods that quantify prediction uncertainty.
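A minimal sketch of the two-stage idea, on synthetic data: fit a GP to the cheap low-fidelity code, then feed its predictions as an extra feature to a high-fidelity regressor. A plain MLP stands in for the Bayesian neural network, so GPBNN's uncertainty propagation is not reproduced here.

```python
# Two-stage multi-fidelity surrogate sketch (MLP stands in for the BNN).
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
f_lo = lambda x: np.sin(8 * x)                  # cheap code
f_hi = lambda x: np.sin(8 * x) + 0.2 * x**2     # expensive code

X_lo = rng.uniform(0, 1, (60, 1)); y_lo = f_lo(X_lo).ravel()
X_hi = rng.uniform(0, 1, (10, 1)); y_hi = f_hi(X_hi).ravel()

gp = GaussianProcessRegressor(kernel=RBF(0.1)).fit(X_lo, y_lo)
# The high-fidelity model sees both x and the low-fidelity emulator at x.
feats = lambda X: np.hstack([X, gp.predict(X).reshape(-1, 1)])
nn = MLPRegressor(hidden_layer_sizes=(32, 32), max_iter=5000,
                  random_state=0).fit(feats(X_hi), y_hi)

X_test = np.linspace(0, 1, 5).reshape(-1, 1)
print(nn.predict(feats(X_test)))
```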

Domain generalization is critical for real-world applications of machine learning to microscopy images, including histopathology and fluorescence imaging. Artifacts in these modalities arise through a complex combination of factors relating to tissue collection and laboratory processing, as well as factors intrinsic to patient samples. In fluorescence imaging, these artifacts stem from variations across experimental batches. The complexity and subtlety of these artifacts make the enumeration of data domains intractable. Therefore, augmentation-based methods of domain generalization that require domain identifiers and manual fine-tuning are inadequate in this setting. To overcome this challenge, we introduce ContriMix, a domain generalization technique that learns to generate synthetic images by disentangling and permuting the biological content ("content") and technical variations ("attributes") in microscopy images. ContriMix does not rely on domain identifiers or handcrafted augmentations and makes no assumptions about the input characteristics of images. We assess the performance of ContriMix on two pathology datasets dealing with patch classification and Whole Slide Image label prediction tasks, respectively (Camelyon17-WILDS and RCC subtyping), and one fluorescence microscopy dataset (RxRx1-WILDS). Without any access to domain identifiers at train or test time, ContriMix performs similarly to or better than current state-of-the-art methods on all these datasets, motivating its usage for microscopy image analysis in real-world settings where domain information is hard to come by. The code for ContriMix can be found at //gitlab.com/huutan86/contrimix
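A minimal sketch of the content/attribute permutation idea: encode each image into a content code and an attribute code, shuffle the attributes across the batch, and decode synthetic images. The tiny architectures below are placeholder assumptions, not the ContriMix networks.

```python
# Toy content/attribute disentangle-and-permute sketch (illustrative only).
import torch
import torch.nn as nn

class TinyContriMix(nn.Module):
    def __init__(self, ch=3):
        super().__init__()
        self.content = nn.Conv2d(ch, 8, 3, padding=1)    # spatial content code
        self.attr = nn.Sequential(nn.AdaptiveAvgPool2d(1),
                                  nn.Conv2d(ch, 8, 1))   # global attribute code
        self.decode = nn.Conv2d(8, ch, 3, padding=1)

    def forward(self, x):
        c = self.content(x)
        a = self.attr(x)
        perm = torch.randperm(x.size(0))                 # permute attributes
        return self.decode(c * a[perm])                  # mix content + attrs

x = torch.randn(4, 3, 32, 32)
print(TinyContriMix()(x).shape)  # torch.Size([4, 3, 32, 32])
```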

Due to the scarcity of medical images and their specific imaging characteristics, designing lightweight Vision Transformers (ViTs) for efficient medical image segmentation is a significant challenge, and current studies have not yet paid attention to this issue. This work revisits the relationship between CNNs and Transformers in lightweight universal networks for medical image segmentation, aiming to integrate the advantages of both worlds at the infrastructure design level. In order to leverage the inductive bias inherent in CNNs, we abstract a Transformer-like lightweight CNN block (ConvUtr) as the patch embedding of the ViT, feeding the Transformer with denoised, non-redundant and highly condensed semantic information. Moreover, an adaptive Local-Global-Local (LGL) block is introduced to facilitate efficient local-to-global information flow, maximizing the Transformer's global context extraction capabilities. Finally, we build an efficient medical image segmentation model (MobileUtr) based on CNN and Transformer. Extensive experiments on five public medical image datasets with three different modalities demonstrate the superiority of MobileUtr over the state-of-the-art methods, while boasting lighter weights and lower computational cost. Code is available at //github.com/FengheTan9/MobileUtr.
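A minimal sketch of a Local-Global-Local pattern: convolutions gather local context before and after a self-attention stage that mixes global context. The layer choices below are illustrative assumptions, not the MobileUtr design.

```python
# Toy local -> global (attention) -> local block (illustrative only).
import torch
import torch.nn as nn

class LGLBlock(nn.Module):
    def __init__(self, dim=32, heads=4):
        super().__init__()
        self.local_in = nn.Conv2d(dim, dim, 3, padding=1, groups=dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.local_out = nn.Conv2d(dim, dim, 3, padding=1, groups=dim)

    def forward(self, x):                      # x: (B, C, H, W)
        x = x + self.local_in(x)               # local mixing
        B, C, H, W = x.shape
        seq = x.flatten(2).transpose(1, 2)     # (B, H*W, C) tokens
        g, _ = self.attn(seq, seq, seq)        # global mixing
        x = x + g.transpose(1, 2).reshape(B, C, H, W)
        return x + self.local_out(x)           # local refinement

print(LGLBlock()(torch.randn(2, 32, 16, 16)).shape)
```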

Recently, a myriad of conditional image generation and editing models have been developed to serve different downstream tasks, including text-to-image generation, text-guided image editing, subject-driven image generation, control-guided image generation, etc. However, we observe huge inconsistencies in experimental conditions (datasets, inference, and evaluation metrics) that render fair comparisons difficult. This paper proposes ImagenHub, a one-stop library to standardize the inference and evaluation of all the conditional image generation models. First, we define seven prominent tasks and curate high-quality evaluation datasets for them. Second, we build a unified inference pipeline to ensure fair comparison. Third, we design two human evaluation scores, Semantic Consistency and Perceptual Quality, along with comprehensive guidelines for evaluating generated images, and train expert raters to evaluate the model outputs based on the proposed metrics. Our human evaluation achieves high inter-worker agreement: 76% of models obtain a Krippendorff's alpha above 0.4. We comprehensively evaluated a total of around 30 models and observed three key takeaways: (1) the existing models' performance is generally unsatisfying except for text-guided image generation and subject-driven image generation, with 74% of models achieving an overall score lower than 0.5; (2) we examined the claims from published papers and found that 83% of them hold, with a few exceptions; (3) none of the existing automatic metrics has a Spearman's correlation higher than 0.2, except for subject-driven image generation. Moving forward, we will continue our efforts to evaluate newly published models and update our leaderboard to keep track of the progress in conditional image generation.
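A minimal sketch of the kind of check behind takeaway (3): correlating an automatic metric with human ratings via Spearman's rho. The scores below are synthetic stand-ins for per-model evaluation results.

```python
# Rank correlation of a noisy automatic metric against human scores.
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
human = rng.uniform(0, 1, 30)             # human overall scores (synthetic)
metric = human + rng.normal(0, 0.4, 30)   # noisy automatic metric (synthetic)
rho, p = spearmanr(human, metric)
print(f"Spearman rho = {rho:.2f} (p = {p:.3f})")
```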

This paper presents a novel application of Generative Adversarial Networks (GANs) to study visual aspects of social processes. I train a StyleGAN2 model on a custom dataset of 14,564 images of London sourced from Google Street View. After training, I invert the images in the training set, finding points in the model's latent space that correspond to them, and compare results from three inversion techniques. I connect each data point with metadata from the Indices of Multiple Deprivation, describing income, health and environmental quality in the area where the photographs were taken. It is then possible to map which parts of the model's latent space encode visual features that are distinctive for health, income and environmental quality, and to condition the synthesis of new images on these factors. The synthetic images reflect visual features of social processes that were previously unknown and difficult to study, describing recurring visual differences between deprived and privileged areas in London. GANs are known for their capability to produce a continuous range of images that exhibit visual differences. The paper tests how to exploit this ability through visual comparisons in still images as well as through an interactive website where users can guide image synthesis with sliders. Though conditioned synthesis has its limitations and the results are difficult to validate, the paper points to the potential for generative models to be repurposed as parts of social scientific methods.
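A minimal sketch of optimisation-based inversion, one common family of inversion techniques: freeze a generator and optimise a latent vector so its output matches a target image. A toy generator stands in for StyleGAN2 here.

```python
# Latent optimisation inversion against a frozen toy generator.
import torch
import torch.nn as nn

G = nn.Sequential(nn.Linear(64, 3 * 32 * 32), nn.Tanh(),
                  nn.Unflatten(1, (3, 32, 32)))       # stand-in generator
for p in G.parameters():
    p.requires_grad_(False)                            # generator is frozen

target = torch.rand(1, 3, 32, 32) * 2 - 1              # image to invert
z = torch.zeros(1, 64, requires_grad=True)
opt = torch.optim.Adam([z], lr=0.05)
for step in range(200):                                # inversion loop
    loss = nn.functional.mse_loss(G(z), target)
    opt.zero_grad(); loss.backward(); opt.step()
print(loss.item())                                     # reconstruction error
```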

Generative AI models for image creation are becoming a staple in the toolkit of digital artists and visual designers. Interaction with these systems is mediated by prompting, a process in which users write a short text to describe the desired image's content and style. The study of prompts offers an unprecedented opportunity to gain insight into the process of human creativity, yet our understanding of how people use them remains limited. We analyze more than 145,000 prompts from the logs of two Generative AI platforms (Stable Diffusion and Pick-a-Pic) to shed light on how people explore new concepts over time, and how their exploration might be influenced by different design choices in human-computer interfaces to Generative AI. We find that users exhibit a tendency towards exploration of new topics over exploitation of concepts visited previously. However, a comparative analysis of the two platforms, which differ in both scope and functionality, reveals that introducing features that divert user focus from prompting and instead provide shortcuts for generating new image variants with simple clicks is associated with a considerable reduction both in exploration of novel concepts and in the detail of submitted prompts. These results carry direct implications for the design of human interfaces to Generative AI and raise new questions about how the process of prompting should be aided in ways that best support creativity.
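As an illustration of how exploration vs. exploitation could be operationalised, the sketch below labels a prompt as "exploration" when it is sufficiently dissimilar from all of the user's earlier prompts. The TF-IDF similarity and the threshold are illustrative assumptions, not the paper's measure.

```python
# Toy exploration/exploitation labelling of a user's prompt stream.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

prompts = ["a castle at sunset", "a castle at sunset, oil painting",
           "portrait of a cat astronaut", "cat astronaut, detailed"]
vecs = TfidfVectorizer().fit_transform(prompts)
for i in range(1, len(prompts)):
    sim = cosine_similarity(vecs[i], vecs[:i]).max()   # vs. earlier prompts
    label = "exploitation" if sim > 0.5 else "exploration"
    print(f"{prompts[i]!r}: {label} (max sim {sim:.2f})")
```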

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data augmentation and, optionally, distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We will share our code based on the Timm library and pre-trained models.
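A minimal sketch of one such residual block, with the patch-mixing linear layer (i) followed by the per-patch channel MLP (ii). LayerNorm is assumed here in place of the paper's affine normalisation, and the sizes are illustrative.

```python
# Toy ResMLP-style block: cross-patch linear + per-patch MLP, both residual.
import torch
import torch.nn as nn

class ResMLPBlock(nn.Module):
    def __init__(self, n_patches=196, dim=384):
        super().__init__()
        self.norm1, self.norm2 = nn.LayerNorm(dim), nn.LayerNorm(dim)
        self.patch_mix = nn.Linear(n_patches, n_patches)   # (i) cross-patch
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(),
                                 nn.Linear(4 * dim, dim))  # (ii) per-patch

    def forward(self, x):                      # x: (B, n_patches, dim)
        x = x + self.patch_mix(self.norm1(x).transpose(1, 2)).transpose(1, 2)
        return x + self.mlp(self.norm2(x))

print(ResMLPBlock()(torch.randn(2, 196, 384)).shape)
```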

A key requirement for the success of supervised deep learning is a large labeled dataset - a condition that is difficult to meet in medical image analysis. Self-supervised learning (SSL) can help in this regard by providing a strategy to pre-train a neural network with unlabeled data, followed by fine-tuning for a downstream task with limited annotations. Contrastive learning, a particular variant of SSL, is a powerful technique for learning image-level representations. In this work, we propose strategies for extending the contrastive learning framework for segmentation of volumetric medical images in the semi-supervised setting with limited annotations, by leveraging domain-specific and problem-specific cues. Specifically, we propose (1) novel contrasting strategies that leverage structural similarity across volumetric medical images (domain-specific cue) and (2) a local version of the contrastive loss to learn distinctive representations of local regions that are useful for per-pixel segmentation (problem-specific cue). We carry out an extensive evaluation on three Magnetic Resonance Imaging (MRI) datasets. In the limited annotation setting, the proposed method yields substantial improvements compared to other self-supervision and semi-supervised learning techniques. When combined with a simple data augmentation technique, the proposed method reaches within 8% of benchmark performance using only two labeled MRI volumes for training, corresponding to only 4% (for ACDC) of the training data used to train the benchmark.
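A minimal sketch of the image-level building block that the proposed strategies extend: an InfoNCE-style contrastive loss on paired embeddings of two augmented views. The domain-specific global and local variants of the paper are not reproduced here.

```python
# InfoNCE contrastive loss on paired embeddings (illustrative baseline).
import torch
import torch.nn.functional as F

def info_nce(z1, z2, tau=0.1):
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / tau           # similarity of every cross pair
    labels = torch.arange(z1.size(0))    # positives lie on the diagonal
    return F.cross_entropy(logits, labels)

z1, z2 = torch.randn(8, 128), torch.randn(8, 128)  # two augmented views
print(info_nce(z1, z2))
```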

Deep learning constitutes a recent, modern technique for image processing and data analysis, with promising results and large potential. As deep learning has been successfully applied in various domains, it has recently also entered the domain of agriculture. In this paper, we survey 40 research efforts that employ deep learning techniques applied to various agricultural and food production challenges. We examine the particular agricultural problems under study, the specific models and frameworks employed, the sources, nature and pre-processing of the data used, and the overall performance achieved according to the metrics used in each work under study. Moreover, we study comparisons of deep learning with other existing popular techniques with respect to differences in classification or regression performance. Our findings indicate that deep learning provides high accuracy, outperforming existing commonly used image processing techniques.
