
Deformable image registration is a fundamental task in medical image analysis and plays a crucial role in a wide range of clinical applications. Recently, deep learning-based approaches have been widely studied for deformable medical image registration and have achieved promising results. However, existing deep learning registration techniques do not theoretically guarantee topology-preserving transformations, a key property for preserving anatomical structures and achieving plausible transformations that can be used in real clinical settings. We propose a novel framework for deformable image registration. Firstly, we introduce a novel regulariser based on conformal-invariant properties in a nonlinear elasticity setting. Our regulariser enforces the deformation field to be smooth, invertible and orientation-preserving. More importantly, we strictly guarantee topology preservation, yielding a clinically meaningful registration. Secondly, we boost the performance of our regulariser through coordinate MLPs, where one can view the to-be-registered images as continuously differentiable entities. We demonstrate, through numerical and visual experiments, that our framework outperforms current techniques for image registration.
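As a loose illustration of the two ingredients above, the sketch below parameterizes a 2-D displacement field with a coordinate MLP and penalizes non-positive Jacobian determinants of the resulting map, a common surrogate for orientation preservation. This is not the paper's conformal-invariant energy, and the names (CoordMLP, jacobian_penalty) are hypothetical.

```python
import torch
import torch.nn as nn

class CoordMLP(nn.Module):
    """Coordinate MLP: maps (x, y) points to a 2-D displacement u(x, y)."""
    def __init__(self, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
            nn.Linear(hidden, 2),
        )

    def forward(self, xy):
        return self.net(xy)

def jacobian_penalty(model, xy):
    """Penalize non-positive Jacobian determinants of phi(x) = x + u(x),
    a common surrogate for orientation preservation (not the paper's
    exact conformal-invariant regulariser)."""
    xy = xy.clone().requires_grad_(True)
    u = model(xy)
    rows = []
    for i in range(2):  # gradient of each displacement component
        g = torch.autograd.grad(u[:, i].sum(), xy, create_graph=True)[0]
        rows.append(g)
    J = torch.stack(rows, dim=1) + torch.eye(2)   # Jacobian of phi = Id + u
    det = J[:, 0, 0] * J[:, 1, 1] - J[:, 0, 1] * J[:, 1, 0]
    return torch.relu(-det).mean()                # active where det < 0 (folding)

pts = torch.rand(1024, 2)        # random sample of image-domain coordinates
model = CoordMLP()
loss = jacobian_penalty(model, pts)
loss.backward()
```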

Related Content

Image registration is a classic problem and a technical challenge in image processing research. Its goal is to compare or fuse images of the same object acquired under different conditions, for example images from different acquisition devices, taken at different times, or from different viewpoints; registration between images of different objects is sometimes also required. Concretely, given two images from an image dataset, one seeks a spatial transformation that maps one image onto the other so that points corresponding to the same spatial location in the two images are brought into one-to-one correspondence, thereby achieving information fusion. The technique has wide applications in computer vision, medical image processing and materials mechanics. Depending on the application, some uses focus on fusing the two images via the resulting transformation, while others focus on studying the transformation itself to obtain mechanical properties of the object.
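A toy sketch of the core objective just described, assuming a known candidate transformation: warp the moving image onto the fixed one and score the match with a sum-of-squared-differences term, the simplest of many possible similarity measures.

```python
import numpy as np
from scipy.ndimage import map_coordinates

def warp(moving, phi):
    """Resample `moving` at the mapped coordinates phi, a (2, H, W) array
    giving, for each target pixel, the source location it maps to."""
    return map_coordinates(moving, phi, order=1, mode='grid-wrap')

def ssd(fixed, warped):
    """Sum-of-squared-differences: the simplest image matching term."""
    return np.sum((fixed - warped) ** 2)

H, W = 64, 64
fixed = np.random.rand(H, W)
moving = np.roll(fixed, shift=3, axis=1)   # moving = fixed shifted by 3 px

ys, xs = np.mgrid[0:H, 0:W].astype(float)
phi = np.stack([ys, xs + 3.0])             # candidate transform undoing the shift
print(ssd(fixed, warp(moving, phi)))       # ~0 when the transform is correct
```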

Stochastic partial differential equations have been used in a variety of contexts to model the evolution of uncertain dynamical systems. In recent years, their application to geophysical fluid dynamics has increased massively. For judicious use in modelling fluid evolution, one needs to calibrate the amplitude of the noise to data. In this paper, we address this requirement for the stochastic rotating shallow water (SRSW) model. This work is a continuation of [LvLCP23], where a data assimilation methodology was introduced for the SRSW model. The noise used in [LvLCP23] was introduced as an arbitrary random phase shift in Fourier space, which is not necessarily consistent with the uncertainty induced by a model reduction procedure. In this paper, we introduce a new method of noise calibration for the SRSW model which is compatible with the model reduction technique. The method is generic and can be applied to arbitrary stochastic parametrizations. It is also agnostic as to the source of data (real or synthetic). It is based on a principal component analysis technique to generate the eigenvectors and the eigenvalues of the covariance matrix of the stochastic parametrization. For the SRSW model covered in this paper, we calibrate the noise using the elevation variable of the model, as this is an observable that is easily obtainable in practical applications, and use synthetic data as input for the calibration procedure.
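A minimal sketch of the PCA step described above, assuming the observed fluctuations (e.g., elevation increments) are stacked as rows of a matrix; the helper name calibrate_noise is hypothetical, and the specifics of [LvLCP23] are not reproduced here.

```python
import numpy as np

def calibrate_noise(samples, k):
    """PCA calibration sketch: given rows of observed fluctuations,
    return the leading k eigenvalues/eigenvectors of their covariance
    as noise amplitudes and spatial structures."""
    fluct = samples - samples.mean(axis=0)          # remove the mean state
    cov = fluct.T @ fluct / (len(samples) - 1)      # empirical covariance
    eigval, eigvec = np.linalg.eigh(cov)            # ascending eigenvalues
    idx = np.argsort(eigval)[::-1][:k]              # keep the k largest
    return eigval[idx], eigvec[:, idx]

rng = np.random.default_rng(0)
data = rng.normal(size=(500, 32))     # synthetic "elevation" records
lam, xi = calibrate_noise(data, k=5)
# lam[i] scales the i-th noise mode xi[:, i] in the stochastic model
```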

Recent research demonstrates that external knowledge injection can advance pre-trained language models (PLMs) in a variety of downstream NLP tasks. However, existing knowledge injection methods are applicable either to structured knowledge or to unstructured knowledge, lacking a unified usage. In this paper, we propose a UNified knowledge inTERface, UNTER, to provide a unified perspective for exploiting both structured and unstructured knowledge. In UNTER, we adopt the decoder as a unified knowledge interface, aligning span representations obtained from the encoder with their corresponding knowledge. This approach enables the encoder to uniformly invoke span-related knowledge from its parameters for downstream applications. Experimental results show that, with both forms of knowledge injected, UNTER gains consistent improvements on a series of knowledge-driven NLP tasks, including entity typing, named entity recognition and relation extraction, especially in low-resource scenarios.
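The snippet below is only a schematic reading of "aligning span representations with their corresponding knowledge": mean-pool encoder states over a span and pull the result toward a knowledge embedding with a cosine objective. UNTER's actual decoder-based interface is more involved, and all names here are hypothetical.

```python
import torch
import torch.nn.functional as F

def span_repr(hidden, start, end):
    """Mean-pool encoder hidden states over the token span [start, end)."""
    return hidden[start:end].mean(dim=0)

def align_loss(span_vec, knowledge_vec):
    """Pull a span representation toward its knowledge embedding;
    one plausible alignment objective, not the paper's exact loss."""
    return 1.0 - F.cosine_similarity(span_vec, knowledge_vec, dim=0)

hidden = torch.randn(16, 768)   # encoder outputs for a 16-token sentence
kg_emb = torch.randn(768)       # embedding of the entity the span mentions
loss = align_loss(span_repr(hidden, 3, 6), kg_emb)
```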

Patient-specific brain mesh generation from MRI can be a time-consuming task and may require manual corrections, e.g., for meshing the ventricular system or defining subdomains. To address this issue, we consider an image registration approach. The idea is to use the registration of an input magnetic resonance image (MRI) to a respective target in order to obtain a new mesh from a high-quality template mesh. To obtain the transformation, we solve an optimization problem that is constrained by a linear hyperbolic transport equation. We use a higher-order discontinuous Galerkin finite element method for discretization and show that, under a restrictive assumption, the numerical upwind scheme can be derived from the continuous weak formulation of the transport equation. We present a numerical implementation that builds on the established finite element packages FEniCS and dolfin-adjoint. To demonstrate the efficacy of the proposed approach, numerical results for the registration of an input MRI to a target MRI of two distinct individuals are presented. Moreover, it is shown that the registration transforms a manually crafted input mesh into a new mesh for the target subject whilst preserving mesh quality. Challenges posed to the algorithm by the complex cortical folding structure are discussed.
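For intuition on the upwind idea mentioned above, here is a first-order finite-difference upwind scheme for the 1-D linear transport equation u_t + a u_x = 0. The paper uses a higher-order discontinuous Galerkin discretization, so this is only a low-order stand-in.

```python
import numpy as np

def upwind_step(u, a, dx, dt):
    """First-order upwind update for u_t + a u_x = 0 with constant a > 0,
    using the left neighbor (periodic boundary via np.roll)."""
    return u - a * dt / dx * (u - np.roll(u, 1))

n = 200
dx = 1.0 / n
x = np.linspace(0.0, 1.0, n, endpoint=False)
u = np.exp(-200 * (x - 0.3) ** 2)   # smooth bump to transport
a, dt = 1.0, 0.4 * dx               # CFL number 0.4 < 1, so stable
for _ in range(100):
    u = upwind_step(u, a, dx, dt)
# the bump has moved right by a * 100 * dt = 0.2 (with some numerical diffusion)
```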

Robust watermarking imperceptibly conceals information within a cover image or video in a way that is resistant to various distortions. Recently, deep learning-based approaches to image watermarking have made significant advancements in robustness and invisibility. However, few studies have focused on video watermarking with deep neural networks, due to the high complexity and computational costs. Our paper aims to answer the following research question: can well-designed deep learning-based image watermarking be efficiently adapted to video watermarking? Our answer is positive. First, we revisit the workflow of deep learning-based watermarking methods, which leads to a critical insight: temporal information in the video may be essential for general computer vision tasks but not for video watermarking specifically. Inspired by this insight, we propose a method named ItoV for efficiently adapting deep learning-based Image watermarking to Video watermarking. Specifically, ItoV merges the temporal dimension of the video with the channel dimension to enable deep neural networks to treat videos as images. We further explore the effects of different convolutional blocks in video watermarking. We find that spatial convolution is the primary influential component in video watermarking, and that depthwise convolutions significantly reduce the computational cost with negligible impact on performance. In addition, we propose a new frame loss that constrains the watermark intensity to be consistent across the frames of each video clip, significantly improving invisibility. Extensive experiments show the superior performance of the adapted video watermarking method compared with state-of-the-art methods on the Kinetics-600 and Inter4K datasets, demonstrating the efficacy of our method ItoV.
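The dimension-merging step is simple enough to show directly; the frame loss below is a hedged reading (variance of per-frame watermark energy) rather than the paper's exact formulation.

```python
import torch

def video_to_image(video):
    """ItoV-style folding: merge the temporal axis into the channel axis
    so a 2-D image watermarking network can process a whole clip at once."""
    b, c, t, h, w = video.shape
    return video.reshape(b, c * t, h, w)

def frame_loss(residual, t):
    """One plausible frame loss: penalize the variance of per-frame
    watermark energy so the intensity is consistent across frames."""
    b, ct, h, w = residual.shape
    per_frame = residual.reshape(b, ct // t, t, h, w).abs().mean(dim=(1, 3, 4))
    return per_frame.var(dim=1).mean()

clip = torch.randn(2, 3, 8, 64, 64)         # (B, C, T, H, W) video clip
folded = video_to_image(clip)               # (2, 24, 64, 64): now "an image"
residual = 0.01 * torch.randn_like(folded)  # stand-in watermark residual
loss = frame_loss(residual, t=8)
```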

Most of the work in the auction design literature assumes that bidders behave rationally based on the information available for each individual auction. However, in today's online advertising markets, one of the most important real-life applications of auction design, the data and computational power required to bid optimally are only available to the auction designer, and an advertiser can only participate by setting performance objectives (clicks, conversions, etc.) for the campaign. In this paper, we focus on value-maximizing campaigns with return-on-investment (ROI) constraints, which are widely adopted in many global-scale auto-bidding platforms. Through theoretical analysis and empirical experiments on both synthetic and realistic data, we find that the second-price auction exhibits many undesirable properties and loses the dominant theoretical advantages it enjoys in single-item scenarios. In particular, the second-price auction brings equilibrium multiplicity, non-monotonicity, vulnerability to exploitation by bidders and even auctioneers, and PPAD-hardness of reaching a steady state. We also explore the broader impacts of the auto-bidding mechanism beyond efficiency and strategyproofness. In particular, the multiplicity of equilibria and the input sensitivity make advertisers' utilities unstable. In addition, the interference among both bidders and advertising slots introduces bias into A/B testing, which hinders the development of even the non-bidding components of the platform. The aforementioned phenomena have been widely observed in practice, and our results indicate that one of the reasons might be intrinsic to the underlying auto-bidding mechanism. To deal with these challenges, we provide suggestions and candidate solutions for practitioners.
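A toy model of the ROI-constrained value-maximizing bidder described above, assuming uniform bid scaling (bid = multiplier × value) and second-price-style payments; the numbers and the roi_feasible helper are illustrative only.

```python
def roi_feasible(values, prices, multiplier, roi_min=1.0):
    """Bid multiplier * value in each auction, win when the bid meets the
    price, pay the price (second-price style), and check the campaign-level
    ROI constraint value / spend >= roi_min."""
    won = [(v, p) for v, p in zip(values, prices) if multiplier * v >= p]
    value = sum(v for v, _ in won)
    spend = sum(p for _, p in won)
    return spend == 0 or value / spend >= roi_min, value

values = [3.0, 1.0, 2.0, 0.5]   # the advertiser's per-auction values
prices = [1.0, 1.2, 1.5, 0.4]   # competing highest bids
for m in (0.5, 1.0, 2.0):
    print(m, roi_feasible(values, prices, m, roi_min=2.0))
# a higher multiplier wins more auctions but can violate the ROI floor
```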

Image retrieval plays an important role in the Internet world. Usually, the core parts of mainstream visual retrieval systems include an online service of the embedding model and a large-scale vector database. In traditional model upgrades, the old model is not replaced by the new one until the embeddings of all the images in the database have been re-computed by the new model, which takes days or weeks for a large amount of data. Recently, backward-compatible training (BCT) has enabled the new model to be deployed online immediately by making the new embeddings directly comparable to the old ones. For BCT, the key challenge is to improve the compatibility of the two models while minimizing the negative impact on retrieval performance. In this paper, we introduce AdvBCT, an Adversarial Backward-Compatible Training method with an elastic boundary constraint that takes both compatibility and discrimination into consideration. We first employ adversarial learning to minimize the distribution disparity between embeddings of the new model and the old model. Meanwhile, we add an elastic boundary constraint during training to improve compatibility and discrimination efficiently. Extensive experiments on GLDv2, Revisited Oxford (ROxford), and Revisited Paris (RParis) demonstrate that our method outperforms other BCT methods in both compatibility and discrimination. The implementation of AdvBCT will be publicly available at //github.com/Ashespt/AdvBCT.
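A minimal sketch of the adversarial component, assuming a simple MLP discriminator between old- and new-model embeddings; the elastic boundary constraint is omitted and the class names are hypothetical.

```python
import torch
import torch.nn as nn

class Discriminator(nn.Module):
    """Tells old-model embeddings from new-model ones; training the new
    model to fool it shrinks the distribution gap (the adversarial part
    of a BCT-style setup)."""
    def __init__(self, dim=128):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(),
                                 nn.Linear(64, 1))

    def forward(self, z):
        return self.net(z)

disc = Discriminator()
bce = nn.BCEWithLogitsLoss()
old_emb = torch.randn(32, 128)                      # frozen old-model embeddings
new_emb = torch.randn(32, 128, requires_grad=True)  # stand-in new-model output

# discriminator step: label old as 1, new as 0
d_loss = bce(disc(old_emb), torch.ones(32, 1)) + \
         bce(disc(new_emb.detach()), torch.zeros(32, 1))
# generator (new model) step: make new embeddings look "old"
g_loss = bce(disc(new_emb), torch.ones(32, 1))
```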

The Differentiable Search Index (DSI) is a novel information retrieval (IR) framework that utilizes a differentiable function to generate a sorted list of document identifiers in response to a given query. However, due to the black-box nature of the end-to-end neural architecture, it remains to be understood to what extent DSI possesses the basic indexing and retrieval abilities. To address this gap, in this study we define and examine three important abilities that a functioning IR framework should possess: exclusivity, completeness, and relevance ordering. Our analytical experiments show that while DSI demonstrates proficiency in memorizing the unidirectional mapping from pseudo queries to document identifiers, it falls short in distinguishing relevant documents from random ones, thereby negatively impacting its retrieval effectiveness. To address this issue, we propose a multi-task distillation approach to enhance retrieval quality without altering the structure of the model, and successfully endow it with improved indexing abilities. Through experiments conducted on various datasets, we demonstrate that our proposed method outperforms previous DSI baselines.
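To make the docid-generation view concrete, here is a toy trie-constrained greedy decoder: restricting decoding to a prefix trie of valid identifiers is one standard way to guarantee that only real docids can be emitted (the "exclusivity" ability above). The scorer is a stand-in for the model, not DSI's actual architecture.

```python
def build_trie(docids):
    """Prefix trie over valid document identifiers, so decoding can only
    ever emit a real docid."""
    trie = {}
    for d in docids:
        node = trie
        for ch in d:
            node = node.setdefault(ch, {})
        node['$'] = True            # end-of-identifier marker
    return trie

def greedy_decode(score, trie, max_len=8):
    """Greedily pick the highest-scoring next character among the
    trie-allowed continuations; `score(prefix, ch)` stands in for the model."""
    out, node = '', trie
    for _ in range(max_len):
        choices = [c for c in node if c != '$']
        if not choices:
            break
        ch = max(choices, key=lambda c: score(out, c))
        out, node = out + ch, node[ch]
    return out

trie = build_trie(['d101', 'd102', 'd205'])
docid = greedy_decode(lambda prefix, c: -ord(c), trie)
print(docid)                        # 'd101': the lowest code point wins each step
```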

With the explosive growth of information technology, multi-view graph data have become increasingly prevalent and valuable. Most existing multi-view clustering techniques focus either on the scenario of multiple graphs or on that of multi-view attributes. In this paper, we propose a generic framework to cluster multi-view attributed graph data. Specifically, inspired by the success of contrastive learning, we propose a multi-view contrastive graph clustering (MCGC) method to learn a consensus graph, since the original graph could be noisy or incomplete and is not directly applicable. Our method consists of two key steps: we first filter out undesirable high-frequency noise via graph filtering, preserving the graph's geometric features and obtaining a smooth representation of the nodes; we then learn a consensus graph regularized by a graph contrastive loss. Results on several benchmark datasets show the superiority of our method over state-of-the-art approaches. In particular, our simple approach outperforms existing deep learning-based methods.
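A sketch of the graph-filtering step, assuming the common low-pass filter (I - L/2)^k with the symmetrically normalized Laplacian; the contrastive consensus-graph learning step is not shown.

```python
import numpy as np

def low_pass_filter(A, X, k=2):
    """Apply (I - L/2) k times with the symmetrically normalized Laplacian L,
    smoothing node features while keeping the graph's geometry."""
    d = A.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(np.maximum(d, 1e-12)))
    L = np.eye(len(A)) - D_inv_sqrt @ A @ D_inv_sqrt
    H = np.eye(len(A)) - 0.5 * L
    for _ in range(k):
        X = H @ X
    return X

A = np.array([[0, 1, 1, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)   # toy adjacency for one view
X = np.random.rand(4, 8)                    # node attributes
X_smooth = low_pass_filter(A, X, k=3)       # smooth representation of the nodes
```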

There has recently been a surge of interest in developing a new class of deep learning (DL) architectures that integrate an explicit time dimension as a fundamental building block of learning and representation mechanisms. In turn, many recent results show that topological descriptors of the observed data, that is, its persistent homology, encoding information on the shape of the dataset in a topological space at different scales, may contain important complementary information, improving both the performance and the robustness of DL. As a convergence of these two emerging ideas, we propose to enhance DL architectures with the most salient time-conditioned topological information of the data, and introduce the concept of zigzag persistence into time-aware graph convolutional networks (GCNs). Zigzag persistence provides a systematic and mathematically rigorous framework to track the most important topological features of the observed data that tend to manifest themselves over time. To integrate the extracted time-conditioned topological descriptors into DL, we develop a new topological summary, the zigzag persistence image, and derive its theoretical stability guarantees. We validate the new GCNs with a time-aware zigzag topological layer (Z-GCNETs) on traffic forecasting and Ethereum blockchain price prediction. Our results indicate that Z-GCNET outperforms 13 state-of-the-art methods on 4 time series datasets.
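For reference, the snippet below rasterizes an ordinary persistence diagram into a persistence image (Gaussian bumps in (birth, persistence) coordinates, weighted by persistence); the paper's zigzag persistence image builds on this standard construction, and the parameter choices here are arbitrary.

```python
import numpy as np

def persistence_image(diagram, res=16, sigma=0.1, span=1.0):
    """Rasterize a persistence diagram into a grid of Gaussian bumps in
    (birth, persistence) coordinates, weighted by persistence."""
    xs = np.linspace(0.0, span, res)
    grid = np.zeros((res, res))
    for b, d in diagram:
        p = d - b                                   # persistence of the feature
        gx = np.exp(-(xs - b) ** 2 / (2 * sigma ** 2))
        gy = np.exp(-(xs - p) ** 2 / (2 * sigma ** 2))
        grid += p * np.outer(gy, gx)                # persistence-weighted bump
    return grid

dgm = [(0.1, 0.5), (0.2, 0.9), (0.4, 0.45)]         # (birth, death) pairs
img = persistence_image(dgm)                        # fixed-size input for a network
```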

It is widely believed that modeling the relationships between objects helps in representing and eventually describing an image. Nevertheless, there has been no evidence in support of this idea for image description generation. In this paper, we introduce a new design to explore the connections between objects for image captioning under the umbrella of the attention-based encoder-decoder framework. Specifically, we present a Graph Convolutional Networks plus Long Short-Term Memory (dubbed GCN-LSTM) architecture that integrates both semantic and spatial object relationships into the image encoder. Technically, we build graphs over the detected objects in an image based on their spatial and semantic connections. The representation of each region proposal is then refined by leveraging the graph structure through GCN. With the learnt region-level features, our GCN-LSTM capitalizes on an LSTM-based captioning framework with an attention mechanism for sentence generation. Extensive experiments are conducted on the COCO image captioning dataset, and superior results are reported when comparing to state-of-the-art approaches. More remarkably, GCN-LSTM increases CIDEr-D performance from 120.1% to 128.7% on the COCO testing set.
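A minimal sketch of the graph-based refinement, assuming the standard GCN propagation rule of Kipf and Welling over a relation graph of detected regions; the relation-graph construction and the LSTM decoder are omitted.

```python
import numpy as np

def gcn_layer(A, X, W):
    """One GCN layer refining region features with the normalized adjacency
    A_hat = D^(-1/2) (A + I) D^(-1/2), followed by a ReLU."""
    A_hat = A + np.eye(len(A))                    # add self-loops
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    return np.maximum(D_inv_sqrt @ A_hat @ D_inv_sqrt @ X @ W, 0.0)

n_regions, in_dim, out_dim = 5, 2048, 512
A = (np.random.rand(n_regions, n_regions) > 0.5).astype(float)  # toy relation graph
A = np.maximum(A, A.T)                            # symmetrize for this sketch
X = np.random.rand(n_regions, in_dim)             # detector region features
W = np.random.rand(in_dim, out_dim) * 0.01        # layer weights
refined = gcn_layer(A, X, W)                      # region features fed to the LSTM
```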
