国产特级黄色片A级无毛视频_亚洲色精品一区二区色欲AV_国产美女精品91_日韩亚洲欧美综合一区二区三区_美女少妇高潮一区二区色欲影院_久久伊人无码精品一区二区_亚洲最新毛片一卡二卡

This paper studies a reconstruction-based approach for weakly-supervised animal detection from aerial images in marine environments. Such an approach leverages an anomaly detection framework that computes metrics directly on the input space, enhancing interpretability and anomaly localization compared to feature embedding methods. Building upon the success of Vector-Quantized Variational Autoencoders in anomaly detection on computer vision datasets, we adapt them to the marine animal detection domain and address the challenge of handling noisy data. To evaluate our approach, we compare it with existing methods in the context of marine animal detection from aerial image data. Experiments conducted on two dedicated datasets demonstrate the superior performance of the proposed method over recent studies in the literature. Our framework offers improved interpretability and localization of anomalies, providing valuable insights for monitoring marine ecosystems and mitigating the impact of human activities on marine animals.

相關內容

自編碼器

關注 140

自(zi)(zi)動(dong)(dong)編(bian)碼器(qi)(qi)是一(yi)種人工神經網絡，用(yong)于以無監督的(de)方式學(xue)習有效的(de)數(shu)據(ju)編(bian)碼。自(zi)(zi)動(dong)(dong)編(bian)碼器(qi)(qi)的(de)目的(de)是通(tong)過(guo)訓練網絡忽略信號“噪(zao)聲”來學(xue)習一(yi)組數(shu)據(ju)的(de)表示（編(bian)碼），通(tong)常用(yong)于降(jiang)維。與簡化方面一(yi)起(qi)，學(xue)習了重(zhong)構方面，在此(ci)，自(zi)(zi)動(dong)(dong)編(bian)碼器(qi)(qi)嘗試從簡化編(bian)碼中生成盡可能接近(jin)其原始輸入的(de)表示形(xing)(xing)式，從而得到其名稱。基(ji)本模(mo)型存在幾種變體，其目的(de)是迫使學(xue)習的(de)輸入表示形(xing)(xing)式具有有用(yong)的(de)屬性。自(zi)(zi)動(dong)(dong)編(bian)碼器(qi)(qi)可有效地(di)解決(jue)許多應用(yong)問題，從面部識別(bie)到獲取單詞的(de)語義。

MoDELS · 縮放 · INTERACT · 時間步 · CASE ·

2023 年 9 月 5 日

Multiscale constitutive framework of 1D blood flow modeling: Asymptotic limits and numerical methods

Giulia Bertaglia,Lorenzo Pareschi

In this paper, a multiscale constitutive framework for one-dimensional blood flow modeling is presented and discussed. By analyzing the asymptotic limits of the proposed model, it is shown that different types of blood propagation phenomena in arteries and veins can be described through an appropriate choice of scaling parameters, which are related to distinct characterizations of the fluid-structure interaction mechanism (whether elastic or viscoelastic) that exist between vessel walls and blood flow. In these asymptotic limits, well-known blood flow models from the literature are recovered. Additionally, by analyzing the perturbation of the local elastic equilibrium of the system, a new viscoelastic blood flow model is derived. The proposed approach is highly flexible and suitable for studying the human cardiovascular system, which is composed of vessels with high morphological and mechanical variability. The resulting multiscale hyperbolic model of blood flow is solved using an asymptotic-preserving Implicit-Explicit Runge-Kutta Finite Volume method, which ensures the consistency of the numerical scheme with the different asymptotic limits of the mathematical model without affecting the choice of the time step by restrictions related to the smallness of the scaling parameters. Several numerical tests confirm the validity of the proposed methodology, including a case study investigating the hemodynamics of a thoracic aorta in the presence of a stent.

相關系數 · 估計/估計量 · 有向 · AIM · Performer ·

2023 年 9 月 5 日

Correlation visualization under missing values: a comparison between imputation and direct parameter estimation methods

Nhat-Hao Pham,Khanh-Linh Vo,Mai Anh Vu,Thu Nguyen,Michael A. Riegler,P?l Halvorsen,Binh T. Nguyen

Correlation matrix visualization is essential for understanding the relationships between variables in a dataset, but missing data can pose a significant challenge in estimating correlation coefficients. In this paper, we compare the effects of various missing data methods on the correlation plot, focusing on two common missing patterns: random and monotone. We aim to provide practical strategies and recommendations for researchers and practitioners in creating and analyzing the correlation plot. Our experimental results suggest that while imputation is commonly used for missing data, using imputed data for plotting the correlation matrix may lead to a significantly misleading inference of the relation between the features. We recommend using DPER, a direct parameter estimation approach, for plotting the correlation matrix based on its performance in the experiments.

線性的 · 優化器 · 泛函 · 設計 · 噪聲分布 ·

2023 年 9 月 5 日

Bayesian experimental design for linear elasticity

Sarah Eberle-Blick,Nuutti Hyv?nen

from arxiv, 23 pages, 11 figures

This work considers Bayesian experimental design for the inverse boundary value problem of linear elasticity in a two-dimensional setting. The aim is to optimize the positions of compactly supported pressure activations on the boundary of the examined body in order to maximize the value of the resulting boundary deformations as data for the inverse problem of reconstructing the Lam\'e parameters inside the object. We resort to a linearized measurement model and adopt the framework of Bayesian experimental design, under the assumption that the prior and measurement noise distributions are mutually independent Gaussians. This enables the use of the standard Bayesian A-optimality criterion for deducing optimal positions for the pressure activations. The (second) derivatives of the boundary measurements with respect to the Lam\'e parameters and the positions of the boundary pressure activations are deduced to allow minimizing the corresponding objective function, i.e., the trace of the covariance matrix of the posterior distribution, by a gradient-based optimization algorithm. Two-dimensional numerical experiments are performed to demonstrate the functionality of our approach.

多峰值 · Learning · 可辨認的 · Processing（編程語言） · 深度學習 ·

2023 年 9 月 3 日

A scoping review on multimodal deep learning in biomedical images and texts

Zhaoyi Sun,Mingquan Lin,Qingqing Zhu,Qianqian Xie,Fei Wang,Zhiyong Lu,Yifan Peng

from arxiv, This paper has been accepted by the Journal of Biomedical Informatics

Computer-assisted diagnostic and prognostic systems of the future should be capable of simultaneously processing multimodal data. Multimodal deep learning (MDL), which involves the integration of multiple sources of data, such as images and text, has the potential to revolutionize the analysis and interpretation of biomedical data. However, it only caught researchers' attention recently. To this end, there is a critical need to conduct a systematic review on this topic, identify the limitations of current work, and explore future directions. In this scoping review, we aim to provide a comprehensive overview of the current state of the field and identify key concepts, types of studies, and research gaps with a focus on biomedical images and texts joint learning, mainly because these two were the most commonly available data types in MDL research. This study reviewed the current uses of multimodal deep learning on five tasks: (1) Report generation, (2) Visual question answering, (3) Cross-modal retrieval, (4) Computer-aided diagnosis, and (5) Semantic segmentation. Our results highlight the diverse applications and potential of MDL and suggest directions for future research in the field. We hope our review will facilitate the collaboration of natural language processing (NLP) and medical imaging communities and support the next generation of decision-making and computer-assisted diagnostic system development.

可約的 · MoDELS · Extensibility · Integration · 離散化 ·

2023 年 9 月 2 日

Projection-based reduced order modeling of an iterative coupling scheme for thermo-poroelasticity

Francesco Ballarin,Sanghyun Lee,Son-Young Yi

This paper explores an iterative coupling approach to solve thermo-poroelasticity problems, with its application as a high-fidelity discretization utilizing finite elements during the training of projection-based reduced order models. One of the main challenges in addressing coupled multi-physics problems is the complexity and computational expenses involved. In this study, we introduce a decoupled iterative solution approach, integrated with reduced order modeling, aimed at augmenting the efficiency of the computational algorithm. The iterative coupling technique we employ builds upon the established fixed-stress splitting scheme that has been extensively investigated for Biot's poroelasticity. By leveraging solutions derived from this coupled iterative scheme, the reduced order model employs an additional Galerkin projection onto a reduced basis space formed by a small number of modes obtained through proper orthogonal decomposition. The effectiveness of the proposed algorithm is demonstrated through numerical experiments, showcasing its computational prowess.

會話智能體 · Agent · 設計 · MoDELS · HCI ·

2023 年 9 月 1 日

Designing a realistic peer-like embodied conversational agent for supporting children's storytelling

Zhixin Li,Ying Xu

from arxiv, 6 pages with 2 figures. The paper has been peer-reviewed and presented at the "CHI 2023 Workshop on Child-centred AI Design: Definition, Operation and Considerations, April 23, 2023, Hamburg, Germany

Advances in artificial intelligence have facilitated the use of large language models (LLMs) and AI-generated synthetic media in education, which may inspire HCI researchers to develop technologies, in particular, embodied conversational agents (ECAs) to simulate the kind of scaffolding children might receive from a human partner. In this paper, we will propose a design prototype of a peer-like ECA named STARie that integrates multiple AI models - GPT-3, Speech Synthesis (Real-time Voice Cloning), VOCA (Voice Operated Character Animation), and FLAME (Faces Learned with an Articulated Model and Expressions) that aims to support narrative production in collaborative storytelling, specifically for children aged 4-8. However, designing a child-centered ECA raises concerns about age appropriateness, children privacy, gender choices of ECAs, and the uncanny valley effect. Thus, this paper will also discuss considerations and ethical concerns that must be taken into account when designing such an ECA. This proposal offers insights into the potential use of AI-generated synthetic media in child-centered AI design and how peer-like AI embodiment may support children\textquotesingle s storytelling.

Continuity · Learning · Performer · 學習器 · MoDELS ·

2023 年 9 月 1 日

New metrics for analyzing continual learners

Nicolas Michel,Giovanni Chierchia,Romain Negrel,Jean-Fran?ois Bercher,Toshihiko Yamasaki

from arxiv, 6 pages, presented at MIRU 2023

Deep neural networks have shown remarkable performance when trained on independent and identically distributed data from a fixed set of classes. However, in real-world scenarios, it can be desirable to train models on a continuous stream of data where multiple classification tasks are presented sequentially. This scenario, known as Continual Learning (CL) poses challenges to standard learning algorithms which struggle to maintain knowledge of old tasks while learning new ones. This stability-plasticity dilemma remains central to CL and multiple metrics have been proposed to adequately measure stability and plasticity separately. However, none considers the increasing difficulty of the classification task, which inherently results in performance loss for any model. In that sense, we analyze some limitations of current metrics and identify the presence of setup-induced forgetting. Therefore, we propose new metrics that account for the task's increasing difficulty. Through experiments on benchmark datasets, we demonstrate that our proposed metrics can provide new insights into the stability-plasticity trade-off achieved by models in the continual learning environment.

state-of-the-art · 稀疏 · Automator · Vision · 計算機視覺 ·

2023 年 9 月 1 日

Sparse resultant based minimal solvers in computer vision and their connection with the action matrix

Snehal Bhayani,Janne Heikkil?,Zuzana Kukelova

from arxiv, arXiv admin note: text overlap with arXiv:1912.10268

Many computer vision applications require robust and efficient estimation of camera geometry from a minimal number of input data measurements, i.e., solving minimal problems in a RANSAC framework. Minimal problems are usually formulated as complex systems of sparse polynomials. The systems usually are overdetermined and consist of polynomials with algebraically constrained coefficients. Most state-of-the-art efficient polynomial solvers are based on the action matrix method that has been automated and highly optimized in recent years. On the other hand, the alternative theory of sparse resultants and Newton polytopes has been less successful for generating efficient solvers, primarily because the polytopes do not respect the constraints on the coefficients. Therefore, in this paper, we propose a simple iterative scheme to test various subsets of the Newton polytopes and search for the most efficient solver. Moreover, we propose to use an extra polynomial with a special form to further improve the solver efficiency via a Schur complement computation. We show that for some camera geometry problems our extra polynomial-based method leads to smaller and more stable solvers than the state-of-the-art Grobner basis-based solvers. The proposed method can be fully automated and incorporated into existing tools for automatic generation of efficient polynomial solvers. It provides a competitive alternative to popular Grobner basis-based methods for minimal problems in computer vision. We also study the conditions under which the minimal solvers generated by the state-of-the-art action matrix-based methods and the proposed extra polynomial resultant-based method, are equivalent. Specifically we consider a step-by-step comparison between the approaches based on the action matrix and the sparse resultant, followed by a set of substitutions, which would lead to equivalent minimal solvers.

Color · 推斷 · 有偏 · 數據可視化 · 分解的 ·

2023 年 8 月 31 日

Effects of data distribution and granularity on color semantics for colormap data visualizations

Clementine Zimnicki,Chin Tseng,Danielle Albers Szafir,Karen B. Schloss

To create effective data visualizations, it helps to represent data using visual features in intuitive ways. When visualization designs match observer expectations, visualizations are easier to interpret. Prior work suggests that several factors influence such expectations. For example, the dark-is-more bias leads observers to infer that darker colors map to larger quantities, and the opaque-is-more bias leads them to infer that regions appearing more opaque (given the background color) map to larger quantities. Previous work suggested that the background color only plays a role if visualizations appear to vary in opacity. The present study challenges this claim. We hypothesized that the background color modulate inferred mappings for colormaps that should not appear to vary in opacity (by previous measures) if the visualization appeared to have a "hole" that revealed the background behind the map (hole hypothesis). We found that spatial aspects of the map contributed to inferred mappings, though the effects were inconsistent with the hole hypothesis. Our work raises new questions about how spatial distributions of data influence color semantics in colormap data visualizations.

圖片分類 · 前饋網絡 · INTERACT · Networking · 前饋 ·

2021 年 5 月 7 日

ResMLP: Feedforward networks for image classification with data-efficient training

Hugo Touvron,Piotr Bojanowski,Mathilde Caron,Matthieu Cord,Alaaeldin El-Nouby,Edouard Grave,Armand Joulin,Gabriel Synnaeve,Jakob Verbeek,Hervé Jégou

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We will share our code based on the Timm library and pre-trained models.