
MoDELS · Guidance · Dilated convolution · surge · Mutually independent ·

May 16, 2023

Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models

Simon Alexanderson, Rajmund Nagy, Jonas Beskow, Gustav Eje Henter

from arXiv, 20 pages, 9 figures. Published in ACM ToG and presented at SIGGRAPH 2023

Diffusion models have experienced a surge of interest as highly expressive yet efficiently trainable probabilistic models. We show that these models are an excellent fit for synthesising human motion that co-occurs with audio, e.g., dancing and co-speech gesticulation, since motion is complex and highly ambiguous given audio, calling for a probabilistic description. Specifically, we adapt the DiffWave architecture to model 3D pose sequences, putting Conformers in place of dilated convolutions for improved modelling power. We also demonstrate control over motion style, using classifier-free guidance to adjust the strength of the stylistic expression. Experiments on gesture and dance generation confirm that the proposed method achieves top-of-the-line motion quality, with distinctive styles whose expression can be made more or less pronounced. We also synthesise path-driven locomotion using the same model architecture. Finally, we generalise the guidance procedure to obtain product-of-expert ensembles of diffusion models and demonstrate how these may be used for, e.g., style interpolation, a contribution we believe is of independent interest. See //www.speech.kth.se/research/listen-denoise-action/ for video examples, data, and code.
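As a minimal sketch of the guidance idea mentioned in the abstract above, the snippet below blends an unconditional and a style-conditioned noise prediction in the commonly used classifier-free guidance form. The function name, the toy numbers, and the particular weighting convention are assumptions for illustration; they are not taken from the paper's code.

```python
def classifier_free_guidance(eps_uncond, eps_cond, scale):
    """Blend unconditional and conditional noise predictions.

    scale = 0 returns the unconditional prediction, scale = 1 the
    conditional one, and scale > 1 exaggerates the conditioning
    (here: the style). This weighting convention is an assumption.
    """
    return [u + scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]


# Toy numbers standing in for denoiser outputs (purely illustrative).
eps_uncond = [0.0, 1.0, -2.0]
eps_cond = [1.0, 1.0, 0.0]

for scale in (0.0, 1.0, 2.0):
    print(scale, classifier_free_guidance(eps_uncond, eps_cond, scale))
```

With `scale` between 0 and 1 the style is toned down, and values above 1 exaggerate it, which matches the abstract's description of making the stylistic expression more or less pronounced.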

Related content

MoDELS

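The ACM/IEEE 23rd International Conference on Model Driven Engineering Languages and Systems is the premier conference series for model-driven software and systems engineering, organized with the support of ACM-SIGSOFT and IEEE-TCSE. Since 1998, MODELS has covered all aspects of modeling, from languages and methods to tools and applications. Attendees of MODELS come from diverse backgrounds, including researchers, academics, engineers, and industry professionals. MODELS 2019 is a forum where participants can exchange cutting-edge research results and innovative practical experience around modeling and model-driven software and systems. This year's edition will give the modeling community an opportunity to further advance the foundations of modeling, and to propose innovative applications of modeling in emerging areas such as cyber-physical systems, embedded systems, socio-technical systems, cloud computing, big data, machine learning, security, and open source, as well as sustainability. Official website link:

Maximum likelihood · Euclidean space · Lecture notes · HTTPS ·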

July 3, 2023

A Survey on Generative Diffusion Model

Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu, Guangyong Chen, Pheng-Ann Heng, Stan Z. Li

Deep generative models are a prominent approach for data generation, and have been used to produce high-quality samples in various domains. Diffusion models, an emerging class of deep generative models, have attracted considerable attention owing to their exceptional generative quality. Despite this, they have certain limitations, including a time-consuming iterative generation process and confinement to high-dimensional Euclidean space. This survey presents a range of advanced techniques aimed at enhancing diffusion models, including sampling acceleration and the design of new diffusion processes. In addition, we delve into strategies for implementing diffusion models in manifold and discrete spaces, maximum likelihood training for diffusion models, and methods for creating bridges between two arbitrary distributions. The innovations we discuss represent recent efforts to improve the functionality and efficiency of diffusion models. To examine the efficacy of existing models, we present a benchmark of FID score, IS, and NLL at a specific NFE. Furthermore, diffusion models are found to be useful in various domains such as computer vision, audio, sequence modeling, and AI for science. The paper concludes with a summary of this field, along with existing limitations and future directions. A summary of existing well-classified methods is available on our GitHub: //github.com/chq1155/A-Survey-on-Generative-Diffusion-Model

MoDELS · Anomaly detection · Performer · Unsupervised · Reconstruction error ·

July 2, 2023

Exploring Diffusion Models for Unsupervised Video Anomaly Detection

Anil Osman Tur, Nicola Dall'Asen, Cigdem Beyan, Elisa Ricci

from arXiv, accepted to IEEE ICIP 2023

This paper investigates the performance of diffusion models for video anomaly detection (VAD) in the most challenging but also the most operational scenario, in which data annotations are not used. Because abnormal events are sparse, diverse, contextual, and often ambiguous, detecting them precisely is a very ambitious task. To this end, we rely only on the information-rich spatio-temporal data and the reconstruction power of diffusion models, such that a high reconstruction error is used to decide abnormality. Experiments performed on two large-scale video anomaly detection datasets demonstrate the consistent improvement of the proposed method over state-of-the-art generative models, while in some cases our method achieves better scores than more complex models. This is the first study using a diffusion model and examining the influence of its parameters to present guidance for VAD in surveillance scenarios.
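The reconstruction-error criterion described in the abstract above can be illustrated with a small sketch. Here the "reconstructions" are hard-coded stand-ins for what a diffusion model would produce, and the function names, the mean-squared-error measure, and the fixed threshold are all assumptions for illustration rather than the paper's actual pipeline.

```python
def frame_errors(frames, reconstructions):
    """Mean squared error per frame; each frame is a flat list of pixel values."""
    errors = []
    for frame, recon in zip(frames, reconstructions):
        squared = sum((a - b) ** 2 for a, b in zip(frame, recon))
        errors.append(squared / len(frame))
    return errors


def flag_anomalies(errors, threshold):
    """Mark a frame as abnormal when its reconstruction error exceeds the threshold."""
    return [error > threshold for error in errors]


# Toy data: the second frame is reconstructed poorly (hypothetical values).
frames = [[0.0, 0.0], [1.0, 1.0]]
reconstructions = [[0.0, 0.0], [0.0, 0.0]]

errors = frame_errors(frames, reconstructions)
print(errors)
print(flag_anomalies(errors, threshold=0.5))
```

In an unsupervised setting the threshold would have to be chosen without labels, for example from the distribution of errors itself; the constant used here is only a placeholder.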

Projection · Attention · binary · Performer · Estimation/Estimator ·

July 1, 2023

Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction

from arXiv, code available at //github.com/majedelhelou/FC-Diffusion

Image diffusion has recently shown remarkable performance in image synthesis and implicitly as an image prior. Such a prior has been used with conditioning to solve the inpainting problem, but only supporting binary user-based conditioning. We derive a fuzzy-conditioned diffusion, where implicit diffusion priors can be exploited with controllable strength. Our fuzzy conditioning can be applied pixel-wise, enabling the modification of different image components to varying degrees. Additionally, we propose an application to facial image correction, where we combine our fuzzy-conditioned diffusion with diffusion-derived attention maps. Our map estimates the degree of anomaly, and we obtain it by projecting on the diffusion space. We show how our approach also leads to interpretable and autonomous facial image correction.

Estimation/Estimator · Normalized · Few-shot learning · Flip · Microsoft Surface ·

June 30, 2023

FlipNeRF: Flipped Reflection Rays for Few-shot Novel View Synthesis

Seunghyeon Seo, Yeonjin Chang, Nojun Kwak

from arXiv, 6 figures

Neural Radiance Field (NeRF) has become a mainstream approach to novel view synthesis thanks to the remarkable quality of its rendered images and its simple architecture. Although NeRF has been developed in various directions that continuously improve its performance, the need for a dense set of multi-view images remains a stumbling block for practical application. In this work, we propose FlipNeRF, a novel regularization method for few-shot novel view synthesis that utilizes our proposed flipped reflection rays. The flipped reflection rays are explicitly derived from the input ray directions and estimated normal vectors, and serve as effective additional training rays while enabling the model to estimate more accurate surface normals and learn the 3D geometry effectively. Since the surface normal and the scene depth are both derived from the estimated densities along a ray, an accurate surface normal leads to more exact depth estimation, which is a key factor for few-shot novel view synthesis. Furthermore, with our proposed Uncertainty-aware Emptiness Loss and Bottleneck Feature Consistency Loss, FlipNeRF is able, respectively, to estimate more reliable outputs while effectively reducing floating artifacts across different scene structures, and to enhance the feature-level consistency between pairs of rays cast toward photo-consistent pixels without any additional feature extractor. FlipNeRF achieves SOTA performance on multiple benchmarks across all scenarios.
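The abstract says the extra rays are derived from ray directions and estimated normals. As a rough illustration of that kind of geometry only, the sketch below computes the standard mirror reflection of a direction about a surface normal. This is the textbook formula, not FlipNeRF's exact construction of flipped reflection rays, and the function names and example vectors are hypothetical.

```python
import math


def normalize(vector):
    """Scale a 3D vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def reflect(direction, normal):
    """Standard mirror reflection: r = d - 2 (d . n) n, with n normalized first."""
    n = normalize(normal)
    dot = sum(d * c for d, c in zip(direction, n))
    return [d - 2.0 * dot * c for d, c in zip(direction, n)]


# A ray travelling down and to the right hits a floor whose normal points up.
incoming = [1.0, -1.0, 0.0]
floor_normal = [0.0, 1.0, 0.0]
print(reflect(incoming, floor_normal))
```

The sign convention (direction pointing toward the surface) is also an assumption; rendering papers often work with the opposite, view-pointing direction.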

Performer · MoDELS · Prompt · HTTPS · BLIP ·

June 30, 2023

Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

from arXiv. This article has been accepted by MICCAI 2023, but has not been fully edited. Content may change prior to final publication.

Large-scale visual-language pre-trained models (VLPM) have proven their excellent performance in downstream object detection for natural scenes. However, zero-shot nuclei detection on H&E images via VLPMs remains underexplored. The large gap between medical images and the web-originated text-image pairs used for pre-training makes it a challenging task. In this paper, we attempt to explore the potential of the object-level VLPM, Grounded Language-Image Pre-training (GLIP) model, for zero-shot nuclei detection. Concretely, an automatic prompt design pipeline is devised based on the association binding trait of VLPM and the image-to-text VLPM BLIP, avoiding empirical manual prompt engineering. We further establish a self-training framework, using the automatically designed prompts to generate the preliminary results as pseudo labels from GLIP and refine the predicted boxes in an iterative manner. Our method achieves remarkable performance for label-free nuclei detection, surpassing other comparison methods. Foremost, our work demonstrates that the VLPM pre-trained on natural image-text pairs exhibits astonishing potential for downstream tasks in the medical field as well. Code will be released at //github.com/wuyongjianCODE/VLPMNuD.

Model selection · MoDELS · Transfer learning · Learning · Category ·

June 30, 2023

Limits of Model Selection under Transfer Learning

Steve Hanneke, Samory Kpotufe, Yasaman Mahdaviyeh

from arXiv, accepted for presentation at the Conference on Learning Theory (COLT) 2023

Theoretical studies on transfer learning or domain adaptation have so far focused on situations with a known hypothesis class or model; however in practice, some amount of model selection is usually involved, often appearing under the umbrella term of hyperparameter-tuning: for example, one may think of the problem of tuning for the right neural network architecture towards a target task, while leveraging data from a related source task. Now, in addition to the usual tradeoffs on approximation vs estimation errors involved in model selection, this problem brings in a new complexity term, namely, the transfer distance between source and target distributions, which is known to vary with the choice of hypothesis class. We present a first study of this problem, focusing on classification; in particular, the analysis reveals some remarkable phenomena: adaptive rates, i.e., those achievable with no distributional information, can be arbitrarily slower than oracle rates, i.e., when given knowledge on distances.

Learning · contrastive · Robot · Contrastive learning · Loss ·

June 29, 2023

Stable Motion Primitives via Imitation and Contrastive Learning

Rodrigo Pérez-Dattari, Jens Kober

Learning from humans allows non-experts to program robots with ease, lowering the resources required to build complex robotic solutions. Nevertheless, such data-driven approaches often lack the ability to provide guarantees regarding their learned behaviors, which is critical for avoiding failures and/or accidents. In this work, we focus on reaching/point-to-point motions, where robots must always reach their goal, independently of their initial state. This can be achieved by modeling motions as dynamical systems and ensuring that they are globally asymptotically stable. Hence, we introduce a novel Contrastive Learning loss for training Deep Neural Networks (DNN) that, when used together with an Imitation Learning loss, enforces the aforementioned stability in the learned motions. Unlike previous work, our method does not restrict the structure of its function approximator, enabling its use with arbitrary DNNs and allowing it to learn complex motions with high accuracy. We validate it using datasets and a real robot. In the former case, motions are 2 and 4 dimensional, modeled as first- and second-order dynamical systems. In the latter, motions are 3, 4, and 6 dimensional, of first and second order, and are used to control a 7DoF robot manipulator in its end effector space and joint space. More details regarding the real-world experiments are presented in: //youtu.be/OM-2edHBRfc.

Noise · Channel · Output · state-of-the-art · Sample ·

June 29, 2023

Effect of non-unital noise on random circuit sampling

Bill Fefferman, Soumik Ghosh, Michael Gullans, Kohdai Kuroiwa, Kunal Sharma

from arXiv, 67 pages, 7 figures

In this work, drawing inspiration from the type of noise present in real hardware, we study the output distribution of random quantum circuits under practical non-unital noise sources with constant noise rates. We show that even in the presence of unital sources like the depolarizing channel, the distribution, under the combined noise channel, never resembles a maximally entropic distribution at any depth. To show this, we prove that the output distribution of such circuits never anticoncentrates (meaning it is never too "flat") regardless of the depth of the circuit. This is in stark contrast to the behavior of noiseless random quantum circuits or those with only unital noise, both of which anticoncentrate at sufficiently large depths. As a consequence, our results have interesting algorithmic implications for both the hardness and easiness of noisy random circuit sampling, since anticoncentration is a critical property exploited by both state-of-the-art classical hardness and easiness results.

MoDELS · Decoding · Performer · Processing (programming language) · Latent ·

June 28, 2023

Lossy Image Compression with Conditional Diffusion Models

Ruihan Yang, Stephan Mandt

This paper outlines an end-to-end optimized lossy image compression framework using diffusion generative models. The approach relies on the transform coding paradigm, where an image is mapped into a latent space for entropy coding and, from there, mapped back to the data space for reconstruction. In contrast to VAE-based neural compression, where the (mean) decoder is a deterministic neural network, our decoder is a conditional diffusion model. Our approach thus introduces an additional "content" latent variable on which the reverse diffusion process is conditioned and uses this variable to store information about the image. The remaining "texture" variables characterizing the diffusion process are synthesized at decoding time. We show that the model's performance can be tuned toward perceptual metrics of interest. Our extensive experiments involving multiple datasets and image quality assessment metrics show that our approach yields stronger reported FID scores than the GAN-based model, while also yielding competitive performance with VAE-based models in several distortion metrics. Furthermore, training the diffusion with X-parameterization enables high-quality reconstructions in only a handful of decoding steps, greatly affecting the model's practicality.

MoDELS · Comprehensibility · Learning · Score function · Markovian ·

August 25, 2022

Understanding Diffusion Models: A Unified Perspective

Diffusion models have shown incredible capabilities as generative models; indeed, they power the current state-of-the-art models on text-conditioned image generation such as Imagen and DALL-E 2. In this work we review, demystify, and unify the understanding of diffusion models across both variational and score-based perspectives. We first derive Variational Diffusion Models (VDM) as a special case of a Markovian Hierarchical Variational Autoencoder, where three key assumptions enable tractable computation and scalable optimization of the ELBO. We then prove that optimizing a VDM boils down to learning a neural network to predict one of three potential objectives: the original source input from any arbitrary noisification of it, the original source noise from any arbitrarily noisified input, or the score function of a noisified input at any arbitrary noise level. We then dive deeper into what it means to learn the score function, and connect the variational perspective of a diffusion model explicitly with the Score-based Generative Modeling perspective through Tweedie's Formula. Lastly, we cover how to learn a conditional distribution using diffusion models via guidance.
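The link between the three prediction targets named in the abstract above can be written out briefly. The derivation below is a sketch in the usual DDPM-style notation, which the abstract itself does not define: the symbols $\bar\alpha_t$ (cumulative signal level at step $t$) and $\epsilon_0$ (the source noise) are assumptions, and the equalities for $x_0$ and the score should be read as holding for the posterior-mean estimate that Tweedie's formula provides.

```latex
% Assumed forward process, with \epsilon_0 \sim \mathcal{N}(0, I):
\begin{align}
x_t &= \sqrt{\bar\alpha_t}\, x_0 + \sqrt{1-\bar\alpha_t}\, \epsilon_0 \\
% Tweedie's formula for z \sim \mathcal{N}(\mu_z, \Sigma_z):
\mathbb{E}[\mu_z \mid z] &= z + \Sigma_z \nabla_z \log p(z) \\
% Applied to q(x_t \mid x_0) = \mathcal{N}(\sqrt{\bar\alpha_t}\, x_0,\ (1-\bar\alpha_t) I):
x_0 &= \frac{x_t + (1-\bar\alpha_t)\, \nabla_{x_t} \log p(x_t)}{\sqrt{\bar\alpha_t}} \\
% Equating with x_0 = (x_t - \sqrt{1-\bar\alpha_t}\, \epsilon_0) / \sqrt{\bar\alpha_t}:
\nabla_{x_t} \log p(x_t) &= -\frac{\epsilon_0}{\sqrt{1-\bar\alpha_t}}
\end{align}
```

Read this way, predicting the source input, the source noise, or the score are reparameterizations of one another, differing only by known scale factors that depend on the noise level.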


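Related topics

Dilated convolution

Mutually independent

北京阿比特科技有限公司

Registered address: Room 301-191, 3rd Floor, Building 2, No. 18 Yangfangdian Road, Haidian District, Beijing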
