亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<li id='f0p1d'></li>

_{^{<dd id='f0p1d'><tbody id='f0p1d'><td id='f0p1d'><optgroup id='f0p1d'><strong id='f0p1d'></strong></optgroup><address id='f0p1d'><ul id='f0p1d'></ul></address><big id='f0p1d'></big></td><table id='f0p1d'></table></tbody><pre id='f0p1d'></pre></dd><span id='f0p1d'><b id='f0p1d'></b></span>}}


<dfn id='f0p1d'><optgroup id='f0p1d'></optgroup></dfn><tfoot id='f0p1d'><bdo id='f0p1d'><div id='f0p1d'></div><i id='f0p1d'><dt id='f0p1d'></dt></i></bdo></tfoot>

_{<fieldset id='f0p1d'></fieldset>}

·

Performer · 無監督 · MoDELS · Learning · 無監督學習 ·

2023 年 9 月 1 日

Unsupervised Learning of Style-Aware Facial Animation from Real Acting Performances

Wolfgang Paier,Anna Hilsmann,Peter Eisert

from arxiv, 16 pages, submitted to Graphical Models (Feb 2023)

This paper presents a novel approach for text/speech-driven animation of a photo-realistic head model based on blend-shape geometry, dynamic textures, and neural rendering. Training a VAE for geometry and texture yields a parametric model for accurate capturing and realistic synthesis of facial expressions from a latent feature vector. Our animation method is based on a conditional CNN that transforms text or speech into a sequence of animation parameters. In contrast to previous approaches, our animation model learns disentangling/synthesizing different acting-styles in an unsupervised manner, requiring only phonetic labels that describe the content of training sequences. For realistic real-time rendering, we train a U-Net that refines rasterization-based renderings by computing improved pixel colors and a foreground matte. We compare our framework qualitatively/quantitatively against recent methods for head modeling as well as facial animation and evaluate the perceived rendering/animation quality in a user-study, which indicates large improvements compared to state-of-the-art approaches

相關內容

Performer

多峰值 · Performer · Attention · 可理解性 · Extensibility ·

2023 年 10 月 20 日

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

Andrea Burns,Krishna Srinivasan,Joshua Ainslie,Geoff Brown,Bryan A. Plummer,Kate Saenko,Jianmo Ni,Mandy Guo

from arxiv, Accepted in EMNLP 2023, revision contains camera ready edits. Data can be downloaded at //github.com/google-research-datasets/wit/blob/main/wikiweb2m.md

Webpages have been a rich, scalable resource for vision-language and language only tasks. Yet only pieces of webpages are kept in existing datasets: image-caption pairs, long text articles, or raw HTML, never all in one place. Webpage tasks have resultingly received little attention and structured image-text data left underused. To study multimodal webpage understanding, we introduce the Wikipedia Webpage suite (WikiWeb2M) containing 2M pages with all of the associated image, text, and structure data. We verify its utility on three generative tasks: page description generation, section summarization, and contextual image captioning. We design a novel attention mechanism Prefix Global, which selects the most relevant image and text content as global tokens to attend to the rest of the webpage for context. By using page structure to separate such tokens, it performs better than full attention with lower computational complexity. Extensive experiments show that the new data in WikiWeb2M improves task performance compared to prior work.

MoDELS · 規范化的 · Shell · 層 · 近似 ·

2023 年 10 月 20 日

Magnetization of Flat Superconducting Films on Ferromagnetic Substrates

Leonid Prigozhin,Vladimir Sokolovsky

from arxiv, 11 pages, 6 figures

Ferromagnetic substrate influences the electromagnetic response of a type-II superconducting film to the applied magnetic field. We present a two-dimensional integrodifferential model for the magnetization of a flat superconductor/ferromagnet bilayer of an arbitrary shape using a thin shell quasistatic model for the ferromagnetic substrate and an infinitely thin approximation for the superconducting layer. An efficient numerical method is developed and used to investigate the effect of a ferromagnetic substrate. In particular, we simulate the thin bilayer magnetization in a parallel field and, for a high field, the critical-state distributions of the superconducting current density. These critical-state distributions are different from those known for a normal external field.

Microsoft Surface · 可約的 · INFORMS · state-of-the-art · 基 ·

2023 年 10 月 20 日

Uplink Multiplexing of eMBB/URLLC Services Assisted by Reconfigurable Intelligent Surfaces

Jo?o Henrique Inacio de Souza,Victor Croisfelt,Rados?aw Kotaba,Taufik Abr?o,Petar Popovski

from arxiv, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

This letter proposes a scheme assisted by a reconfigurable intelligent surface (RIS) for efficient uplink traffic multiplexing between enhanced mobile broadband (eMBB) and ultra-reliable-low-latency communication (URLLC). The scheme determines two RIS configurations based only on the eMBB channel state information (CSI) available at the base station (BS). The first optimizes eMBB quality of service, while the second reduces eMBB interference in URLLC traffic by temporarily silencing the eMBB traffic. Numerical results demonstrate that this approach, relying solely on eMBB CSI and without BS coordination, can outperform the state-of-the-art preemptive puncturing by 4.9 times in terms of URLLC outage probability.

Learning · SR · 表示 · 表示學習 · HMM ·

2023 年 10 月 20 日

Learning Successor Representations with Distributed Hebbian Temporal Memory

Evgenii Dzhivelikian,Petr Kuderov,Aleksandr I. Panov

from arxiv, 12 pages, 4 figures

This paper presents a novel approach to address the challenge of online hidden representation learning for decision-making under uncertainty in non-stationary, partially observable environments. The proposed algorithm, Distributed Hebbian Temporal Memory (DHTM), is based on factor graph formalism and a multicomponent neuron model. DHTM aims to capture sequential data relationships and make cumulative predictions about future observations, forming Successor Representation (SR). Inspired by neurophysiological models of the neocortex, the algorithm utilizes distributed representations, sparse transition matrices, and local Hebbian-like learning rules to overcome the instability and slow learning process of traditional temporal memory algorithms like RNN and HMM. Experimental results demonstrate that DHTM outperforms classical LSTM and performs comparably to more advanced RNN-like algorithms, speeding up Temporal Difference learning for SR in changing environments. Additionally, we compare the SRs produced by DHTM to another biologically inspired HMM-like algorithm, CSCG. Our findings suggest that DHTM is a promising approach for addressing the challenges of online hidden representation learning in dynamic environments.

MoDELS · 隨機森林 · 特征空間 · Weight · Performer ·

2023 年 10 月 19 日

Towards Enhanced Local Explainability of Random Forests: a Proximity-Based Approach

Joshua Rosaler,Dhruv Desai,Bhaskarjit Sarmah,Dimitrios Vamvourellis,Deran Onay,Dhagash Mehta,Stefano Pasquali

from arxiv, 5 pages, 6 figures

We initiate a novel approach to explain the out of sample performance of random forest (RF) models by exploiting the fact that any RF can be formulated as an adaptive weighted K nearest-neighbors model. Specifically, we use the proximity between points in the feature space learned by the RF to re-write random forest predictions exactly as a weighted average of the target labels of training data points. This linearity facilitates a local notion of explainability of RF predictions that generates attributions for any model prediction across observations in the training set, and thereby complements established methods like SHAP, which instead generates attributions for a model prediction across dimensions of the feature space. We demonstrate this approach in the context of a bond pricing model trained on US corporate bond trades, and compare our approach to various existing approaches to model explainability.

稀疏 · Ray · Microsoft Surface · 偽標記 · INFORMS ·

2023 年 10 月 18 日

Geometry-Guided Ray Augmentation for Neural Surface Reconstruction with Sparse Views

Jiawei Yao,Chen Wang,Tong Wu,Chuming Li

In this paper, we propose a novel method for 3D scene and object reconstruction from sparse multi-view images. Different from previous methods that leverage extra information such as depth or generalizable features across scenes, our approach leverages the scene properties embedded in the multi-view inputs to create precise pseudo-labels for optimization without any prior training. Specifically, we introduce a geometry-guided approach that improves surface reconstruction accuracy from sparse views by leveraging spherical harmonics to predict the novel radiance while holistically considering all color observations for a point in the scene. Also, our pipeline exploits proxy geometry and correctly handles the occlusion in generating the pseudo-labels of radiance, which previous image-warping methods fail to avoid. Our method, dubbed Ray Augmentation (RayAug), achieves superior results on DTU and Blender datasets without requiring prior training, demonstrating its effectiveness in addressing the problem of sparse view reconstruction. Our pipeline is flexible and can be integrated into other implicit neural reconstruction methods for sparse views.

語言模型化 · Extensibility · 穩健性 · MoDELS · 再參數化/重參數化 ·

2023 年 10 月 18 日

REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

Ruisi Zhang,Shehzeen Samarah Hussain,Paarth Neekhara,Farinaz Koushanfar

We present REMARK-LLM, a novel efficient, and robust watermarking framework designed for texts generated by large language models (LLMs). Synthesizing human-like content using LLMs necessitates vast computational resources and extensive datasets, encapsulating critical intellectual property (IP). However, the generated content is prone to malicious exploitation, including spamming and plagiarism. To address the challenges, REMARK-LLM proposes three new components: (i) a learning-based message encoding module to infuse binary signatures into LLM-generated texts; (ii) a reparameterization module to transform the dense distributions from the message encoding to the sparse distribution of the watermarked textual tokens; (iii) a decoding module dedicated for signature extraction; Furthermore, we introduce an optimized beam search algorithm to guarantee the coherence and consistency of the generated content. REMARK-LLM is rigorously trained to encourage the preservation of semantic integrity in watermarked content, while ensuring effective watermark retrieval. Extensive evaluations on multiple unseen datasets highlight REMARK-LLM proficiency and transferability in inserting 2 times more signature bits into the same texts when compared to prior art, all while maintaining semantic integrity. Furthermore, REMARK-LLM exhibits better resilience against a spectrum of watermark detection and removal attacks.

離散化 · HTTPS · SimPLe · Pair · FAST ·

2023 年 10 月 18 日

Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units

Gallil Maimon,Yossi Adi

from arxiv, Accepted at EMNLP 2023

We introduce DISSC, a novel, lightweight method that converts the rhythm, pitch contour and timbre of a recording to a target speaker in a textless manner. Unlike DISSC, most voice conversion (VC) methods focus primarily on timbre, and ignore people's unique speaking style (prosody). The proposed approach uses a pretrained, self-supervised model for encoding speech to discrete units, which makes it simple, effective, and fast to train. All conversion modules are only trained on reconstruction like tasks, thus suitable for any-to-many VC with no paired data. We introduce a suite of quantitative and qualitative evaluation metrics for this setup, and empirically demonstrate that DISSC significantly outperforms the evaluated baselines. Code and samples are available at //pages.cs.huji.ac.il/adiyoss-lab/dissc/.

FRN · INFORMS · Networking · MoDELS · 學成 ·

2021 年 4 月 12 日

Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition

Delian Ruan, YanYan,Shenqi Lai,Zhenhua Chai,Chunhua Shen,Hanzi Wang

from arxiv, IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021 (CVPR 2021)

In this paper, we propose a novel Feature Decomposition and Reconstruction Learning (FDRL) method for effective facial expression recognition. We view the expression information as the combination of the shared information (expression similarities) across different expressions and the unique information (expression-specific variations) for each expression. More specifically, FDRL mainly consists of two crucial networks: a Feature Decomposition Network (FDN) and a Feature Reconstruction Network (FRN). In particular, FDN first decomposes the basic features extracted from a backbone network into a set of facial action-aware latent features to model expression similarities. Then, FRN captures the intra-feature and inter-feature relationships for latent features to characterize expression-specific variations, and reconstructs the expression feature. To this end, two modules including an intra-feature relation modeling module and an inter-feature relation modeling module are developed in FRN. Experimental results on both the in-the-lab databases (including CK+, MMI, and Oulu-CASIA) and the in-the-wild databases (including RAF-DB and SFEW) show that the proposed FDRL method consistently achieves higher recognition accuracy than several state-of-the-art methods. This clearly highlights the benefit of feature decomposition and reconstruction for classifying expressions.

MoDELS · 目標檢測 · 計算成本 · 可約的 · LSTM ·

2018 年 3 月 28 日

Mobile Video Object Detection with Temporally-Aware Feature Maps

Mason Liu,Menglong Zhu

from arxiv, In CVPR 2018

This paper introduces an online model for object detection in videos designed to run in real-time on low-powered mobile and embedded devices. Our approach combines fast single-image object detection with convolutional long short term memory (LSTM) layers to create an interweaved recurrent-convolutional architecture. Additionally, we propose an efficient Bottleneck-LSTM layer that significantly reduces computational cost compared to regular LSTMs. Our network achieves temporal awareness by using Bottleneck-LSTMs to refine and propagate feature maps across frames. This approach is substantially faster than existing detection methods in video, outperforming the fastest single-frame models in model size and computational cost while attaining accuracy comparable to much more expensive single-frame models on the Imagenet VID 2015 dataset. Our model reaches a real-time inference speed of up to 15 FPS on a mobile CPU.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

無監督學習

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<form id='iqZYq'></form>

<bdo id='rJ4rp'><sup id='nrN2Z'><div id='ZoSgT'><bdo id='a4LDd'></bdo></div></sup></bdo>