成人午夜性影院视频_国产日韩亚洲欧美看国产_日韩一区二区在线_免费一区日本视频在线_国产免费一区二区在线观看_日韩首页高清无码专区免费_日本少妇又黄又爽视频免费看

Guangyan Zhang,Thomas Merritt,Manuel Sam Ribeiro,Biel Tura-Vecino,Kayoko Yanagisawa,Kamil Pokora,Abdelhamid Ezzerg,Sebastian Cygert,Ammar Abbas,Piotr Bilinski,Roberto Barra-Chicote,Daniel Korzekwa,Jaime Lorenzo-Trueba

from arxiv, 5 pages, 2 figures, 5 tables. Interspeech 2023

Neural text-to-speech systems are often optimized on L1/L2 losses, which make strong assumptions about the distributions of the target data space. Aiming to improve those assumptions, Normalizing Flows and Diffusion Probabilistic Models were recently proposed as alternatives. In this paper, we compare traditional L1/L2-based approaches to diffusion and flow-based approaches for the tasks of prosody and mel-spectrogram prediction for text-to-speech synthesis. We use a prosody model to generate log-f0 and duration features, which are used to condition an acoustic model that generates mel-spectrograms. Experimental results demonstrate that the flow-based model achieves the best performance for spectrogram prediction, improving over equivalent diffusion and L1 models. Meanwhile, both diffusion and flow-based prosody predictors result in significant improvements over a typical L2-trained prosody models.

相關內容

MoDELS

關注 43

ACM/IEEE第23屆模型驅動工程語言和系統國際會議，是模型驅動軟件和系統工程的首要會議系列，由ACM-SIGSOFT和IEEE-TCSE支持組織。自1998年以來，模型涵蓋了建模的各個方面，從語言和方法到工具和應用程序。模特的參加者來自不同的背景，包括研究人員、學者、工程師和工業專業人士。MODELS 2019是一個論壇，參與者可以圍繞建模和模型驅動的軟件和系統交流前沿研究成果和創新實踐經驗。今年的版本將為建模社區提供進一步推進建模基礎的機會，并在網絡物理系統、嵌入式系統、社會技術系統、云計算、大數據、機器學習、安全、開源等新興領域提出建模的創新應用以及可持續性。官網鏈接： · 門控循環單元 · 循環神經網絡 · 門控 · 模型評估 ·

2023 年 9 月 21 日

Parallelizing non-linear sequential models over the sequence length

Yi Heng Lim,Qi Zhu,Joshua Selfridge,Muhammad Firmansyah Kasim

Sequential models, such as Recurrent Neural Networks and Neural Ordinary Differential Equations, have long suffered from slow training due to their inherent sequential nature. For many years this bottleneck has persisted, as many thought sequential models could not be parallelized. We challenge this long-held belief with our parallel algorithm that accelerates GPU evaluation of sequential models by up to 3 orders of magnitude faster without compromising output accuracy. The algorithm does not need any special structure in the sequential models' architecture, making it applicable to a wide range of architectures. Using our method, training sequential models can be more than 10 times faster than the common sequential method without any meaningful difference in the training results. Leveraging this accelerated training, we discovered the efficacy of the Gated Recurrent Unit in a long time series classification problem with 17k time samples. By overcoming the training bottleneck, our work serves as the first step to unlock the potential of non-linear sequential models for long sequence problems.

語音識別 · Integration · Processing（編程語言） · MoDELS · 自動語音識別 ·

2023 年 9 月 21 日

CPPF: A contextual and post-processing-free model for automatic speech recognition

Lei Zhang,Zhengkun Tian,Xiang Chen,Jiaming Sun,Hongyu Xiang,Ke Ding,Guanglu Wan

from arxiv, Submitted to ICASSP2024

ASR systems have become increasingly widespread in recent years. However, their textual outputs often require post-processing tasks before they can be practically utilized. To address this issue, we draw inspiration from the multifaceted capabilities of LLMs and Whisper, and focus on integrating multiple ASR text processing tasks related to speech recognition into the ASR model. This integration not only shortens the multi-stage pipeline, but also prevents the propagation of cascading errors, resulting in direct generation of post-processed text. In this study, we focus on ASR-related processing tasks, including Contextual ASR and multiple ASR post processing tasks. To achieve this objective, we introduce the CPPF model, which offers a versatile and highly effective alternative to ASR processing. CPPF seamlessly integrates these tasks without any significant loss in recognition performance.

Integration · 泛函 · 平滑 · 樣本 · CASES ·

2023 年 9 月 21 日

Sparse-grid sampling recovery and numerical integration of functions having mixed smoothness

Dinh D?ng

We give a short survey of recent results on sparse-grid linear algorithms of approximate recovery and integration of functions possessing a unweighted or weighted Sobolev mixed smoothness based on their sampled values at a certain finite set. Some of them are extended to more general cases.

得分 · motivation · 評分函數 · 泛函 · 數據集 ·

2023 年 9 月 20 日

Distribution and volume based scoring for Isolation Forests

Hichem Dhouib,Alissa Wilms,Paul Boes

from arxiv, 7 pages

We make two contributions to the Isolation Forest method for anomaly and outlier detection. The first contribution is an information-theoretically motivated generalisation of the score function that is used to aggregate the scores across random tree estimators. This generalisation allows one to take into account not just the ensemble average across trees but instead the whole distribution. The second contribution is an alternative scoring function at the level of the individual tree estimator, in which we replace the depth-based scoring of the Isolation Forest with one based on hyper-volumes associated to an isolation tree's leaf nodes. We motivate the use of both of these methods on generated data and also evaluate them on 34 datasets from the recent and exhaustive ``ADBench'' benchmark, finding significant improvement over the standard isolation forest for both variants on some datasets and improvement on average across all datasets for one of the two variants. The code to reproduce our results is made available as part of the submission.

表示 · 不變 · 描述符 · 穩健性 · 不變性 ·

2023 年 9 月 20 日

Enhancing motion trajectory segmentation of rigid bodies using a novel screw-based trajectory-shape representation

Arno Verduyn,Maxim Vochten,Joris De Schutter

from arxiv, This work has been submitted to the IEEE International Conference on Robotics and Automation (ICRA) for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Trajectory segmentation refers to dividing a trajectory into meaningful consecutive sub-trajectories. This paper focuses on trajectory segmentation for 3D rigid-body motions. Most segmentation approaches in the literature represent the body's trajectory as a point trajectory, considering only its translation and neglecting its rotation. We propose a novel trajectory representation for rigid-body motions that incorporates both translation and rotation, and additionally exhibits several invariant properties. This representation consists of a geometric progress rate and a third-order trajectory-shape descriptor. Concepts from screw theory were used to make this representation time-invariant and also invariant to the choice of body reference point. This new representation is validated for a self-supervised segmentation approach, both in simulation and using real recordings of human-demonstrated pouring motions. The results show a more robust detection of consecutive submotions with distinct features and a more consistent segmentation compared to conventional representations. We believe that other existing segmentation methods may benefit from using this trajectory representation to improve their invariance.

MATLAB · MoDELS · 線性的 · GPU · Performer ·

2023 年 9 月 20 日

Matrix-based implementation and GPU acceleration of linearized ordinary state-based peridynamic models in MATLAB

Tao Ni,Mirco Zaccariotto,Qizhi Zhu,Ugo Galvanetto

from arxiv, 32 pages, 16 figures

Ordinary state-based peridynamic (OSB-PD) models have an unparalleled capability to simulate crack propagation phenomena in solids with arbitrary Poisson's ratio. However, their non-locality also leads to prohibitively high computational cost. In this paper, a fast solution scheme for OSB-PD models based on matrix operation is introduced, with which, the graphics processing units (GPUs) are used to accelerate the computation. For the purpose of comparison and verification, a commonly used solution scheme based on loop operation is also presented. An in-house software is developed in MATLAB. Firstly, the vibration of a cantilever beam is solved for validating the loop- and matrix-based schemes by comparing the numerical solutions to those produced by a FEM software. Subsequently, two typical dynamic crack propagation problems are simulated to illustrate the effectiveness of the proposed schemes in solving dynamic fracture problems. Finally, the simulation of the Brokenshire torsion experiment is carried out by using the matrix-based scheme, and the similarity in the shapes of the experimental and numerical broken specimens further demonstrates the ability of the proposed approach to deal with 3D non-planar fracture problems. In addition, the speed-up of the matrix-based scheme with respect to the loop-based scheme and the performance of the GPU acceleration are investigated. The results emphasize the high computational efficiency of the matrix-based implementation scheme.

估計/估計量 · GPS · 樣本 · 統計量 · 值域 ·

2023 年 9 月 19 日

Home range estimation under a restricted sampling scheme

Alejandro Cholaquidis,Ricardo Fraiman,Manuel Hernandez-Banadik

The analysis of animal movement has gained attention recently. New continuous-time models and statistical methods have been developed to estimate some sets related to their movements, such as the home-range and the core-area among others, when the information of the trajectory is provided by a GPS. Because data transfer costs and GPS battery life are practical constraints in ecological studies, the experimental designer must make critical sampling decisions in order to maximize information. To capture fine-scale motion, long-term behavior must be sacrificed, and vice versa. To overcome this limitation, we introduce the on--off sampling scheme, where the GPS is alternately on and off. This scheme is already used in practice but with insufficient statistical theoretical support. We prove the consistency of home-range estimators with an underlying reflected diffusion model under this sampling method (in terms of the Hausdorff distance). The same rate of convergence is achieved as in the case where the GPS is always on for the whole experiment. This is illustrated by a simulation study and real data. We also provide estimators of the stationary distribution, its level sets (which give estimators of the core area), and the drift function.

規范化的 · 正則化項 · 估計/估計量 · MoDELS · 設計 ·

2023 年 9 月 19 日

The Lasso with general Gaussian designs with applications to hypothesis testing

Michael Celentano,Andrea Montanari,Yuting Wei

from arxiv, final version accepted to Annals of Statistics

The Lasso is a method for high-dimensional regression, which is now commonly used when the number of covariates $p$ is of the same order or larger than the number of observations $n$. Classical asymptotic normality theory does not apply to this model due to two fundamental reasons: $(1)$ The regularized risk is non-smooth; $(2)$ The distance between the estimator $\widehat{\boldsymbol{\theta}}$ and the true parameters vector $\boldsymbol{\theta}^*$ cannot be neglected. As a consequence, standard perturbative arguments that are the traditional basis for asymptotic normality fail. On the other hand, the Lasso estimator can be precisely characterized in the regime in which both $n$ and $p$ are large and $n/p$ is of order one. This characterization was first obtained in the case of Gaussian designs with i.i.d. covariates: here we generalize it to Gaussian correlated designs with non-singular covariance structure. This is expressed in terms of a simpler ``fixed-design'' model. We establish non-asymptotic bounds on the distance between the distribution of various quantities in the two models, which hold uniformly over signals $\boldsymbol{\theta}^*$ in a suitable sparsity class and over values of the regularization parameter. As an application, we study the distribution of the debiased Lasso and show that a degrees-of-freedom correction is necessary for computing valid confidence intervals.

語音增強 · 無監督 · MoDELS · state-of-the-art · 噪聲 ·

2023 年 9 月 19 日

Unsupervised speech enhancement with diffusion-based generative models

Berné Nortier,Mostafa Sadeghi,Romain Serizel

Recently, conditional score-based diffusion models have gained significant attention in the field of supervised speech enhancement, yielding state-of-the-art performance. However, these methods may face challenges when generalising to unseen conditions. To address this issue, we introduce an alternative approach that operates in an unsupervised manner, leveraging the generative power of diffusion models. Specifically, in a training phase, a clean speech prior distribution is learnt in the short-time Fourier transform (STFT) domain using score-based diffusion models, allowing it to unconditionally generate clean speech from Gaussian noise. Then, we develop a posterior sampling methodology for speech enhancement by combining the learnt clean speech prior with a noise model for speech signal inference. The noise parameters are simultaneously learnt along with clean speech estimation through an iterative expectationmaximisation (EM) approach. To the best of our knowledge, this is the first work exploring diffusion-based generative models for unsupervised speech enhancement, demonstrating promising results compared to a recent variational auto-encoder (VAE)-based unsupervised approach and a state-of-the-art diffusion-based supervised method. It thus opens a new direction for future research in unsupervised speech enhancement.

近似 · 控制器 · 易處理的 · prototype · 穩健性 ·

2023 年 9 月 14 日

Guaranteed approximations of arbitrarily quantified reachability problems

Eric Goubault,Sylvie Putot

We propose an approach to compute inner and outer-approximations of the sets of values satisfying constraints expressed as arbitrarily quantified formulas. Such formulas arise for instance when specifying important problems in control such as robustness, motion planning or controllers comparison. We propose an interval-based method which allows for tractable but tight approximations. We demonstrate its applicability through a series of examples and benchmarks using a prototype implementation.