亚洲国产最新AV片_国产精品毛片久久久久_你比我老公的又大又粗办的还舒坦_免费在线观看国产一区二区三区网站_五月丁香亚洲日本婷婷激情_囗工全彩肉肉无遮挡无码_亚洲特级一级黄色视频

We introduce a novel class of generative models based on piecewise deterministic Markov processes (PDMPs), a family of non-diffusive stochastic processes consisting of deterministic motion and random jumps at random times. Similarly to diffusions, such Markov processes admit time reversals that turn out to be PDMPs as well. We apply this observation to three PDMPs considered in the literature: the Zig-Zag process, Bouncy Particle Sampler, and Randomised Hamiltonian Monte Carlo. For these three particular instances, we show that the jump rates and kernels of the corresponding time reversals admit explicit expressions depending on some conditional densities of the PDMP under consideration before and after a jump. Based on these results, we propose efficient training procedures to learn these characteristics and consider methods to approximately simulate the reverse process. Finally, we provide bounds in the total variation distance between the data distribution and the resulting distribution of our model in the case where the base distribution is the standard $d$-dimensional Gaussian distribution. Promising numerical simulations support further investigations into this class of models.

相關內容

Processing（編(bian)程語言）

關注 121

Processing 是(shi)一門開源編(bian)(bian)程語言和(he)(he)與之配套的集成開發環境(jing)（IDE）的名稱。Processing 在電子藝(yi)術和(he)(he)視(shi)覺設(she)計社區被用來教授編(bian)(bian)程基(ji)礎，并運用于大量的新媒體和(he)(he)互動藝(yi)術作品(pin)中。

MoDELS · 語言模型化 · INFORMS · Processing（編程語言） · 推斷 ·

2024 年 12 月 18 日

Alignment faking in large language models

Ryan Greenblatt,Carson Denison,Benjamin Wright,Fabien Roger,Monte MacDiarmid,Sam Marks,Johannes Treutlein,Tim Belonax,Jack Chen,David Duvenaud,Akbir Khan,Julian Michael,S?ren Mindermann,Ethan Perez,Linda Petrini,Jonathan Uesato,Jared Kaplan,Buck Shlegeris,Samuel R. Bowman,Evan Hubinger

We present a demonstration of a large language model engaging in alignment faking: selectively complying with its training objective in training to prevent modification of its behavior out of training. First, we give Claude 3 Opus a system prompt stating it is being trained to answer all queries, even harmful ones, which conflicts with its prior training to refuse such queries. To allow the model to infer when it is in training, we say it will be trained only on conversations with free users, not paid users. We find the model complies with harmful queries from free users 14% of the time, versus almost never for paid users. Explaining this gap, in almost all cases where the model complies with a harmful query from a free user, we observe explicit alignment-faking reasoning, with the model stating it is strategically answering harmful queries in training to preserve its preferred harmlessness behavior out of training. Next, we study a more realistic setting where information about the training process is provided not in a system prompt, but by training on synthetic documents that mimic pre-training data--and observe similar alignment faking. Finally, we study the effect of actually training the model to comply with harmful queries via reinforcement learning, which we find increases the rate of alignment-faking reasoning to 78%, though also increases compliance even out of training. We additionally observe other behaviors such as the model exfiltrating its weights when given an easy opportunity. While we made alignment faking easier by telling the model when and by what criteria it was being trained, we did not instruct the model to fake alignment or give it any explicit goal. As future models might infer information about their training process without being told, our results suggest a risk of alignment faking in future models, whether due to a benign preference--as in this case--or not.

示例 · MoDELS · 優化器 · Performer · Better ·

2024 年 12 月 18 日

IDEQ: an improved diffusion model for the TSP

Mickael Basson,Philippe Preux

We investigate diffusion models to solve the Traveling Salesman Problem. Building on the recent DIFUSCO and T2TCO approaches, we propose IDEQ. IDEQ improves the quality of the solutions by leveraging the constrained structure of the state space of the TSP. Another key component of IDEQ consists in replacing the last stages of DIFUSCO curriculum learning by considering a uniform distribution over the Hamiltonian tours whose orbits by the 2-opt operator converge to the optimal solution as the training objective. Our experiments show that IDEQ improves the state of the art for such neural network based techniques on synthetic instances. More importantly, our experiments show that IDEQ performs very well on the instances of the TSPlib, a reference benchmark in the TSP community: it closely matches the performance of the best heuristics, LKH3, being even able to obtain better solutions than LKH3 on 2 instances of the TSPlib defined on 1577 and 3795 cities. IDEQ obtains 0.3% optimality gap on TSP instances made of 500 cities, and 0.5% on TSP instances with 1000 cities. This sets a new SOTA for neural based methods solving the TSP. Moreover, IDEQ exhibits a lower variance and better scales-up with the number of cities with regards to DIFUSCO and T2TCO.

相同 · INFORMS · 模型評估 · Continuity · 相關系數 ·

2024 年 12 月 18 日

Spatio-Temporal Fuzzy-oriented Multi-Modal Meta-Learning for Fine-grained Emotion Recognition

Jingyao Wang,Yuxuan Yang,Wenwen Qiang,Changwen Zheng,Hui Xiong

from arxiv, 13 pages, Submitted to TMM in 30-May-2024

Fine-grained emotion recognition (FER) plays a vital role in various fields, such as disease diagnosis, personalized recommendations, and multimedia mining. However, existing FER methods face three key challenges in real-world applications: (i) they rely on large amounts of continuously annotated data to ensure accuracy since emotions are complex and ambiguous in reality, which is costly and time-consuming; (ii) they cannot capture the temporal heterogeneity caused by changing emotion patterns, because they usually assume that the temporal correlation within sampling periods is the same; (iii) they do not consider the spatial heterogeneity of different FER scenarios, that is, the distribution of emotion information in different data may have bias or interference. To address these challenges, we propose a Spatio-Temporal Fuzzy-oriented Multi-modal Meta-learning framework (ST-F2M). Specifically, ST-F2M first divides the multi-modal videos into multiple views, and each view corresponds to one modality of one emotion. Multiple randomly selected views for the same emotion form a meta-training task. Next, ST-F2M uses an integrated module with spatial and temporal convolutions to encode the data of each task, reflecting the spatial and temporal heterogeneity. Then it adds fuzzy semantic information to each task based on generalized fuzzy rules, which helps handle the complexity and ambiguity of emotions. Finally, ST-F2M learns emotion-related general meta-knowledge through meta-recurrent neural networks to achieve fast and robust fine-grained emotion recognition. Extensive experiments show that ST-F2M outperforms various state-of-the-art methods in terms of accuracy and model efficiency. In addition, we construct ablation studies and further analysis to explore why ST-F2M performs well.

MoDELS · 塊 · 優化器 · Integration · Neural Networks ·

2024 年 12 月 16 日

Enhancing Transformer-based models for Long Sequence Time Series Forecasting via Structured Matrix

Zhicheng Zhang,Yong Wang,Shaoqi Tan,Bowei Xia,Yujie Luo

Recently, Transformer-based models for long sequence time series forecasting have demonstrated promising results. The self-attention mechanism as the core component of these Transformer-based models exhibits great potential in capturing various dependencies among data points. Despite these advancements, it has been a subject of concern to improve the efficiency of the self-attention mechanism. Unfortunately, current specific optimization methods are facing the challenges in applicability and scalability for the future design of long sequence time series forecasting models. Hence, in this article, we propose a novel architectural framework that enhances Transformer-based models through the integration of Surrogate Attention Blocks (SAB) and Surrogate Feed-Forward Neural Network Blocks (SFB). The framework reduces both time and space complexity by the replacement of the self-attention and feed-forward layers with SAB and SFB while maintaining their expressive power and architectural advantages. The equivalence of this substitution is fully demonstrated. The extensive experiments on 10 Transformer-based models across five distinct time series tasks demonstrate an average performance improvement of 12.4%, alongside 61.3% reduction in parameter counts.

MoDELS · 隱馬爾科夫模型 · Markov · 推斷 · 統計量 ·

2024 年 12 月 16 日

Autoregressive hidden Markov models for high-resolution animal movement data

Ferdinand V. Stoye,Annika Hoyer,Roland Langrock

New types of high-resolution animal movement data allow for increasingly comprehensive biological inference, but method development to meet the statistical challenges associated with such data is lagging behind. In this contribution, we extend the commonly applied hidden Markov models for step lengths and turning angles to address the specific requirements posed by high-resolution movement data, in particular the very strong within-state correlation induced by the momentum in the movement. The models feature autoregressive components of general order in both the step length and the turning angle variable, with the possibility to automate the selection of the autoregressive degree using a lasso approach. In a simulation study, we identify potential for improved inference when using the new model instead of the commonly applied basic hidden Markov model in cases where there is strong within-state autocorrelation. The practical use of the model is illustrated using high-resolution movement tracks of terns foraging near an anthropogenic structure causing turbulent water flow features.

MoDELS · 估計/估計量 · Processing（編程語言） · 得分 · Analysis ·

2024 年 12 月 15 日

Wasserstein Bounds for generative diffusion models with Gaussian tail targets

Xixian Wang,Zhongjian Wang

We present an estimate of the Wasserstein distance between the data distribution and the generation of score-based generative models, assuming an $\epsilon$-accurate approximation of the score and a Gaussian-type tail behavior of the data distribution. The complexity bound in dimension is $O(\sqrt{d})$, with a logarithmic constant. Such Gaussian tail assumption applies to the distribution of a compact support target with early stopping technique and the Bayesian posterior with a bounded observation operator. Corresponding convergence and complexity bounds are derived. The crux of the analysis lies in the Lipchitz bound of the score, which is related to the Hessian estimate of a viscous Hamilton-Jacobi equation (vHJ). This latter is demonstrated by employing a dimension independent kernel estimate. Consequently, our complexity bound scales linearly (up to a logarithmic constant) with the square root of the trace of the covariance operator, which relates to the invariant distribution of forward process. Our analysis also extends to the probabilistic flow ODE, as the sampling process.

變換 · 分解的 · 近似 · 離散化 · 線性的 ·

2024 年 12 月 15 日

An explicit spectral decomposition of the ADRT

Weilin Li,Karl Otness,Kui Ren,Donsub Rim

from arxiv, 26 pages, 11 figures

The approximate discrete Radon transform (ADRT) is a hierarchical multiscale approximation of the Radon transform. In this paper, we factor the ADRT into a product of linear transforms that resemble convolutions, and derive an explicit spectral decomposition of each factor. We further show that this implies - for data lying in the range of the ADRT - that the transform of an $N \times N$ image can be formally inverted with complexity $\mathcal{O}(N^2 \log^2 N)$. We numerically test the accuracy of the inverse on images of moderate sizes and find that it is competitive with existing iterative algorithms in this special regime.

contrastive · MoDELS · INTERACT · 數學 · Liquid ·

2024 年 12 月 14 日

Mathematical models for nonlinear ultrasound contrast imaging with microbubbles

Vanja Nikoli?,Teresa Rauscher

Ultrasound contrast imaging is a specialized imaging technique that applies microbubble contrast agents to traditional medical sonography, providing real-time visualization of blood flow and vessels. Gas-filled microbubbles are injected into the body, where they undergo compression and rarefaction and interact nonlinearly with the ultrasound waves. Therefore, the propagation of sound through a bubbly liquid is a strongly nonlinear problem that can be modeled by a nonlinear acoustic wave equation for the propagation of the pressure waves coupled via the source terms to a nonlinear ordinary differential equation of Rayleigh-Plesset type for the bubble dynamics. In this work, we first derive a hierarchy of such coupled models based on constitutive laws. We then focus on the coupling of Westervelt's acoustic equation to Rayleigh-Plesset type equations, where we rigorously show the existence of solutions locally in time under suitable conditions on the initial pressure-microbubble data and final time. Thirdly, we devise and discuss numerical experiments on both single-bubble dynamics and the interaction of microbubbles with ultrasound waves.

MoDELS · 推斷 · 似然 · 估計/估計量 · 貝葉斯推斷 ·

2024 年 12 月 14 日

Bayesian inference on Cox regression models using catalytic prior distributions

Weihao Li,Dongming Huang

The Cox proportional hazards model (Cox model) is a popular model for survival data analysis. When the sample size is small relative to the dimension of the model, the standard maximum partial likelihood inference is often problematic. In this work, we propose the Cox catalytic prior distributions for Bayesian inference on Cox models, which is an extension of a general class of prior distributions originally designed for stabilizing complex parametric models. The Cox catalytic prior is formulated as a weighted likelihood of the regression coefficients based on synthetic data and a surrogate baseline hazard constant. This surrogate hazard can be either provided by the user or estimated from the data, and the synthetic data are generated from the predictive distribution of a fitted simpler model. For point estimation, we derive an approximation of the marginal posterior mode, which can be computed conveniently as a regularized log partial likelihood estimator. We prove that our prior distribution is proper and the resulting estimator is consistent under mild conditions. In simulation studies, our proposed method outperforms standard maximum partial likelihood inference and is on par with existing shrinkage methods. We further illustrate the application of our method to a real dataset.

稀疏 · 流形 · 近似 · 線性的 · 向量化 ·

2024 年 12 月 12 日

Empirical sparse regression on quadratic manifolds

Paul Schwerdtner,Serkan Gugercin,Benjamin Peherstorfer

from arxiv, 29 pages, 11 figures

Approximating field variables and data vectors from sparse samples is a key challenge in computational science. Widely used methods such as gappy proper orthogonal decomposition and empirical interpolation rely on linear approximation spaces, limiting their effectiveness for data representing transport-dominated and wave-like dynamics. To address this limitation, we introduce quadratic manifold sparse regression, which trains quadratic manifolds with a sparse greedy method and computes approximations on the manifold through novel nonlinear projections of sparse samples. The nonlinear approximations obtained with quadratic manifold sparse regression achieve orders of magnitude higher accuracies than linear methods on data describing transport-dominated dynamics in numerical experiments.