亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<dir id='nzyS6'><del id='Ffzoy'><del id='Go4XB'></del><pre id='RA4rl'><pre id='42QLj'><option id='GaGKp'><address id='D9aO7'></address><bdo id='6irlO'><tr id='V9101'><acronym id='v5udC'><pre id='vsIyQ'></pre></acronym><div id='wSWIz'></div></tr></bdo></option></pre><small id='EJY96'><address id='Hoo1G'><u id='JiSMz'><legend id='6aRUX'><option id='LQCh4'><abbr id='lzL9K'></abbr><li id='mvSVk'><pre id='mj2ds'></pre></li></option></legend><select id='hPEH3'></select></u></address></small></pre></del><sup id='mK60V'></sup><blockquote id='vbcUX'><dt id='YgRfU'></dt></blockquote><blockquote id='iMqpt'></blockquote></dir><tt id='oz7Cn'></tt><u id='WiNBQ'><tt id='6l6oQ'><form id='LEMnp'></form></tt><td id='lt98v'><dt id='gCWtk'></dt></td></u>

<code id='QWY4m'><i id='vxoOz'><q id='unZ8Z'><legend id='V2NvT'><pre id='Tupq2'><style id='bevNc'><acronym id='7yIhY'><i id='nuaWA'><form id='ghhwt'><option id='4EbEh'><center id='MvnTr'></center></option></form></i></acronym></style><tt id='wGIcl'></tt></pre></legend></q></i></code><center id='qbLMC'></center>

<dd id='Y6gHu'></dd>

<style id='9jrTk'></style><sub id='zCWio'><dfn id='0X0bt'><abbr id='EvCsd'><big id='K2Tsx'><bdo id='s8xa4'></bdo></big></abbr></dfn></sub>_{<dir id='CfWpJ'></dir>}

·

估計/估計量 · Reverberation · 語音增強 · INTERACT · MoDELS ·

2023 年 12 月 4 日

Head Orientation Estimation with Distributed Microphones Using Speech Radiation Patterns

Kaspar Müller,Bilgesu ?akmak,Paul Didier,Simon Doclo,Jan ?stergaard,Tobias Wolff

from arxiv, 6 pages, submitted to 57th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 2023

Determining the head orientation of a talker is not only beneficial for various speech signal processing applications, such as source localization or speech enhancement, but also facilitates intuitive voice control and interaction with smart environments or modern car assistants. Most approaches for head orientation estimation are based on visual cues. However, this requires camera systems which often are not available. We present an approach which purely uses audio signals captured with only a few distributed microphones around the talker. Specifically, we propose a novel method that directly incorporates measured or modeled speech radiation patterns to infer the talker's orientation during active speech periods based on a cosine similarity measure. Moreover, an automatic gain adjustment technique is proposed for uncalibrated, irregular microphone setups, such as ad-hoc sensor networks. In experiments with signals recorded in both anechoic and reverberant environments, the proposed method outperforms state-of-the-art approaches, using either measured or modeled speech radiation patterns.

相關內容

估計/估計量

估計/估計量

Performer · 語音增強 · 不變 · MoDELS · Reverberation ·

2024 年 1 月 25 日

Improving Design of Input Condition Invariant Speech Enhancement

Wangyou Zhang,Jee-weon Jung,Shinji Watanabe,Yanmin Qian

from arxiv, Accepted by ICASSP 2024, 5 pages, 2 figures, 3 tables

Building a single universal speech enhancement (SE) system that can handle arbitrary input is a demanded but underexplored research topic. Towards this ultimate goal, one direction is to build a single model that handles diverse audio duration, sampling frequencies, and microphone variations in noisy and reverberant scenarios, which we define here as "input condition invariant SE". Such a model was recently proposed showing promising performance; however, its multi-channel performance degraded severely in real conditions. In this paper we propose novel architectures to improve the input condition invariant SE model so that performance in simulated conditions remains competitive while real condition degradation is much mitigated. For this purpose, we redesign the key components that comprise such a system. First, we identify that the channel-modeling module's generalization to unseen scenarios can be sub-optimal and redesign this module. We further introduce a two-stage training strategy to enhance training efficiency. Second, we propose two novel dual-path time-frequency blocks, demonstrating superior performance with fewer parameters and computational costs compared to the existing method. All proposals combined, experiments on various public datasets validate the efficacy of the proposed model, with significantly improved performance on real conditions. Recipe with full model details is released at //github.com/espnet/espnet.

大語言模型 · 圖 · 語言模型化 · 推薦系統 · MoDELS ·

2024 年 1 月 25 日

Enhancing Recommender Systems with Large Language Model Reasoning Graphs

Yan Wang,Zhixuan Chu,Xin Ouyang,Simeng Wang,Hongyan Hao,Yue Shen,Jinjie Gu,Siqiao Xue,James Y Zhang,Qing Cui,Longfei Li,Jun Zhou,Sheng Li

from arxiv, 12 pages, 6 figures

Recommendation systems aim to provide users with relevant suggestions, but often lack interpretability and fail to capture higher-level semantic relationships between user behaviors and profiles. In this paper, we propose a novel approach that leverages large language models (LLMs) to construct personalized reasoning graphs. These graphs link a user's profile and behavioral sequences through causal and logical inferences, representing the user's interests in an interpretable way. Our approach, LLM reasoning graphs (LLMRG), has four components: chained graph reasoning, divergent extension, self-verification and scoring, and knowledge base self-improvement. The resulting reasoning graph is encoded using graph neural networks, which serves as additional input to improve conventional recommender systems, without requiring extra user or item information. Our approach demonstrates how LLMs can enable more logical and interpretable recommender systems through personalized reasoning graphs. LLMRG allows recommendations to benefit from both engineered recommendation systems and LLM-derived reasoning graphs. We demonstrate the effectiveness of LLMRG on benchmarks and real-world scenarios in enhancing base recommendation models.

Guidance · Continuity · 策略改進 · 控制器 · Boosting（一種模型訓練加速方式） ·

2024 年 1 月 24 日

Boosting Continuous Control with Consistency Policy

Yuhui Chen,Haoran Li,Dongbin Zhao

from arxiv, 18 pages, 9 pages

Due to its training stability and strong expression, the diffusion model has attracted considerable attention in offline reinforcement learning. However, several challenges have also come with it: 1) The demand for a large number of diffusion steps makes the diffusion-model-based methods time inefficient and limits their applications in real-time control; 2) How to achieve policy improvement with accurate guidance for diffusion model-based policy is still an open problem. Inspired by the consistency model, we propose a novel time-efficiency method named Consistency Policy with Q-Learning (CPQL), which derives action from noise by a single step. By establishing a mapping from the reverse diffusion trajectories to the desired policy, we simultaneously address the issues of time efficiency and inaccurate guidance when updating diffusion model-based policy with the learned Q-function. We demonstrate that CPQL can achieve policy improvement with accurate guidance for offline reinforcement learning, and can be seamlessly extended for online RL tasks. Experimental results indicate that CPQL achieves new state-of-the-art performance on 11 offline and 21 online tasks, significantly improving inference speed by nearly 45 times compared to Diffusion-QL. We will release our code later.

3D · 控制器 · INFORMS · 分離的 · 分解的 ·

2024 年 1 月 24 日

Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects

Yunfan Zhang,Hong Huang,Zhiwei Xiong,Zhiqi Shen,Guosheng Lin,Hao Wang,Nicholas Vun

Controllable 3D indoor scene synthesis stands at the forefront of technological progress, offering various applications like gaming, film, and augmented/virtual reality. The capability to stylize and de-couple objects within these scenarios is a crucial factor, providing an advanced level of control throughout the editing process. This control extends not just to manipulating geometric attributes like translation and scaling but also includes managing appearances, such as stylization. Current methods for scene stylization are limited to applying styles to the entire scene, without the ability to separate and customize individual objects. Addressing the intricacies of this challenge, we introduce a unique pipeline designed for synthesis 3D indoor scenes. Our approach involves strategically placing objects within the scene, utilizing information from professionally designed bounding boxes. Significantly, our pipeline prioritizes maintaining style consistency across multiple objects within the scene, ensuring a cohesive and visually appealing result aligned with the desired aesthetic. The core strength of our pipeline lies in its ability to generate 3D scenes that are not only visually impressive but also exhibit features like photorealism, multi-view consistency, and diversity. These scenes are crafted in response to various natural language prompts, demonstrating the versatility and adaptability of our model.

PAC學習理論 · 通道 · 離散化 · 解碼 · Learning ·

2024 年 1 月 24 日

PAC Learnability for Reliable Communication over Discrete Memoryless Channels

Jiakun Liu,Wenyi Zhang,H. Vincent Poor

from arxiv, 10 pages, 4 figures

In practical communication systems, knowledge of channel models is often absent, and consequently, transceivers need be designed based on empirical data. In this work, we study data-driven approaches to reliably choosing decoding metrics and code rates that facilitate reliable communication over unknown discrete memoryless channels (DMCs). Our analysis is inspired by the PAC learning theory and does not rely on any assumptions on the statistical characteristics of DMCs. We show that a naive plug-in algorithm for choosing decoding metrics is likely to fail for finite training sets. We propose an alternative algorithm called the virtual sample algorithm and establish a non-asymptotic lower bound on its performance. The virtual sample algorithm is then used as a building block for constructing a learning algorithm that chooses a decoding metric and a code rate using which a transmitter and a receiver can reliably communicate at a rate arbitrarily close to the channel mutual information. Therefore, we conclude that DMCs are PAC learnable.

圖形處理器 · 圖 · Networking · Neural Networks · 相似度 ·

2024 年 1 月 23 日

Probabilistic Demand Forecasting with Graph Neural Networks

Nikita Kozodoi,Elizaveta Zinovyeva,Simon Valentin,Jo?o Pereira,Rodrigo Agundez

from arxiv, Preprint of the paper accepted to ECML PKDD 2023 ML4ITS Workshop

Demand forecasting is a prominent business use case that allows retailers to optimize inventory planning, logistics, and core business decisions. One of the key challenges in demand forecasting is accounting for relationships and interactions between articles. Most modern forecasting approaches provide independent article-level predictions that do not consider the impact of related articles. Recent research has attempted addressing this challenge using Graph Neural Networks (GNNs) and showed promising results. This paper builds on previous research on GNNs and makes two contributions. First, we integrate a GNN encoder into a state-of-the-art DeepAR model. The combined model produces probabilistic forecasts, which are crucial for decision-making under uncertainty. Second, we propose to build graphs using article attribute similarity, which avoids reliance on a pre-defined graph structure. Experiments on three real-world datasets show that the proposed approach consistently outperforms non-graph benchmarks. We also show that our approach produces article embeddings that encode article similarity and demand dynamics and are useful for other downstream business tasks beyond forecasting.

INFORMS · 可交換的 · Engineering · MoDELS · Continuity ·

2024 年 1 月 23 日

Engineering Yeast Cells to Facilitate Information Exchange

Nikolaos Ntetsikas,Styliana Kyriakoudi,Antonis Kirmizis,Bige Deniz Unluturk,Andreas Pitsillides,Ian F. Akyildiz,Marios Lestas

from arxiv, 18 pages, 9 figures (2 of which are not colored) all .png, recently accepted for publication at TMBMC

Although continuous advances in theoretical modelling of Molecular Communications (MC) are observed, there is still an insuperable gap between theory and experimental testbeds, especially at the microscale. In this paper, the development of the first testbed incorporating engineered yeast cells is reported. Different from the existing literature, eukaryotic yeast cells are considered for both the sender and the receiver, with {\alpha}-factor molecules facilitating the information transfer. The use of such cells is motivated mainly by the well understood biological mechanism of yeast mating, together with their genetic amenability. In addition, recent advances in yeast biosensing establish yeast as a suitable detector and a neat interface to in-body sensor networks. The system under consideration is presented first, and the mathematical models of the underlying biological processes leading to an end-to-end (E2E) system are given. The experimental setup is then described and used to obtain experimental results which validate the developed mathematical models. Beyond that, the ability of the system to effectively generate output pulses in response to repeated stimuli is demonstrated, reporting one event per two hours. However, fast RNA fluctuations indicate cell responses in less than three minutes, demonstrating the potential for much higher rates in the future.

估計/估計量 · 代價 · 可辨認的 · 蒙特卡洛樹搜索 · Performer ·

2024 年 1 月 23 日

Personalized Algorithmic Recourse with Preference Elicitation

Giovanni De Toni,Paolo Viappiani,Stefano Teso,Bruno Lepri,Andrea Passerini

from arxiv, Published in Transactions in Machine Learning Research (TMLR), January 2024. See //openreview.net/forum?id=8sg2I9zXgO for the official submission

Algorithmic Recourse (AR) is the problem of computing a sequence of actions that -- once performed by a user -- overturns an undesirable machine decision. It is paramount that the sequence of actions does not require too much effort for users to implement. Yet, most approaches to AR assume that actions cost the same for all users, and thus may recommend unfairly expensive recourse plans to certain users. Prompted by this observation, we introduce PEAR, the first human-in-the-loop approach capable of providing personalized algorithmic recourse tailored to the needs of any end-user. PEAR builds on insights from Bayesian Preference Elicitation to iteratively refine an estimate of the costs of actions by asking choice set queries to the target user. The queries themselves are computed by maximizing the Expected Utility of Selection, a principled measure of information gain accounting for uncertainty on both the cost estimate and the user's responses. PEAR integrates elicitation into a Reinforcement Learning agent coupled with Monte Carlo Tree Search to quickly identify promising recourse plans. Our empirical evaluation on real-world datasets highlights how PEAR produces high-quality personalized recourse in only a handful of iterations.

Attention · 可約的 · 掩碼 · Extensibility · SOTA ·

2024 年 1 月 23 日

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Siyu Zou,Jiji Tang,Yiyi Zhou,Jing He,Chaoyi Zhao,Rongsheng Zhang,Zhipeng Hu,Xiaoshuai Sun

from arxiv, Accepted by AAAI2024

Diffusion-based Image Editing (DIE) is an emerging research hot-spot, which often applies a semantic mask to control the target area for diffusion-based editing. However, most existing solutions obtain these masks via manual operations or off-line processing, greatly reducing their efficiency. In this paper, we propose a novel and efficient image editing method for Text-to-Image (T2I) diffusion models, termed Instant Diffusion Editing(InstDiffEdit). In particular, InstDiffEdit aims to employ the cross-modal attention ability of existing diffusion models to achieve instant mask guidance during the diffusion steps. To reduce the noise of attention maps and realize the full automatics, we equip InstDiffEdit with a training-free refinement scheme to adaptively aggregate the attention distributions for the automatic yet accurate mask generation. Meanwhile, to supplement the existing evaluations of DIE, we propose a new benchmark called Editing-Mask to examine the mask accuracy and local editing ability of existing methods. To validate InstDiffEdit, we also conduct extensive experiments on ImageNet and Imagen, and compare it with a bunch of the SOTA methods. The experimental results show that InstDiffEdit not only outperforms the SOTA methods in both image quality and editing results, but also has a much faster inference speed, i.e., +5 to +6 times.

情感分析 · entity · 門控 · 卷積 · MoDELS ·

2018 年 5 月 18 日

Aspect Based Sentiment Analysis with Gated Convolutional Networks

from arxiv, Accepted in ACL 2018

Aspect based sentiment analysis (ABSA) can provide more detailed information than general sentiment analysis, because it aims to predict the sentiment polarities of the given aspects or entities in text. We summarize previous approaches into two subtasks: aspect-category sentiment analysis (ACSA) and aspect-term sentiment analysis (ATSA). Most previous approaches employ long short-term memory and attention mechanisms to predict the sentiment polarity of the concerned targets, which are often complicated and need more training time. We propose a model based on convolutional neural networks and gating mechanisms, which is more accurate and efficient. First, the novel Gated Tanh-ReLU Units can selectively output the sentiment features according to the given aspect or entity. The architecture is much simpler than attention layer used in the existing models. Second, the computations of our model could be easily parallelized during training, because convolutional layers do not have time dependency as in LSTM layers, and gating units also work independently. The experiments on SemEval datasets demonstrate the efficiency and effectiveness of our models.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

估計/估計量

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<dir id='7w5m3'><del id='7w5m3'><del id='7w5m3'></del><pre id='7w5m3'><pre id='7w5m3'><option id='7w5m3'><address id='7w5m3'></address><bdo id='7w5m3'><tr id='7w5m3'><acronym id='7w5m3'><pre id='7w5m3'></pre></acronym><div id='7w5m3'></div></tr></bdo></option></pre><small id='7w5m3'><address id='7w5m3'><u id='7w5m3'><legend id='7w5m3'><option id='7w5m3'><abbr id='7w5m3'></abbr><li id='7w5m3'><pre id='7w5m3'></pre></li></option></legend><select id='7w5m3'></select></u></address></small></pre></del><sup id='7w5m3'></sup><blockquote id='7w5m3'><dt id='7w5m3'></dt></blockquote><blockquote id='7w5m3'></blockquote></dir><tt id='7w5m3'></tt><u id='7w5m3'><tt id='7w5m3'><form id='7w5m3'></form></tt><td id='7w5m3'><dt id='7w5m3'></dt></td></u>

<code id='7w5m3'><i id='7w5m3'><q id='7w5m3'><legend id='7w5m3'><pre id='7w5m3'><style id='7w5m3'><acronym id='7w5m3'><i id='7w5m3'><form id='7w5m3'><option id='7w5m3'><center id='7w5m3'></center></option></form></i></acronym></style><tt id='7w5m3'></tt></pre></legend></q></i></code><center id='7w5m3'></center>

<dd id='7w5m3'></dd>

<style id='7w5m3'></style><sub id='7w5m3'><dfn id='7w5m3'><abbr id='7w5m3'><big id='7w5m3'><bdo id='7w5m3'></bdo></big></abbr></dfn></sub>_{<dir id='7w5m3'></dir>}