
Multi-armed bandits · Optimizer · Learning · Reinforcement learning · Weight

March 20, 2024

Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation

Do June Min, Veronica Perez-Rosas, Kenneth Resnicow, Rada Mihalcea

In this paper, we study the problem of multi-reward reinforcement learning to jointly optimize for multiple text qualities for natural language generation. We focus on the task of counselor reflection generation, where we optimize the generators to simultaneously improve the fluency, coherence, and reflection quality of generated counselor responses. We introduce two novel bandit methods, DynaOpt and C-DynaOpt, which rely on the broad strategy of combining rewards into a single value and optimizing them simultaneously. Specifically, we employ non-contextual and contextual multi-arm bandits to dynamically adjust multiple reward weights during training. Through automatic and manual evaluations, we show that our proposed techniques, DynaOpt and C-DynaOpt, outperform existing naive and bandit baselines, showcasing their potential for enhancing language models.
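The idea of letting a non-contextual bandit set the reward weights can be sketched roughly as follows. This is not the authors' DynaOpt implementation: it is a generic Exp3 bandit, and the class name `Exp3RewardWeights`, the exploration rate `gamma`, the use of arm probabilities as mixing weights, and the definition of the bandit `gain` are all assumptions made for illustration.

```python
import math
import random


class Exp3RewardWeights:
    """Generic Exp3 bandit with one arm per reward component (illustrative only)."""

    def __init__(self, n_rewards, gamma=0.1, seed=0):
        self.n = n_rewards
        self.gamma = gamma                      # exploration rate (assumed value)
        self.log_weights = [0.0] * n_rewards    # log-space weights for stability
        self.rng = random.Random(seed)

    def probabilities(self):
        """Exp3 arm distribution: exponential weights mixed with uniform exploration."""
        m = max(self.log_weights)
        exp_w = [math.exp(w - m) for w in self.log_weights]
        total = sum(exp_w)
        return [(1 - self.gamma) * e / total + self.gamma / self.n for e in exp_w]

    def combine(self, rewards):
        """Collapse several rewards into one scalar, using arm probabilities as weights."""
        return sum(p * r for p, r in zip(self.probabilities(), rewards))

    def sample_arm(self):
        """Pick the reward component to emphasize next."""
        return self.rng.choices(range(self.n), weights=self.probabilities(), k=1)[0]

    def update(self, arm, gain):
        """Standard Exp3 update; `gain` must lie in [0, 1] (its definition is assumed)."""
        estimated = gain / self.probabilities()[arm]   # importance-weighted gain
        self.log_weights[arm] += self.gamma * estimated / self.n


bandit = Exp3RewardWeights(n_rewards=3)
rewards = [0.8, 0.5, 0.2]          # made-up fluency, coherence, reflection scores
scalar_reward = bandit.combine(rewards)
arm = bandit.sample_arm()
bandit.update(arm, gain=0.6)       # e.g. a normalized improvement on a validation set
```

In a training loop, `scalar_reward` would feed the policy-gradient update, while `update` would be called periodically with whatever improvement signal is chosen as the bandit gain.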

Related content

Multi-armed bandits

Learning · Networking · Episode · Reward function · AIM

May 2, 2024

Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation

Satoshi Yamamori, Jun Morimoto

from arxiv, 11 pages, 6 figures

In this study, we propose a multitask reinforcement learning algorithm for foundational policy acquisition to generate novel motor skills. Inspired by human sensorimotor adaptation mechanisms, we aim to train encoder-decoder networks that can be commonly used to learn novel motor skills in a single movement category. To train the policy network, we develop the multitask reinforcement learning method, where the policy needs to cope with changes in goals or environments with different reward functions or physical parameters of the environment in dynamic movement generation tasks. Here, as a concrete task, we evaluated the proposed method with the ball heading task using a monopod robot model. The results showed that the proposed method could adapt to novel target positions or inexperienced ball restitution coefficients. Furthermore, we demonstrated that the acquired foundational policy network, originally learned for heading motion, can be used to generate an entirely new overhead kicking skill.

Graph · Learning · Online · Binary · Concept class

May 1, 2024

Efficient Algorithms for Learning Monophonic Halfspaces in Graphs

Marco Bressan, Emmanuel Esposito, Maximilian Thiessen

We study the problem of learning a binary classifier on the vertices of a graph. In particular, we consider classifiers given by monophonic halfspaces, partitions of the vertices that are convex in a certain abstract sense. Monophonic halfspaces, and related notions such as geodesic halfspaces, have recently attracted interest, and several connections have been drawn between their properties (e.g., their VC dimension) and the structure of the underlying graph $G$. We prove several novel results for learning monophonic halfspaces in the supervised, online, and active settings. Our main result is that a monophonic halfspace can be learned with near-optimal passive sample complexity in time polynomial in $n = |V(G)|$. This requires us to devise a polynomial-time algorithm for consistent hypothesis checking, based on several structural insights on monophonic halfspaces and on a reduction to $2$-satisfiability. We prove similar results for the online and active settings. We also show that the concept class can be enumerated with delay $\operatorname{poly}(n)$, and that empirical risk minimization can be performed in time $2^{\omega(G)}\operatorname{poly}(n)$ where $\omega(G)$ is the clique number of $G$. These results answer open questions from the literature (González et al., 2020), and show a contrast with geodesic halfspaces, for which some of the said problems are NP-hard (Seiffarth et al., 2023).

Outliers · MoDELS · Metric learning · Softmax · Learning

May 1, 2024

Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure

Assefa Seyoum Wahd

In this paper, we present a novel approach that combines deep metric learning and synthetic data generation using diffusion models for out-of-distribution (OOD) detection. One popular approach for OOD detection is outlier exposure, where models are trained using a mixture of in-distribution (ID) samples and "seen" OOD samples. For the OOD samples, the model is trained to minimize the KL divergence between the output probability and the uniform distribution while correctly classifying the in-distribution (ID) data. In this paper, we propose a label-mixup approach to generate synthetic OOD data using Denoising Diffusion Probabilistic Models (DDPMs). Additionally, we explore recent advancements in metric learning to train our models. In the experiments, we found that metric learning-based loss functions perform better than the softmax. Furthermore, the baseline models (including softmax and metric learning) show a significant improvement when trained with the generated OOD data. Our approach outperforms strong baselines in conventional OOD detection metrics.
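The outlier exposure objective described above can be sketched in plain Python as follows. This is only an illustration of the general idea, not the paper's training code: the function names, the mixing coefficient `lam`, and the choice of KL direction (uniform to prediction) are assumptions.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - lse for z in logits]


def cross_entropy(logits, label):
    """Standard classification loss for one in-distribution sample."""
    return -log_softmax(logits)[label]


def kl_uniform_to_pred(logits):
    """KL(U || p) between the uniform distribution U and the softmax output p."""
    k = len(logits)
    return -math.log(k) - sum(log_softmax(logits)) / k


def outlier_exposure_loss(id_batch, ood_batch, lam=0.5):
    """Mean ID cross-entropy plus lam times the mean OOD uniformity penalty.

    id_batch: list of (logits, label) pairs; ood_batch: list of logits.
    lam is a hypothetical trade-off weight.
    """
    id_loss = sum(cross_entropy(z, y) for z, y in id_batch) / len(id_batch)
    ood_loss = sum(kl_uniform_to_pred(z) for z in ood_batch) / len(ood_batch)
    return id_loss + lam * ood_loss
```

The OOD term is zero exactly when the model predicts a uniform distribution on an OOD sample, and grows as the prediction becomes more confident.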

Networking · Estimation/estimator · Few-shot learning · Learning · Performer

May 1, 2024

A Few-Shot Learning Approach for Sound Source Distance Estimation Using Relation Networks

Amirreza Sobhdel, Roozbeh Razavi-Far

In this paper, we study the performance of few-shot learning, specifically meta learning empowered few-shot relation networks, over supervised deep learning and conventional machine learning approaches in the problem of Sound Source Distance Estimation (SSDE). In previous research on deep supervised SSDE, low accuracies have often resulted from the mismatch between the training data (from known environments) and the test data (from unknown environments). By performing comparative experiments on a sufficient amount of data, we show that the few-shot relation network outperforms other competitors including eXtreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Convolutional Neural Network (CNN), and MultiLayer Perceptron (MLP). Hence it is possible to calibrate a microphone-equipped system, with a few labeled samples of audio recorded in a particular unknown environment to adjust and generalize our classifier to the possible input data and gain higher accuracies.

Optimizer · Reducible · Microsoft Surface · Design · Greedy

April 30, 2024

A Joint Communication and Computation Design for Distributed RISs Assisted Probabilistic Semantic Communication in IIoT

Zhouxiang Zhao, Zhaohui Yang, Chongwen Huang, Li Wei, Qianqian Yang, Caijun Zhong, Wei Xu, Zhaoyang Zhang

In this paper, the problem of spectral-efficient communication and computation resource allocation for distributed reconfigurable intelligent surfaces (RISs) assisted probabilistic semantic communication (PSC) in industrial Internet-of-Things (IIoT) is investigated. In the considered model, multiple RISs are deployed to serve multiple users, while PSC adopts a compute-then-transmit protocol to reduce the transmission data size. To support high-rate transmission, the semantic compression ratio, transmit power allocation, and distributed RISs deployment must be jointly considered. This joint communication and computation problem is formulated as an optimization problem whose goal is to maximize the sum semantic-aware transmission rate of the system under total transmit power, phase shift, RIS-user association, and semantic compression ratio constraints. To solve this problem, a many-to-many matching scheme is proposed for the RIS-user association subproblem, the semantic compression ratio subproblem is addressed following a greedy policy, and the phase shift of the RISs is optimized using tensor-based beamforming. Numerical results verify the superiority of the proposed algorithm.

Functional · MoDELS · Discretization · Optimizer · INFORMS

April 30, 2024

Towards Accurate Post-training Quantization for Diffusion Models

Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie Zhou, Jiwen Lu

In this paper, we propose an accurate data-free post-training quantization framework of diffusion models (ADP-DM) for efficient image generation. Conventional data-free quantization methods learn shared quantization functions for tensor discretization regardless of the generation timesteps, while the activation distribution differs significantly across various timesteps. The calibration images are acquired at random timesteps, which fail to provide sufficient information for generalizable quantization function learning. Both issues cause sizable quantization errors with obvious image generation performance degradation. In contrast, we design group-wise quantization functions for activation discretization in different timesteps and sample the optimal timestep for informative calibration image generation, so that our quantized diffusion model can reduce the discretization errors with negligible computational overhead. Specifically, we partition the timesteps according to the importance weights of quantization functions in different groups, which are optimized by differentiable search algorithms. We also select the optimal timestep for calibration image generation by the structural risk minimization principle in order to enhance the generalization ability in the deployment of the quantized diffusion model. Extensive experimental results show that our method outperforms state-of-the-art post-training quantization methods for diffusion models by a sizable margin with similar computational cost.
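The contrast between one shared quantization function and group-wise functions per timestep range can be illustrated with a minimal sketch. It is not the ADP-DM method: the paper learns the timestep partition by differentiable search, whereas this sketch assumes fixed, hand-chosen `boundaries` and a plain min-max uniform quantizer; all function names are hypothetical.

```python
import bisect


def fit_quantizer(values, n_bits=8):
    """Fit a min-max uniform quantizer; returns (offset, scale, max_level)."""
    lo, hi = min(values), max(values)
    max_level = 2 ** n_bits - 1
    scale = (hi - lo) / max_level if hi > lo else 1.0
    return lo, scale, max_level


def quantize(x, params):
    """Snap x to the nearest grid point (clamped) and return the dequantized value."""
    lo, scale, max_level = params
    level = round((x - lo) / scale)
    level = max(0, min(max_level, level))
    return lo + level * scale


def group_of(timestep, boundaries):
    """Index of the timestep group, given sorted group boundaries (assumed fixed)."""
    return bisect.bisect_right(boundaries, timestep)


def fit_groupwise(calibration, boundaries, n_bits=8):
    """Fit one quantizer per timestep group.

    calibration: list of (timestep, activation value) pairs.
    Groups with no calibration data get None.
    """
    groups = [[] for _ in range(len(boundaries) + 1)]
    for t, v in calibration:
        groups[group_of(t, boundaries)].append(v)
    return [fit_quantizer(g, n_bits) if g else None for g in groups]
```

Because each group fits its own range, activations from timesteps with a narrow distribution are not forced onto a grid stretched to cover the widest one.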

Multimodal · Prompt · CASES · Learning · MoDELS

March 6, 2023

Multimodal Prompting with Missing Modalities for Visual Recognition

Yi-Lun Lee, Yi-Hsuan Tsai, Wei-Chen Chiu, Chen-Yu Lee

from arxiv, Accepted by CVPR 2023

In this paper, we tackle two challenges in multimodal learning for visual recognition: 1) when missing-modality occurs either during training or testing in real-world situations; and 2) when the computation resources are not available to finetune on heavy transformer models. To this end, we propose to utilize prompt learning and mitigate the above two challenges together. Specifically, our modality-missing-aware prompts can be plugged into multimodal transformers to handle general missing-modality cases, while only requiring less than 1% learnable parameters compared to training the entire model. We further explore the effect of different prompt configurations and analyze the robustness to missing modality. Extensive experiments are conducted to show the effectiveness of our prompt learning framework that improves the performance under various missing-modality cases, while alleviating the requirement of heavy model re-training. Code is available.

Performer · Extensibility · Federated learning · Similarity · Pairwise

January 7, 2021

Personalized Cross-Silo Federated Learning on Non-IID Data

Yutao Huang, Lingyang Chu, Zirui Zhou, Lanjun Wang, Jiangchuan Liu, Jian Pei, Yong Zhang

from arxiv, Accepted by AAAI 2021. The API of this work is available at Huawei Cloud (//t.ly/nGN9), free registration is required before use

Non-IID data present a tough challenge for federated learning. In this paper, we explore a novel idea of facilitating pairwise collaborations between clients with similar data. We propose FedAMP, a new method employing federated attentive message passing to facilitate similar clients to collaborate more. We establish the convergence of FedAMP for both convex and non-convex models, and propose a heuristic method to further improve the performance of FedAMP when clients adopt deep neural networks as personalized models. Our extensive experiments on benchmark data sets demonstrate the superior performance of the proposed methods.

Attention mechanism · Machine reading comprehension · Extensibility · state-of-the-art · MoDELS

April 25, 2018

Reinforced Mnemonic Reader for Machine Reading Comprehension

Minghao Hu, Yuxing Peng, Zhen Huang, Xipeng Qiu, Furu Wei, Ming Zhou

from arxiv, Published in 26th International Joint Conference on Artificial Intelligence (IJCAI), 2018

In this paper, we introduce the Reinforced Mnemonic Reader for machine reading comprehension tasks, which enhances previous attentive readers in two aspects. First, a reattention mechanism is proposed to refine current attentions by directly accessing past attentions that are temporally memorized in a multi-round alignment architecture, so as to avoid the problems of attention redundancy and attention deficiency. Second, a new optimization approach, called dynamic-critical reinforcement learning, is introduced to extend the standard supervised method. It always encourages the model to predict a more acceptable answer so as to address the convergence suppression problem that occurs in traditional reinforcement learning algorithms. Extensive experiments on the Stanford Question Answering Dataset (SQuAD) show that our model achieves state-of-the-art results. Meanwhile, our model outperforms previous systems by over 6% in terms of both Exact Match and F1 metrics on two adversarial SQuAD datasets.

MoDELS · Attention mechanism · RNN · Annotation · Networking

December 20, 2017

Order-Free RNN with Visual Attention for Multi-Label Classification

Shang-Fu Chen, Yi-Chen Chen, Chih-Kuan Yeh, Yu-Chiang Frank Wang

from arxiv, Accepted at 32nd AAAI Conference on Artificial Intelligence (AAAI-18)

In this paper, we propose the joint learning attention and recurrent neural network (RNN) models for multi-label classification. While approaches based on the use of either model exist (e.g., for the task of image captioning), training such existing network architectures typically requires pre-defined label sequences. For multi-label classification, it would be desirable to have a robust inference process, so that the prediction error would not propagate and thus affect the performance. Our proposed model uniquely integrates attention and Long Short Term Memory (LSTM) models, which not only addresses the above problem but also allows one to identify visual objects of interest with varying sizes without the prior knowledge of particular label ordering. More importantly, label co-occurrence information can be jointly exploited by our LSTM model. Finally, by advancing the technique of beam search, prediction of multiple labels can be efficiently achieved by our proposed network model.
