亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<li id='rex75'></li>

_{^{<dd id='rex75'><tbody id='rex75'><td id='rex75'><optgroup id='rex75'><strong id='rex75'></strong></optgroup><address id='rex75'><ul id='rex75'></ul></address><big id='rex75'></big></td><table id='rex75'></table></tbody><pre id='rex75'></pre></dd><span id='rex75'><b id='rex75'></b></span>}}


<dfn id='rex75'><optgroup id='rex75'></optgroup></dfn><tfoot id='rex75'><bdo id='rex75'><div id='rex75'></div><i id='rex75'><dt id='rex75'></dt></i></bdo></tfoot>

_{<fieldset id='rex75'></fieldset>}

·

語言模型化 · 大語言模型 · JSON · MoDELS · 可約的 ·

2024 年 4 月 21 日

Guiding Large Language Models to Generate Computer-Parsable Content

from arxiv, 44 pages, 39 figures, 8 tables, Chinese version: //chinaxiv.org/abs/202403.00340

We propose a method to guide Large Language Models (LLMs) in generating structured content adhering to specific conventions without fine-tuning. By utilizing coroutine-based content generation constraints through a pre-agreed context-free grammar (CFG), LLMs are directed during decoding to produce formal language compliant outputs. This enhances stability and consistency in generating target data structures, types, or instructions, reducing application development complexities. Experimentally, error rates of GPT-2 and Gemma exceed 95% for DSLs longer than 36 and 282 tokens, respectively. We introduce YieldLang, a coroutine-based DSL generation framework, and evaluate it with LLMs on various tasks including JSON and Mermaid flowchart generation. Compared to benchmarks, our approach improves accuracy by 1.09 to 11.6 times, with LLMs requiring only about 16.5% of the samples to generate JSON effectively. This enhances usability of LLM-generated content for computer programs.

相關內容

語言模型化

語言模型化

穩健性 · 大語言模型 · 自動問答 · MoDELS · INFORMS ·

2024 年 6 月 3 日

Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering

Tobias Schimanski,Jingwei Ni,Mathias Kraus,Elliott Ash,Markus Leippold

Advances towards more faithful and traceable answers of Large Language Models (LLMs) are crucial for various research and practical endeavors. One avenue in reaching this goal is basing the answers on reliable sources. However, this Evidence-Based QA has proven to work insufficiently with LLMs in terms of citing the correct sources (source quality) and truthfully representing the information within sources (answer attributability). In this work, we systematically investigate how to robustly fine-tune LLMs for better source quality and answer attributability. Specifically, we introduce a data generation pipeline with automated data quality filters, which can synthesize diversified high-quality training and testing data at scale. We further introduce four test sets to benchmark the robustness of fine-tuned specialist models. Extensive evaluation shows that fine-tuning on synthetic data improves performance on both in- and out-of-distribution. Furthermore, we show that data quality, which can be drastically improved by proposed quality filters, matters more than quantity in improving Evidence-Based QA.

PCA · cancer · RetinaNet · MoDELS · 訓練數據 ·

2024 年 6 月 3 日

Domain Transfer Through Image-to-Image Translation for Uncertainty-Aware Prostate Cancer Classification

Meng Zhou,Amoon Jamzad,Jason Izard,Alexandre Menard,Robert Siemens,Parvin Mousavi

from arxiv, Preprint. In Submission

Prostate Cancer (PCa) is a prevalent disease among men, and multi-parametric MRIs offer a non-invasive method for its detection. While MRI-based deep learning solutions have shown promise in supporting PCa diagnosis, acquiring sufficient training data, particularly in local clinics remains challenging. One potential solution is to take advantage of publicly available datasets to pre-train deep models and fine-tune them on the local data, but multi-source MRIs can pose challenges due to cross-domain distribution differences. These limitations hinder the adoption of explainable and reliable deep-learning solutions in local clinics for PCa diagnosis. In this work, we present a novel approach for unpaired image-to-image translation of prostate multi-parametric MRIs and an uncertainty-aware training approach for classifying clinically significant PCa, to be applied in data-constrained settings such as local and small clinics. Our approach involves a novel pipeline for translating unpaired 3.0T multi-parametric prostate MRIs to 1.5T, thereby augmenting the available training data. Additionally, we introduce an evidential deep learning approach to estimate model uncertainty and employ dataset filtering techniques during training. Furthermore, we propose a simple, yet efficient Evidential Focal Loss, combining focal loss with evidential uncertainty, to train our model effectively. Our experiments demonstrate that the proposed method significantly improves the Area Under ROC Curve (AUC) by over 20% compared to the previous work. Our code is available at //github.com/med-i-lab/DT_UE_PCa

類別 · 知識 (knowledge) · tuning · 超參數 · Processing（編程語言） ·

2024 年 6 月 3 日

A Practical Approach to Novel Class Discovery in Tabular Data

Colin Troisemaine,Alexandre Reiffers-Masson,Stéphane Gosselin,Vincent Lemaire,Sandrine Vaton

from arxiv, 30 pages, including 7 pages of annexes

The problem of Novel Class Discovery (NCD) consists in extracting knowledge from a labeled set of known classes to accurately partition an unlabeled set of novel classes. While NCD has recently received a lot of attention from the community, it is often solved on computer vision problems and under unrealistic conditions. In particular, the number of novel classes is usually assumed to be known in advance, and their labels are sometimes used to tune hyperparameters. Methods that rely on these assumptions are not applicable in real-world scenarios. In this work, we focus on solving NCD in tabular data when no prior knowledge of the novel classes is available. To this end, we propose to tune the hyperparameters of NCD methods by adapting the $k$-fold cross-validation process and hiding some of the known classes in each fold. Since we have found that methods with too many hyperparameters are likely to overfit these hidden classes, we define a simple deep NCD model. This method is composed of only the essential elements necessary for the NCD problem and performs impressively well under realistic conditions. Furthermore, we find that the latent space of this method can be used to reliably estimate the number of novel classes. Additionally, we adapt two unsupervised clustering algorithms ($k$-means and Spectral Clustering) to leverage the knowledge of the known classes. Extensive experiments are conducted on 7 tabular datasets and demonstrate the effectiveness of the proposed method and hyperparameter tuning process, and show that the NCD problem can be solved without relying on knowledge from the novel classes.

多樣性 · 回合 · Learning · Processing（編程語言） · Agent ·

2024 年 6 月 3 日

Policy Dispersion in Non-Markovian Environment

Bohao Qu,Xiaofeng Cao,Jielong Yang,Hechang Chen,Chang Yi,Ivor W. Tsang,Yew-Soon Ong

from arxiv, In further research, we found that the core content of the paper requires significant modification and that the entire paper needs to be restructured. To enhance the scientific quality and contributions of the paper, we have decided to resubmit it after completing the necessary revisions and improvements

Markov Decision Process (MDP) presents a mathematical framework to formulate the learning processes of agents in reinforcement learning. MDP is limited by the Markovian assumption that a reward only depends on the immediate state and action. However, a reward sometimes depends on the history of states and actions, which may result in the decision process in a non-Markovian environment. In such environments, agents receive rewards via temporally-extended behaviors sparsely, and the learned policies may be similar. This leads the agents acquired with similar policies generally overfit to the given task and can not quickly adapt to perturbations of environments. To resolve this problem, this paper tries to learn the diverse policies from the history of state-action pairs under a non-Markovian environment, in which a policy dispersion scheme is designed for seeking diverse policy representation. Specifically, we first adopt a transformer-based method to learn policy embeddings. Then, we stack the policy embeddings to construct a dispersion matrix to induce a set of diverse policies. Finally, we prove that if the dispersion matrix is positive definite, the dispersed embeddings can effectively enlarge the disagreements across policies, yielding a diverse expression for the original policy embedding distribution. Experimental results show that this dispersion scheme can obtain more expressive diverse policies, which then derive more robust performance than recent learning baselines under various learning environments.

亞馬遜AWS · AI · 可約的 · ML · CASE ·

2024 年 5 月 31 日

AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research

Riley Simmons-Edler,Ryan Badman,Shayne Longpre,Kanaka Rajan

from arxiv, 9 pages, 1 figure, in ICML 2024

The recent embrace of machine learning (ML) in the development of autonomous weapons systems (AWS) creates serious risks to geopolitical stability and the free exchange of ideas in AI research. This topic has received comparatively little attention of late compared to risks stemming from superintelligent artificial general intelligence (AGI), but requires fewer assumptions about the course of technological development and is thus a nearer-future issue. ML is already enabling the substitution of AWS for human soldiers in many battlefield roles, reducing the upfront human cost, and thus political cost, of waging offensive war. In the case of peer adversaries, this increases the likelihood of "low intensity" conflicts which risk escalation to broader warfare. In the case of non-peer adversaries, it reduces the domestic blowback to wars of aggression. This effect can occur regardless of other ethical issues around the use of military AI such as the risk of civilian casualties, and does not require any superhuman AI capabilities. Further, the military value of AWS raises the specter of an AI-powered arms race and the misguided imposition of national security restrictions on AI research. Our goal in this paper is to raise awareness among the public and ML researchers on the near-future risks posed by full or near-full autonomy in military technology, and we provide regulatory suggestions to mitigate these risks. We call upon AI policy experts and the defense AI community in particular to embrace transparency and caution in their development and deployment of AWS to avoid the negative effects on global stability and AI research that we highlight here.

知識 (knowledge) · Networking · Boosting（一種模型訓練加速方式） · 求逆 · Extensibility ·

2024 年 5 月 31 日

GI-NAS: Boosting Gradient Inversion Attacks through Adaptive Neural Architecture Search

Wenbo Yu,Hao Fang,Bin Chen,Xiaohang Sui,Chuan Chen,Hao Wu,Shu-Tao Xia,Ke Xu

Gradient Inversion Attacks invert the transmitted gradients in Federated Learning (FL) systems to reconstruct the sensitive data of local clients and have raised considerable privacy concerns. A majority of gradient inversion methods rely heavily on explicit prior knowledge (e.g., a well pre-trained generative model), which is often unavailable in realistic scenarios. To alleviate this issue, researchers have proposed to leverage the implicit prior knowledge of an over-parameterized network. However, they only utilize a fixed neural architecture for all the attack settings. This would hinder the adaptive use of implicit architectural priors and consequently limit the generalizability. In this paper, we further exploit such implicit prior knowledge by proposing Gradient Inversion via Neural Architecture Search (GI-NAS), which adaptively searches the network and captures the implicit priors behind neural architectures. Extensive experiments verify that our proposed GI-NAS can achieve superior attack performance compared to state-of-the-art gradient inversion methods, even under more practical settings with high-resolution images, large-sized batches, and advanced defense strategies.

INFORMS · 極大 · 自編碼器 · 變分自編碼 · INTERACT ·

2024 年 5 月 31 日

Information Maximization via Variational Autoencoders for Cross-Domain Recommendation

Xuying Ning,Wujiang Xu,Xiaolei Liu,Mingming Ha,Qiongxu Ma,Youru Li,Linxun Chen,Yongfeng Zhang

Cross-Domain Sequential Recommendation (CDSR) methods aim to address the data sparsity and cold-start problems present in Single-Domain Sequential Recommendation (SDSR). Existing CDSR methods typically rely on overlapping users, designing complex cross-domain modules to capture users' latent interests that can propagate across different domains. However, their propagated informative information is limited to the overlapping users and the users who have rich historical behavior records. As a result, these methods often underperform in real-world scenarios, where most users are non-overlapping (cold-start) and long-tailed. In this research, we introduce a new CDSR framework named Information Maximization Variational Autoencoder (\textbf{\texttt{IM-VAE}}). Here, we suggest using a Pseudo-Sequence Generator to enhance the user's interaction history input for downstream fine-grained CDSR models to alleviate the cold-start issues. We also propose a Generative Recommendation Framework combined with three regularizers inspired by the mutual information maximization (MIM) theory \cite{mcgill1954multivariate} to capture the semantic differences between a user's interests shared across domains and those specific to certain domains, as well as address the informational gap between a user's actual interaction sequences and the pseudo-sequences generated. To the best of our knowledge, this paper is the first CDSR work that considers the information disentanglement and denoising of pseudo-sequences in the open-world recommendation scenario. Empirical experiments illustrate that \texttt{IM-VAE} outperforms the state-of-the-art approaches on two real-world cross-domain datasets on all sorts of users, including cold-start and tailed users, demonstrating the effectiveness of \texttt{IM-VAE} in open-world recommendation.

自動問答 · 大語言模型 · MoDELS · Continuity · HTTPS ·

2024 年 5 月 31 日

Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM

Eliya Nachmani,Alon Levkovitch,Roy Hirsch,Julian Salazar,Chulayuth Asawaroengchai,Soroosh Mariooryad,Ehud Rivlin,RJ Skerry-Ryan,Michelle Tadmor Ramanovich

from arxiv, ICLR 2024 camera-ready

We present Spectron, a novel approach to adapting pre-trained large language models (LLMs) to perform spoken question answering (QA) and speech continuation. By endowing the LLM with a pre-trained speech encoder, our model becomes able to take speech inputs and generate speech outputs. The entire system is trained end-to-end and operates directly on spectrograms, simplifying our architecture. Key to our approach is a training objective that jointly supervises speech recognition, text continuation, and speech synthesis using only paired speech-text pairs, enabling a `cross-modal' chain-of-thought within a single decoding pass. Our method surpasses existing spoken language models in speaker preservation and semantic coherence. Furthermore, the proposed model improves upon direct initialization in retaining the knowledge of the original LLM as demonstrated through spoken QA datasets. We release our audio samples (//michelleramanovich.github.io/spectron/spectron) and spoken QA dataset (//github.com/google-research-datasets/LLAMA1-Test-Set).

INFORMS · 多樣性 · Learning · 數據選擇 · 可辨認的 ·

2024 年 5 月 30 日

How to Leverage Diverse Demonstrations in Offline Imitation Learning

Sheng Yue,Jiani Liu,Xingyuan Hua,Ju Ren,Sen Lin,Junshan Zhang,Yaoxue Zhang

from arxiv, International Conference on Machine Learning (ICML)

Offline Imitation Learning (IL) with imperfect demonstrations has garnered increasing attention owing to the scarcity of expert data in many real-world domains. A fundamental problem in this scenario is how to extract positive behaviors from noisy data. In general, current approaches to the problem select data building on state-action similarity to given expert demonstrations, neglecting precious information in (potentially abundant) $\textit{diverse}$ state-actions that deviate from expert ones. In this paper, we introduce a simple yet effective data selection method that identifies positive behaviors based on their resultant states -- a more informative criterion enabling explicit utilization of dynamics information and effective extraction of both expert and beneficial diverse behaviors. Further, we devise a lightweight behavior cloning algorithm capable of leveraging the expert and selected data correctly. In the experiments, we evaluate our method on a suite of complex and high-dimensional offline IL benchmarks, including continuous-control and vision-based tasks. The results demonstrate that our method achieves state-of-the-art performance, outperforming existing methods on $\textbf{20/21}$ benchmarks, typically by $\textbf{2-5x}$, while maintaining a comparable runtime to Behavior Cloning ($\texttt{BC}$).

估計/估計量 · contrastive · INFORMS · 互信息 · 表示學習 ·

2021 年 6 月 25 日

Decomposed Mutual Information Estimation for Contrastive Representation Learning

Alessandro Sordoni,Nouha Dziri,Hannes Schulz,Geoff Gordon,Phil Bachman,Remi Tachet

from arxiv, ICML 2021

Recent contrastive representation learning methods rely on estimating mutual information (MI) between multiple views of an underlying context. E.g., we can derive multiple views of a given image by applying data augmentation, or we can split a sequence into views comprising the past and future of some step in the sequence. Contrastive lower bounds on MI are easy to optimize, but have a strong underestimation bias when estimating large amounts of MI. We propose decomposing the full MI estimation problem into a sum of smaller estimation problems by splitting one of the views into progressively more informed subviews and by applying the chain rule on MI between the decomposed views. This expression contains a sum of unconditional and conditional MI terms, each measuring modest chunks of the total MI, which facilitates approximation via contrastive bounds. To maximize the sum, we formulate a contrastive lower bound on the conditional MI which can be approximated efficiently. We refer to our general approach as Decomposed Estimation of Mutual Information (DEMI). We show that DEMI can capture a larger amount of MI than standard non-decomposed contrastive bounds in a synthetic setting, and learns better representations in a vision domain and for dialogue generation.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

語言模型化

大語言模型

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<dir id='rex75'><del id='rex75'><del id='rex75'></del><pre id='rex75'><pre id='rex75'><option id='rex75'><address id='rex75'></address><bdo id='rex75'><tr id='rex75'><acronym id='rex75'><pre id='rex75'></pre></acronym><div id='rex75'></div></tr></bdo></option></pre><small id='rex75'><address id='rex75'><u id='rex75'><legend id='rex75'><option id='rex75'><abbr id='rex75'></abbr><li id='rex75'><pre id='rex75'></pre></li></option></legend><select id='rex75'></select></u></address></small></pre></del><sup id='rex75'></sup><blockquote id='rex75'><dt id='rex75'></dt></blockquote><blockquote id='rex75'></blockquote></dir><tt id='rex75'></tt><u id='rex75'><tt id='rex75'><form id='rex75'></form></tt><td id='rex75'><dt id='rex75'></dt></td></u>

<code id='rex75'><i id='rex75'><q id='rex75'><legend id='rex75'><pre id='rex75'><style id='rex75'><acronym id='rex75'><i id='rex75'><form id='rex75'><option id='rex75'><center id='rex75'></center></option></form></i></acronym></style><tt id='rex75'></tt></pre></legend></q></i></code><center id='rex75'></center>

<dd id='rex75'></dd>

<style id='rex75'></style><sub id='rex75'><dfn id='rex75'><abbr id='rex75'><big id='rex75'><bdo id='rex75'></bdo></big></abbr></dfn></sub>_{<dir id='rex75'></dir>}