亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tr id='oq9bl'><strong id='oq9bl'></strong><small id='oq9bl'></small><button id='oq9bl'></button><li id='oq9bl'><noscript id='oq9bl'><big id='oq9bl'></big><dt id='oq9bl'></dt></noscript></li></tr><ol id='oq9bl'><option id='oq9bl'><table id='oq9bl'><blockquote id='oq9bl'><tbody id='oq9bl'></tbody></blockquote></table></option></ol><u id='oq9bl'></u><kbd id='oq9bl'><kbd id='oq9bl'></kbd></kbd>

<code id='oq9bl'><strong id='oq9bl'></strong></code>

<fieldset id='oq9bl'></fieldset>

<span id='oq9bl'></span>

<ins id='oq9bl'></ins>

<acronym id='oq9bl'><em id='oq9bl'></em><td id='oq9bl'><div id='oq9bl'></div></td></acronym><address id='oq9bl'><big id='oq9bl'><big id='oq9bl'></big><legend id='oq9bl'></legend></big></address>

<i id='oq9bl'><div id='oq9bl'><ins id='oq9bl'></ins></div></i>

<i id='oq9bl'></i>

·

樸素貝葉斯分類器 · 樸素貝葉斯 · Processing（編程語言） · 知識 (knowledge) · 可理解性 ·

2023 年 9 月 8 日

Viewing the process of generating counterfactuals as a source of knowledge -- Application to the Naive Bayes classifier

Vincent Lemaire,Nathan Le Boudec,Fran?oise Fessant,Victor Guyomard

from arxiv, 12 pages

There are now many comprehension algorithms for understanding the decisions of a machine learning algorithm. Among these are those based on the generation of counterfactual examples. This article proposes to view this generation process as a source of creating a certain amount of knowledge that can be stored to be used, later, in different ways. This process is illustrated in the additive model and, more specifically, in the case of the naive Bayes classifier, whose interesting properties for this purpose are shown.

相關內容

樸素貝葉斯分類器

樸素貝葉斯分類器

在機器學習中，樸素貝葉斯分類器是一系列以假設特征之間強（樸素）獨立下運用貝葉斯定理為基礎的簡單概率分類器。樸素貝葉斯自20世紀50年代已廣泛研究。在20世紀60年代初就以另外一個名稱引入到文本信息檢索界中，并仍然是文本分類的一種熱門（基準）方法，文本分類是以詞頻為特征判斷文件所屬類別或其他（如垃圾郵件、合法性、體育或政治等等）的問題。通過適當的預處理，它可以與這個領域更先進的方法（包括支持向量機）相競爭。它在自動醫療診斷中也有應用

Networking · 頻率主義學派 · Neural Networks · 過擬合 · 稀疏 ·

2023 年 10 月 25 日

Sparse Bayesian neural networks for regression: Tackling overfitting and computational challenges in uncertainty quantification

Nastaran Dabiran,Brandon Robinson,Rimple Sandhu,Mohammad Khalil,Dominique Poirel,Abhijit Sarkar

Neural networks (NNs) are primarily developed within the frequentist statistical framework. Nevertheless, frequentist NNs lack the capability to provide uncertainties in the predictions, and hence their robustness can not be adequately assessed. Conversely, the Bayesian neural networks (BNNs) naturally offer predictive uncertainty by applying Bayes' theorem. However, their computational requirements pose significant challenges. Moreover, both frequentist NNs and BNNs suffer from overfitting issues when dealing with noisy and sparse data, which render their predictions unwieldy away from the available data space. To address both these problems simultaneously, we leverage insights from a hierarchical setting in which the parameter priors are conditional on hyperparameters to construct a BNN by applying a semi-analytical framework known as nonlinear sparse Bayesian learning (NSBL). We call our network sparse Bayesian neural network (SBNN) which aims to address the practical and computational issues associated with BNNs. Simultaneously, imposing a sparsity-inducing prior encourages the automatic pruning of redundant parameters based on the automatic relevance determination (ARD) concept. This process involves removing redundant parameters by optimally selecting the precision of the parameters prior probability density functions (pdfs), resulting in a tractable treatment for overfitting. To demonstrate the benefits of the SBNN algorithm, the study presents an illustrative regression problem and compares the results of a BNN using standard Bayesian inference, hierarchical Bayesian inference, and a BNN equipped with the proposed algorithm. Subsequently, we demonstrate the importance of considering the full parameter posterior by comparing the results with those obtained using the Laplace approximation with and without NSBL.

置信度 · AI · INTERACT · 在線 · 可理解性 ·

2023 年 10 月 25 日

Assessing the relationship between subjective trust, confidence measurements, and mouse trajectory characteristics in an online task

Martin Dechant,Susanne Poeller,Benedikt Hosp,Olga Lukashova-Sanz,Alexandra Sipatchin,Siegfried Wahl

from arxiv, Submitted to CHI 2023 and rejected

Trust is essential for our interactions with others but also with artificial intelligence (AI) based systems. To understand whether a user trusts an AI, researchers need reliable measurement tools. However, currently discussed markers mostly rely on expensive and invasive sensors, like electroencephalograms, which may cause discomfort. The analysis of mouse trajectory has been suggested as a convenient tool for trust assessment. However, the relationship between trust, confidence and mouse trajectory is not yet fully understood. To provide more insights into this relationship, we asked participants (n = 146) to rate whether several tweets were offensive while an AI suggested its assessment. Our results reveal which aspects of the mouse trajectory are affected by the users subjective trust and confidence ratings; yet they indicate that these measures might not explain sufficiently the variance to be used on their own. This work examines a potential low-cost trust assessment in AI systems.

知識 (knowledge) · 圖 · 知識圖譜 · MoDELS · 圖卷積神經網絡/圖卷積網絡 ·

2023 年 10 月 24 日

Context-aware explainable recommendations over knowledge graphs

Jinfeng Zhong,Elsa Negre

Knowledge graphs contain rich semantic relationships related to items and incorporating such semantic relationships into recommender systems helps to explore the latent connections of items, thus improving the accuracy of prediction and enhancing the explainability of recommendations. However, such explainability is not adapted to users' contexts, which can significantly influence their preferences. In this work, we propose CA-KGCN (Context-Aware Knowledge Graph Convolutional Network), an end-to-end framework that can model users' preferences adapted to their contexts and can incorporate rich semantic relationships in the knowledge graph related to items. This framework captures users' attention to different factors: contexts and features of items. More specifically, the framework can model users' preferences adapted to their contexts and provide explanations adapted to the given context. Experiments on three real-world datasets show the effectiveness of our framework: modeling users' preferences adapted to their contexts and explaining the recommendations generated.

Performer · 蒸餾 · MoDELS · 集成 · 數據集 ·

2023 年 10 月 24 日

Baby Llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty

Inar Timiryasov,Jean-Loup Tastet

from arxiv, 11 pages, 4 figures, 4 tables, submitted to the BabyLM Challenge and accepted as archival full paper (CoNLL--CMCL 2023 Shared Task), checkpoint available at //huggingface.co/timinar/baby-llama-58m, training code available at //github.com/timinar/BabyLlama

We present our submission to the BabyLM challenge, whose goal was to improve the sample efficiency of language models. We trained an ensemble consisting of a GPT-2 and small LLaMA models on the developmentally-plausible, 10M-word BabyLM dataset, then distilled it into a small, 58M-parameter LLaMA model, which exceeds in performance both of its teachers as well as a similar model trained without distillation. This suggests that distillation can not only retain the full performance of the teacher model when the latter is trained on a sufficiently small dataset; it can exceed it, and lead to significantly better performance than direct training.

GPT-2 · 語言模型化 · MoDELS · 數學 · 可辨認的 ·

2023 年 10 月 24 日

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

Michael Hanna,Ollie Liu,Alexandre Variengien

from arxiv, NeurIPS 2023 Camera Ready Version

Pre-trained language models can be surprisingly adept at tasks they were not explicitly trained on, but how they implement these capabilities is poorly understood. In this paper, we investigate the basic mathematical abilities often acquired by pre-trained language models. Concretely, we use mechanistic interpretability techniques to explain the (limited) mathematical abilities of GPT-2 small. As a case study, we examine its ability to take in sentences such as "The war lasted from the year 1732 to the year 17", and predict valid two-digit end years (years > 32). We first identify a circuit, a small subset of GPT-2 small's computational graph that computes this task's output. Then, we explain the role of each circuit component, showing that GPT-2 small's final multi-layer perceptrons boost the probability of end years greater than the start year. Finally, we find related tasks that activate our circuit. Our results suggest that GPT-2 small computes greater-than using a complex but general mechanism that activates across diverse contexts.

可理解性 · SimPLe · Analysis · Principle · CASES ·

2023 年 10 月 24 日

The Quantum Tortoise and the Classical Hare: A simple framework for understanding which problems quantum computing will accelerate (and which it will not)

Sukwoong Choi,William S. Moses,Neil Thompson

Quantum computing promises transformational gains for solving some problems, but little to none for others. For anyone hoping to use quantum computers now or in the future, it is important to know which problems will benefit. In this paper, we introduce a framework for answering this question both intuitively and quantitatively. The underlying structure of the framework is a race between quantum and classical computers, where their relative strengths determine when each wins. While classical computers operate faster, quantum computers can sometimes run more efficient algorithms. Whether the speed advantage or the algorithmic advantage dominates determines whether a problem will benefit from quantum computing or not. Our analysis reveals that many problems, particularly those of small to moderate size that can be important for typical businesses, will not benefit from quantum computing. Conversely, larger problems or those with particularly big algorithmic gains will benefit from near-term quantum computing. Since very large algorithmic gains are rare in practice and theorized to be rare even in principle, our analysis suggests that the benefits from quantum computing will flow either to users of these rare cases, or practitioners processing very large data.

推斷 · 可辨認的 · 查準率/準確率 · 均值 · 潛在 ·

2023 年 10 月 23 日

The difference between structural counterfactuals and potential outcomes

Most of the literature on causality considers the structural framework of Pearl and the potential-outcome framework of Neyman and Rubin to be formally equivalent, and therefore interchangeably uses the do-notation and the potential-outcome subscript notation to write counterfactual outcomes. In this paper, we superimpose the two causal frameworks to prove that structural counterfactual outcomes and potential outcomes do not coincide in general -- not even in law. More precisely, we express the law of the potential outcomes in terms of the latent structural causal model under the fundamental assumptions of causal inference. This enables us to precisely identify when counterfactual inference is or is not equivalent between approaches, and to clarify the meaning of each kind of counterfactuals.

評論員 · contrastive · Learning · 泛化理論 · 可辨認的 ·

2023 年 10 月 23 日

Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL

Chen Sun,Wannan Yang,Thomas Jiralerspong,Dane Malenfant,Benjamin Alsbury-Nealy,Yoshua Bengio,Blake Richards

In real life, success is often contingent upon multiple critical steps that are distant in time from each other and from the final reward. These critical steps are challenging to identify with traditional reinforcement learning (RL) methods that rely on the Bellman equation for credit assignment. Here, we present a new RL algorithm that uses offline contrastive learning to hone in on these critical steps. This algorithm, which we call Contrastive Retrospection (ConSpec), can be added to any existing RL algorithm. ConSpec learns a set of prototypes for the critical steps in a task by a novel contrastive loss and delivers an intrinsic reward when the current state matches one of the prototypes. The prototypes in ConSpec provide two key benefits for credit assignment: (i) They enable rapid identification of all the critical steps. (ii) They do so in a readily interpretable manner, enabling out-of-distribution generalization when sensory features are altered. Distinct from other contemporary RL approaches to credit assignment, ConSpec takes advantage of the fact that it is easier to retrospectively identify the small set of steps that success is contingent upon (and ignoring other states) than it is to prospectively predict reward at every taken step. ConSpec greatly improves learning in a diverse set of RL tasks.

任務對話系統 · INTERACT · Engineering · RE · MoDELS ·

2023 年 10 月 21 日

Towards dialogue based, computer aided software requirements elicitation

Vasiliy Seibert

Several approaches have been presented, which aim to extract models from natural language specifications. These approaches have inherent weaknesses for they assume an initial problem understanding that is perfect, and they leave no room for feedback. Motivated by real-world collaboration settings between requirements engineers and customers, this paper proposes an interaction blueprint that aims for dialogue based, computer aided software requirements analysis. Compared to mere model extraction approaches, this interaction blueprint encourages individuality, creativity and genuine compromise. A simplistic Experiment was conducted to showcase the general idea. This paper discusses the experiment as well as the proposed interaction blueprint and argues, that advancements in natural language processing and generative AI might lead to significant progress in a foreseeable future. However, for that, there is a need to move away from a magical black box expectation and instead moving towards a dialogue based approach that recognizes the individuality that is an undeniable part of requirements engineering.

PDE · Neural Networks · Networking · 期望錯誤 · MoDELS ·

2023 年 10 月 20 日

Meta-learning of Physics-informed Neural Networks for Efficiently Solving Newly Given PDEs

Tomoharu Iwata,Yusuke Tanaka,Naonori Ueda

We propose a neural network-based meta-learning method to efficiently solve partial differential equation (PDE) problems. The proposed method is designed to meta-learn how to solve a wide variety of PDE problems, and uses the knowledge for solving newly given PDE problems. We encode a PDE problem into a problem representation using neural networks, where governing equations are represented by coefficients of a polynomial function of partial derivatives, and boundary conditions are represented by a set of point-condition pairs. We use the problem representation as an input of a neural network for predicting solutions, which enables us to efficiently predict problem-specific solutions by the forwarding process of the neural network without updating model parameters. To train our model, we minimize the expected error when adapted to a PDE problem based on the physics-informed neural network framework, by which we can evaluate the error even when solutions are unknown. We demonstrate that our proposed method outperforms existing methods in predicting solutions of PDE problems.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

樸素貝葉斯分類器

樸素貝葉斯

Processing（編程語言）

知識 (knowledge)

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<form id='AilCG'></form>

<bdo id='7iJAJ'><sup id='PReg6'><div id='XRCqx'><bdo id='iYVnd'></bdo></div></sup></bdo>