Over the last decades, models based on deep neural networks have become the dominant paradigm in machine learning. More recently, the use of artificial neural networks in symbolic learning has become increasingly relevant. To study the capabilities of neural networks in the symbolic AI domain, researchers have explored their ability to learn mathematical constructions, such as addition and multiplication, logical inference, such as theorem proving, and even the execution of computer programs. The latter is known to be too complex a task for neural networks; consequently, results have not always been successful and have often required introducing biases into the learning process, in addition to restricting the scope of the programs to be executed. In this work, we analyze the ability of neural networks to learn how to execute programs as a whole. To do so, we propose a different approach: instead of using an imperative programming language with complex structures, we use the Lambda Calculus ({\lambda}-Calculus), a simple but Turing-complete mathematical formalism that serves as the basis for modern functional programming languages and lies at the heart of computability theory. We introduce an integrated approach combining neural learning with the {\lambda}-Calculus formalism. Finally, since the execution of a program in {\lambda}-Calculus is based on reductions, we show that it is enough to learn how to perform these reductions in order to execute any program. Keywords: Machine Learning, Lambda Calculus, Neurosymbolic AI, Neural Networks, Transformer Model, Sequence-to-Sequence Models, Computational Models
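To make the notion of reduction concrete, here is a minimal sketch of normal-order β-reduction on λ-terms in Python. It only illustrates the reduction step the abstract refers to; the tuple encoding of terms and the `reduce_once` helper are illustrative assumptions, not the paper's neural model or representation.

```python
# Minimal normal-order beta-reduction for lambda-terms (illustrative only).
# Terms: ('var', name) | ('lam', name, body) | ('app', fun, arg)
import itertools

fresh = (f"v{i}" for i in itertools.count())   # supply of fresh variable names

def free_vars(t):
    if t[0] == 'var':
        return {t[1]}
    if t[0] == 'lam':
        return free_vars(t[2]) - {t[1]}
    return free_vars(t[1]) | free_vars(t[2])

def subst(t, x, s):
    """Capture-avoiding substitution t[x := s]."""
    if t[0] == 'var':
        return s if t[1] == x else t
    if t[0] == 'app':
        return ('app', subst(t[1], x, s), subst(t[2], x, s))
    y, body = t[1], t[2]
    if y == x:                                  # x is shadowed, nothing to substitute
        return t
    if y in free_vars(s):                       # rename the bound variable to avoid capture
        z = next(fresh)
        body, y = subst(body, y, ('var', z)), z
    return ('lam', y, subst(body, x, s))

def reduce_once(t):
    """One leftmost-outermost beta-reduction step, or None if t is in normal form."""
    if t[0] == 'app' and t[1][0] == 'lam':      # (\y. body) arg  ->  body[y := arg]
        return subst(t[1][2], t[1][1], t[2])
    if t[0] == 'app':
        r = reduce_once(t[1])
        if r is not None:
            return ('app', r, t[2])
        r = reduce_once(t[2])
        return None if r is None else ('app', t[1], r)
    if t[0] == 'lam':
        r = reduce_once(t[2])
        return None if r is None else ('lam', t[1], r)
    return None

def show(t):
    if t[0] == 'var':
        return t[1]
    if t[0] == 'lam':
        return f"(\\{t[1]}. {show(t[2])})"
    return f"({show(t[1])} {show(t[2])})"

# (\f. \x. f (f x)) applied to the identity reduces to the identity in a few steps.
identity = ('lam', 'y', ('var', 'y'))
twice = ('lam', 'f', ('lam', 'x', ('app', ('var', 'f'), ('app', ('var', 'f'), ('var', 'x')))))
term = ('app', twice, identity)
while term is not None:
    print(show(term))
    term = reduce_once(term)
```

A sequence of such reduction steps is exactly the kind of data a sequence-to-sequence model could be trained on: each printed term is the input, and the next term is the target.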
The computing education community has a rich history of pedagogical innovation designed to support students in introductory courses, and to support teachers in facilitating student learning. Very recent advances in artificial intelligence have resulted in code generation models that can produce source code from natural language problem descriptions -- with impressive accuracy in many cases. The wide availability of these models and their ease of use has raised concerns about potential impacts on many aspects of society, including the future of computing education. In this paper, we discuss the challenges and opportunities such models present to computing educators, with a focus on introductory programming classrooms. We summarize the results of two recent articles, the first evaluating the performance of code generation models on typical introductory-level programming problems, and the second exploring the quality and novelty of learning resources generated by these models. We consider likely impacts of such models upon pedagogical practice in the context of the most recent advances at the time of writing.
This research delves into the current literature on bias in Natural Language Processing models and the techniques proposed to mitigate it, including why it is important to tackle bias in the first place. Additionally, these techniques are further analysed in the light of newly developed models that tower in size over past editions. To achieve those aims, the authors conducted their research on GPT-3 by OpenAI, the largest NLP model available to consumers today. With 175 billion parameters, in contrast to BERT's 340 million, GPT-3 is well suited to testing the common pitfalls of NLP models. Tests were conducted through the development of an Applicant Tracking System using GPT-3. For reasons of feasibility and time constraints, the tests focused primarily on gender bias rather than all or multiple types of bias. Finally, current mitigation techniques are considered and tested to measure their degree of functionality.
We derive new boundary conditions and implementation procedures for nonlinear initial boundary value problems (IBVPs) with non-zero boundary data that lead to bounded solutions. The new boundary procedure is applied to nonlinear IBVPs in skew-symmetric form, including dissipative terms. The complete procedure has two main ingredients. In the first part (published in [1, 2]), the energy and entropy rate was expressed as a surface integral with boundary terms for problems with first derivatives. In this second part, we complement it by adding second-derivative dissipative terms and by bounding the boundary terms. We develop a new nonlinear boundary procedure that generalises the characteristic boundary procedure for linear problems. Both strong and weak imposition of the nonlinear boundary conditions with non-zero boundary data are considered, and we prove that the solution is bounded. The boundary procedure is applied to four important IBVPs in computational fluid dynamics: the incompressible Euler, the incompressible Navier-Stokes, the shallow water, and the compressible Euler equations. Finally, we show that stable discrete approximations follow by using summation-by-parts operators combined with weak boundary conditions.
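As a small illustration of the discrete ingredient mentioned in the last sentence, the sketch below builds a standard second-order summation-by-parts (SBP) first-derivative operator D = H^{-1}Q and checks the SBP property Q + Q^T = B. This is a generic textbook construction, not the specific operators or the weak boundary treatment used in the paper.

```python
import numpy as np

# Standard second-order summation-by-parts (SBP) first-derivative operator D = H^{-1} Q
# on a uniform grid, with the SBP property Q + Q^T = B = diag(-1, 0, ..., 0, 1).
n, h = 11, 0.1
H = h * np.diag([0.5] + [1.0] * (n - 2) + [0.5])                       # diagonal norm (quadrature)
Q = 0.5 * (np.diag(np.ones(n - 1), 1) - np.diag(np.ones(n - 1), -1))   # skew-symmetric interior
Q[0, 0], Q[-1, -1] = -0.5, 0.5                                         # boundary closure
D = np.linalg.solve(H, Q)

# Verify the SBP property, which mimics integration by parts discretely.
B = np.zeros((n, n))
B[0, 0], B[-1, -1] = -1.0, 1.0
assert np.allclose(Q + Q.T, B)

# D is exact on linear functions (centered differences inside, one-sided at the boundaries).
x = np.linspace(0.0, 1.0, n)
print("max error on a linear function:", np.max(np.abs(D @ x - 1.0)))
```

The SBP property is what allows a discrete energy estimate to mirror the continuous one once the boundary conditions are imposed weakly.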
Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on surprisingly trivial problems. This raises the question: Are these errors incidental, or do they signal more substantial limitations? In an attempt to demystify Transformers, we investigate the limits of these models across three representative compositional tasks -- multi-digit multiplication, logic grid puzzles, and a classic dynamic programming problem. These tasks require breaking problems down into sub-steps and synthesizing these steps into a precise answer. We formulate compositional tasks as computation graphs to systematically quantify the level of complexity, and break down reasoning steps into intermediate sub-procedures. Our empirical findings suggest that Transformers solve compositional tasks by reducing multi-step compositional reasoning into linearized subgraph matching, without necessarily developing systematic problem-solving skills. To round off our empirical study, we provide theoretical arguments on abstract multi-step reasoning problems that highlight how Transformers' performance will rapidly decay with increased task complexity.
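A hedged sketch of the kind of decomposition the abstract alludes to: multi-digit multiplication broken into per-digit partial products and a running sum, with each intermediate result viewed as a node of a computation graph. The `multiplication_graph` helper and its node count are illustrative assumptions, not the paper's exact graph construction or complexity measure.

```python
# Decompose a * b into per-digit partial products and a running sum; each intermediate
# result is treated as a node of a (linear) computation graph. Hypothetical helper.
def multiplication_graph(a: int, b: int):
    steps = []                                   # one entry per sub-procedure output
    total = 0
    for i, da in enumerate(reversed(str(a))):    # digits of a, least significant first
        partial = int(da) * b * 10 ** i          # partial product for this digit
        total += partial                         # fold it into the running sum
        steps.append(("partial_product", da, partial))
        steps.append(("running_sum", total))
    return total, steps

result, steps = multiplication_graph(347, 86)
assert result == 347 * 86
for step in steps:
    print(step)
print("intermediate nodes:", len(steps))
```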
We consider the problem of binary string reconstruction from the multiset of its substring compositions, referred to as the substring composition multiset, first introduced and studied by Acharya et al. We introduce a new algorithm for reconstructing a string from its substring composition multiset that relies on the algebraic properties of the equivalent bivariate polynomial formulation of the problem. We then characterize specific algebraic conditions on the binary string to be reconstructed that guarantee the algorithm does not require any backtracking during the reconstruction and, consequently, that its time complexity is bounded polynomially. More specifically, in the case of no backtracking, our algorithm has a time complexity of $O(n^2)$, compared to the algorithm by Acharya et al., which has a time complexity of $O(n^2\log(n))$, where $n$ is the length of the binary string. Furthermore, it is shown that larger sets of binary strings are uniquely reconstructable by the new algorithm without the need for backtracking, leading to codebooks of reconstruction codes that are larger by a linear factor in size than the previously known construction by Pattabiraman et al., while having $O(n^2)$ reconstruction complexity.
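For concreteness, a short example of the object being reconstructed: the substring composition multiset collects, for every contiguous substring, the pair (number of 0s, number of 1s). The helper below only illustrates this definition; it is not the reconstruction algorithm.

```python
from collections import Counter

def composition_multiset(s: str) -> Counter:
    """Multiset of (#0s, #1s) pairs over all contiguous substrings of a binary string."""
    comps = Counter()
    for i in range(len(s)):
        zeros = ones = 0
        for j in range(i, len(s)):
            zeros += s[j] == '0'
            ones += s[j] == '1'
            comps[(zeros, ones)] += 1
    return comps

# The string "0110" has 10 substrings; its composition multiset is
# {(1,0): 2, (0,1): 2, (1,1): 2, (1,2): 2, (0,2): 1, (2,2): 1}.
print(composition_multiset("0110"))
```

In the bivariate polynomial formulation mentioned in the abstract, each composition (i, j) corresponds to a monomial x^i y^j, so the multiset defines a polynomial whose algebraic structure the algorithm exploits.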
Models that rely solely on pairwise relationships often fail to capture the complete statistical structure of the complex multivariate data found in diverse domains, such as socio-economic, ecological, or biomedical systems. Non-trivial dependencies between groups of more than two variables can play a significant role in the analysis and modelling of such systems, yet extracting such high-order interactions from data remains challenging. Here, we introduce a hierarchy of $d$-order ($d \geq 2$) interaction measures, increasingly inclusive of possible factorisations of the joint probability distribution, and define non-parametric, kernel-based tests to establish systematically the statistical significance of $d$-order interactions. We also establish mathematical links with lattice theory, which elucidate the derivation of the interaction measures and their composite permutation tests; clarify the connection of simplicial complexes with kernel matrix centring; and provide a means to enhance computational efficiency. We illustrate our results numerically with validations on synthetic data, and through an application to neuroimaging data.
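As a simplified base case (d = 2) of the kernel-based, permutation-tested dependence measures described above, the sketch below runs a standard HSIC-style test with centred Gaussian kernel matrices. It is only meant to convey the flavour of kernel centring plus permutation testing; the paper's general d-order measures and composite permutation tests are not reproduced here.

```python
import numpy as np

def centred_gram(x: np.ndarray, sigma: float = 1.0) -> np.ndarray:
    """Gaussian kernel matrix, double-centred: H K H with H = I - (1/n) 11^T."""
    K = np.exp(-((x[:, None] - x[None, :]) ** 2) / (2 * sigma ** 2))
    n = len(x)
    H = np.eye(n) - np.ones((n, n)) / n
    return H @ K @ H

def hsic(x: np.ndarray, y: np.ndarray) -> float:
    """Biased HSIC estimate trace(Kc Lc) / n^2, a kernel measure of pairwise dependence."""
    return float(np.trace(centred_gram(x) @ centred_gram(y))) / len(x) ** 2

rng = np.random.default_rng(0)
n = 200
x = rng.normal(size=n)
y = x ** 2 + 0.1 * rng.normal(size=n)        # dependent on x, yet uncorrelated with it

stat = hsic(x, y)
null = np.array([hsic(x, rng.permutation(y)) for _ in range(500)])   # permutation null
print(f"HSIC = {stat:.4f}, permutation p-value = {np.mean(null >= stat):.3f}")
```

The example deliberately uses a nonlinear, uncorrelated dependence (y = x^2) that a purely pairwise linear measure such as correlation would miss.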
Recent attacks have renewed public interest in the physical security of railways. Knowing about and learning from previous attacks is necessary to secure railways against them. This paper presents a structured data set of physical attacks against railways. We analyze the data regarding the means used, the targeted component of the railway system, the attacker type, and the geographical distribution of attacks. The results indicate a growing heterogeneity of observed attacks in the most recent decade compared to previous decades and centuries, making the protection of railways more complex.
Propensity scores are commonly used to balance observed covariates while estimating treatment effects. Estimates obtained through propensity score weighting can be biased when the propensity score model cannot learn the true treatment assignment mechanism. We argue that the probabilistic output of a learned propensity score model should be calibrated, i.e., a predicted treatment probability of 90% should correspond to 90% of individuals being assigned to the treatment group. We propose simple recalibration techniques to ensure this property. We investigate the theoretical properties of a calibrated propensity score model and its role in unbiased treatment effect estimation. We demonstrate improved causal effect estimation with calibrated propensity scores in several tasks, including high-dimensional genome-wide association studies, where we also show reduced computational requirements when calibration is applied to simpler propensity score models.
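A minimal sketch of the general idea, assuming isotonic recalibration on held-out data as the "simple recalibration technique" (the paper's exact method may differ): fit a propensity model, recalibrate its scores, and plug the calibrated scores into an inverse-propensity-weighted effect estimate. All variable names and the simulated data are illustrative.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n, d = 2000, 5
X = rng.normal(size=(n, d))
true_p = 1 / (1 + np.exp(-(X[:, 0] + 0.5 * X[:, 1])))   # true assignment mechanism
T = rng.binomial(1, true_p)                             # treatment indicator
Y = 2.0 * T + X[:, 0] + rng.normal(size=n)              # outcome; true ATE = 2

# Fit the propensity model on one split, recalibrate its scores on the other.
X_fit, X_cal, T_fit, T_cal = train_test_split(X, T, test_size=0.5, random_state=0)
model = LogisticRegression().fit(X_fit, T_fit)
iso = IsotonicRegression(out_of_bounds="clip").fit(model.predict_proba(X_cal)[:, 1], T_cal)

# Calibrated propensity scores, then a plain inverse-propensity-weighted ATE estimate.
e_hat = np.clip(iso.predict(model.predict_proba(X)[:, 1]), 1e-3, 1 - 1e-3)
ate_ipw = np.mean(T * Y / e_hat - (1 - T) * Y / (1 - e_hat))
print(f"IPW ATE with recalibrated propensity scores: {ate_ipw:.2f}")
```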
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles (e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent capabilities, and their effectiveness across so many tasks incentivizes homogenization. Homogenization provides powerful leverage but demands caution, as the defects of the foundation model are inherited by all the adapted models downstream. Despite the impending widespread deployment of foundation models, we currently lack a clear understanding of how they work, when they fail, and what they are even capable of due to their emergent properties. To tackle these questions, we believe much of the critical research on foundation models will require deep interdisciplinary collaboration commensurate with their fundamentally sociotechnical nature.
The remarkable practical success of deep learning has revealed some major surprises from a theoretical perspective. In particular, simple gradient methods easily find near-optimal solutions to non-convex optimization problems, and despite giving a near-perfect fit to training data without any explicit effort to control model complexity, these methods exhibit excellent predictive accuracy. We conjecture that specific principles underlie these phenomena: that overparametrization allows gradient methods to find interpolating solutions, that these methods implicitly impose regularization, and that overparametrization leads to benign overfitting. We survey recent theoretical progress that provides examples illustrating these principles in simpler settings. We first review classical uniform convergence results and why they fall short of explaining aspects of the behavior of deep learning methods. We give examples of implicit regularization in simple settings, where gradient methods lead to minimal norm functions that perfectly fit the training data. Then we review prediction methods that exhibit benign overfitting, focusing on regression problems with quadratic loss. For these methods, we can decompose the prediction rule into a simple component that is useful for prediction and a spiky component that is useful for overfitting but, in a favorable setting, does not harm prediction accuracy. We focus specifically on the linear regime for neural networks, where the network can be approximated by a linear model. In this regime, we demonstrate the success of gradient flow, and we consider benign overfitting with two-layer networks, giving an exact asymptotic analysis that precisely demonstrates the impact of overparametrization. We conclude by highlighting the key challenges that arise in extending these insights to realistic deep learning settings.
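To illustrate the first principle (gradient methods finding interpolating, minimum-norm solutions) in the simplest setting, the sketch below runs gradient descent from a zero initialization on an overparametrized least-squares problem and compares the result with the minimum-norm interpolant X^+ y. This is a toy illustration of a classical fact about gradient descent on linear models, not the paper's analysis.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 20, 200                               # overparametrized: many more features than samples
X = rng.normal(size=(n, d))
y = rng.normal(size=n)

# Gradient descent on the squared loss, started from zero.
w = np.zeros(d)
lr = 1e-2
for _ in range(20000):
    w -= lr * X.T @ (X @ w - y) / n

# The minimum-norm interpolant is the pseudoinverse solution X^+ y; gradient descent
# from zero stays in the row space of X and converges to it.
w_min_norm = np.linalg.pinv(X) @ y
print("training residual:        ", np.linalg.norm(X @ w - y))
print("distance to min-norm sol.:", np.linalg.norm(w - w_min_norm))
```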