
ChatGPT · Controller · Learning · CASE · Reinforcement Learning

June 13, 2023

Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via Reinforcement Learning

Michael Villarreal, Bibek Poudel, Weizi Li

The surge in Reinforcement Learning (RL) applications in Intelligent Transportation Systems (ITS) has contributed to its growth as well as highlighted key challenges. However, defining objectives of RL agents in traffic control and management tasks, as well as aligning policies with these goals through an effective formulation of a Markov Decision Process (MDP), can be challenging and often requires domain experts in both RL and ITS. Recent advancements in Large Language Models (LLMs) such as GPT-4 highlight their broad general knowledge, reasoning capabilities, and commonsense priors across various domains. In this work, we conduct a large-scale user study involving 70 participants to investigate whether novices can leverage ChatGPT to solve complex mixed traffic control problems. Three environments are tested: ring road, bottleneck, and intersection. We find that ChatGPT has mixed results. For intersection and bottleneck, ChatGPT increases the number of successful policies by 150% and 136% compared to solely beginner capabilities, with some of them even outperforming experts. However, ChatGPT does not provide consistent improvements across all scenarios.

Related content

ChatGPT

ChatGPT (full name: Chat Generative Pre-trained Transformer) is a chatbot program developed by OpenAI in the United States [1], released on November 30, 2022. ChatGPT is an AI-driven natural language processing tool. It carries on conversations by learning and understanding human language, responds based on the context of the chat, and communicates much as a person would. It can also complete tasks such as writing emails, video scripts, marketing copy, translations, code, and papers. [1] //openai.com/blog/chatgpt/

Linear Regression · Linear · Minimum Point · Estimation/Estimator · Isotropic

August 7, 2023

Batches Stabilize the Minimum Norm Risk in High Dimensional Overparameterized Linear Regression

Shahar Stein Ioushua, Inbar Hasidim, Ofer Shayevitz, Meir Feder

from arXiv, 55 pages

Learning algorithms that divide the data into batches are prevalent in many machine-learning applications, typically offering useful trade-offs between computational efficiency and performance. In this paper, we examine the benefits of batch-partitioning through the lens of a minimum-norm overparameterized linear regression model with isotropic Gaussian features. We suggest a natural small-batch version of the minimum-norm estimator, and derive an upper bound on its quadratic risk, showing it is inversely proportional to the noise level as well as to the overparameterization ratio, for the optimal choice of batch size. In contrast to minimum-norm, our estimator admits a stable risk behavior that is monotonically increasing in the overparameterization ratio, eliminating both the blowup at the interpolation point and the double-descent phenomenon. Interestingly, we observe that this implicit regularization offered by the batch partition is partially explained by feature overlap between the batches. Our bound is derived via a novel combination of techniques, in particular normal approximation in the Wasserstein metric of noisy projections over random subspaces.

Language Modeling · Functional · Med-PaLM 2 · MoDELS · Score

August 3, 2023

The Capability of Large Language Models to Measure Psychiatric Functioning

Isaac R. Galatzer-Levy, Daniel McDuff, Vivek Natarajan, Alan Karthikesalingam, Matteo Malgaroli

The current work investigates the capability of large language models (LLMs) that are explicitly trained on large corpora of medical knowledge (Med-PaLM 2) to predict psychiatric functioning from patient interviews and clinical descriptions without being trained to do so. To assess this, n = 145 depression and n = 115 PTSD assessments and n = 46 clinical case studies across high prevalence/high comorbidity disorders (depressive, anxiety, psychotic, trauma and stress, and addictive disorders) were analyzed using prompts to extract estimated clinical scores and diagnoses. Results demonstrate that Med-PaLM 2 is capable of assessing psychiatric functioning across a range of psychiatric conditions, with the strongest performance being the prediction of depression scores based on standardized assessments (accuracy range = 0.80 - 0.84), which were statistically indistinguishable from human clinical raters, t(1,144) = 1.20; p = 0.23. Results show the potential for general clinical language models to flexibly predict psychiatric risk based on free descriptions of functioning from both patients and clinicians.

Estimation/Estimator · Sample · Correlation Coefficient · Confidence · Networking

August 3, 2023

Quantification of Predictive Uncertainty via Inference-Time Sampling

Katarína Tóthová, Ľubor Ladicky, Daniel Thul, Marc Pollefeys, Ender Konukoglu

Predictive variability due to data ambiguities has typically been addressed via construction of dedicated models with built-in probabilistic capabilities that are trained to predict uncertainty estimates as variables of interest. These approaches require distinct architectural components and training mechanisms, may include restrictive assumptions and exhibit overconfidence, i.e., high confidence in imprecise predictions. In this work, we propose a post-hoc sampling strategy for estimating predictive uncertainty accounting for data ambiguity. The method can generate different plausible outputs for a given input and does not assume parametric forms of predictive distributions. It is architecture agnostic and can be applied to any feed-forward deterministic network without changes to the architecture or training procedure. Experiments on regression tasks on imaging and non-imaging input data show the method's ability to generate diverse and multi-modal predictive distributions, and a desirable correlation of the estimated uncertainty with the prediction error.

Link Prediction · Graphics Processing Unit · Networking · Graph · Neural Networks

August 3, 2023

Evaluating Link Prediction Explanations for Graph Neural Networks

Claudio Borile, Alan Perotti, André Panisson

from arXiv, This work has been accepted to be presented at The 1st World Conference on eXplainable Artificial Intelligence (xAI 2023), July 26-28, 2023 - Lisboa, Portugal

Graph Machine Learning (GML) has numerous applications, such as node/graph classification and link prediction, in real-world domains. Providing human-understandable explanations for GML models is a challenging yet fundamental task to foster their adoption, but validating explanations for link prediction models has received little attention. In this paper, we provide quantitative metrics to assess the quality of link prediction explanations, with or without ground-truth. State-of-the-art explainability methods for Graph Neural Networks are evaluated using these metrics. We discuss how underlying assumptions and technical details specific to the link prediction task, such as the choice of distance between node embeddings, can influence the quality of the explanations.

Identifiable · Analysis · Automator · Reducible · Recall

August 2, 2023

Manual Tests Do Smell! Cataloging and Identifying Natural Language Test Smells

Elvys Soares, Manoel Aranda, Naelson Oliveira, Márcio Ribeiro, Rohit Gheyi, Emerson Souza, Ivan Machado, André Santos, Baldoino Fonseca, Rodrigo Bonifácio

from arXiv, The 17th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM), 2023

Background: Test smells indicate potential problems in the design and implementation of automated software tests that may negatively impact test code maintainability, coverage, and reliability. When poorly described, manual tests written in natural language may suffer from related problems, which enable their analysis from the point of view of test smells. Despite the possible prejudice to manually tested software products, little is known about test smells in manual tests, which results in many open questions regarding their types, frequency, and harm to tests written in natural language. Aims: Therefore, this study aims to contribute to a catalog of test smells for manual tests. Method: We perform a two-fold empirical strategy. First, an exploratory study in manual tests of three systems: the Ubuntu Operational System, the Brazilian Electronic Voting Machine, and the User Interface of a large smartphone manufacturer. We use our findings to propose a catalog of eight test smells and identification rules based on syntactical and morphological text analysis, validating our catalog with 24 in-company test engineers. Second, using our proposals, we create a tool based on Natural Language Processing (NLP) to analyze the subject systems' tests, validating the results. Results: We observed the occurrence of eight test smells. A survey of 24 in-company test professionals showed that 80.7% agreed with our catalog definitions and examples. Our NLP-based tool achieved a precision of 92%, recall of 95%, and f-measure of 93.5%, and its execution evidenced 13,169 occurrences of our cataloged test smells in the analyzed systems. Conclusion: We contribute with a catalog of natural language test smells and novel detection strategies that better explore the capabilities of current NLP mechanisms with promising results and reduced effort to analyze tests written in different idioms.

Language Modeling · MoDELS · Generalization Theory · Identifiable · Continuity

July 12, 2023

A Comprehensive Overview of Large Language Models

Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar, Muhammad Usman, Nick Barnes, Ajmal Mian

Large Language Models (LLMs) have shown excellent generalization capabilities that have led to the development of numerous models. These models propose various new architectures, tweaking existing architectures with refined training strategies, increasing context length, using high-quality training data, and increasing training time to outperform baselines. Analyzing new developments is crucial for identifying changes that enhance training stability and improve generalization in LLMs. This survey paper comprehensively analyzes LLM architectures and their categorization, training strategies, training datasets, and performance evaluations, and discusses future research directions. Moreover, the paper also discusses the basic building blocks and concepts behind LLMs, followed by a complete overview of LLMs, including their important features and functions. Finally, the paper summarizes significant findings from LLM research and consolidates essential architectural and training strategies for developing advanced LLMs. Given the continuous advancements in LLMs, we intend to regularly update this paper by incorporating new sections and featuring the latest LLMs.

INFORMS · Information Extraction · Robustness · Dataset · CLUES

January 24, 2021

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

Jiapeng Wang, Chongyu Liu, Lianwen Jin, Guozhi Tang, Jiaxin Zhang, Shuaitao Zhang, Qianying Wang, Yaqiang Wu, Mingxiang Cai

from arXiv, 8 pages, 5 figures, to be published in AAAI 2021

Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education. Most existing works decouple this problem into several independent sub-tasks of text spotting (text detection and recognition) and information extraction, which completely ignores the high correlation among them during optimization. In this paper, we propose a robust visual information extraction system (VIES) towards real-world scenarios, which is a unified end-to-end trainable framework for simultaneous text detection, recognition and information extraction that takes a single document image as input and outputs the structured information. Specifically, the information extraction branch collects abundant visual and semantic representations from text spotting for multimodal feature fusion and, conversely, provides higher-level semantic clues to contribute to the optimization of text spotting. Moreover, regarding the shortage of public benchmarks, we construct a fully-annotated dataset called EPHOIE (//github.com/HCIILAB/EPHOIE), which is the first Chinese benchmark for both text spotting and visual information extraction. EPHOIE consists of 1,494 images of examination paper heads with complex layouts and backgrounds, including a total of 15,771 Chinese handwritten or printed text instances. Compared with the state-of-the-art methods, our VIES shows significantly superior performance on the EPHOIE dataset and achieves a 9.01% F-score gain on the widely used SROIE dataset under the end-to-end scenario.

Performer · Graphics Processing Unit · Graph · Neural Networks · Extensibility

October 29, 2020

Scalable Graph Neural Networks via Bidirectional Propagation

Ming Chen, Zhewei Wei, Bolin Ding, Yaliang Li, Ye Yuan, Xiaoyong Du, Ji-Rong Wen

from arXiv, NeurIPS 2020

Graph Neural Networks (GNNs) are an emerging field for learning on non-Euclidean data. Recently, there has been increased interest in designing GNNs that scale to large graphs. Most existing methods use "graph sampling" or "layer-wise sampling" techniques to reduce training time. However, these methods still suffer from degrading performance and scalability problems when applied to graphs with billions of edges. This paper presents GBP, a scalable GNN that utilizes a localized bidirectional propagation process from both the feature vectors and the training/testing nodes. Theoretical analysis shows that GBP is the first method that achieves sub-linear time complexity for both the precomputation and the training phases. An extensive empirical study demonstrates that GBP achieves state-of-the-art performance with significantly less training/testing time. Most notably, GBP can deliver superior performance on a graph with over 60 million nodes and 1.8 billion edges in less than half an hour on a single machine.

Few-Shot Learning · Attention Mechanism · Graphics Processing Unit · GNN · Learning

July 14, 2020

Attentive Graph Neural Networks for Few-Shot Learning

Hao Cheng, Joey Tianyi Zhou, Wee Peng Tay, Bihan Wen

Graph Neural Networks (GNNs) have demonstrated superior performance in many challenging applications, including few-shot learning tasks. Despite their powerful capacity to learn and generalize from few samples, GNNs usually suffer from severe over-fitting and over-smoothing as the model becomes deep, which limits model scalability. In this work, we propose a novel Attentive GNN to tackle these challenges by incorporating a triple-attention mechanism, i.e., node self-attention, neighborhood attention, and layer memory attention. We explain why the proposed attentive modules can improve GNNs for few-shot learning with theoretical analysis and illustrations. Extensive experiments show that the proposed Attentive GNN outperforms the state-of-the-art GNN-based methods for few-shot learning over the mini-ImageNet and Tiered-ImageNet datasets, in both inductive and transductive settings.

Node Classification · Learning · GNN · Graph · Node

March 26, 2020

A Collective Learning Framework to Boost GNN Expressiveness

Mengyue Hang, Jennifer Neville, Bruno Ribeiro

Graph Neural Networks (GNNs) have recently been used for node and graph classification tasks with great success, but GNNs model dependencies among the attributes of nearby neighboring nodes rather than dependencies among observed node labels. In this work, we consider the task of inductive node classification using GNNs in supervised and semi-supervised settings, with the goal of incorporating label dependencies. Because current GNNs are not universal (i.e., most-expressive) graph representations, we propose a general collective learning approach to increase the representation power of any existing GNN. Our framework combines ideas from collective classification with self-supervised learning, and uses a Monte Carlo approach to sampling embeddings for inductive learning across graphs. We evaluate performance on five real-world network datasets and demonstrate consistent, significant improvement in node classification accuracy, for a variety of state-of-the-art GNNs.


