在线亚洲91SE亚洲综合在线_欧美成人亚洲国产中文精品_日日噜噜夜夜狠狠视频无韩无码_黄色小视频网站在线免费观看_国产高清视频A在线大全_一区二区三区观看_日韩国产欧美综合

Recent development of large language models (LLMs), such as ChatGPT has been widely applied to a wide range of software engineering tasks. Many papers have reported their analysis on the potential advantages and limitations of ChatGPT for writing code, summarization, text generation, etc. However, the analysis of the current state of ChatGPT for log processing has received little attention. Logs generated by large-scale software systems are complex and hard to understand. Despite their complexity, they provide crucial information for subject matter experts to understand the system status and diagnose problems of the systems. In this paper, we investigate the current capabilities of ChatGPT to perform several interesting tasks on log data, while also trying to identify its main shortcomings. Our findings show that the performance of the current version of ChatGPT for log processing is limited, with a lack of consistency in responses and scalability issues. We also outline our views on how we perceive the role of LLMs in the log processing discipline and possible next steps to improve the current capabilities of ChatGPT and the future LLMs in this area. We believe our work can contribute to future academic research to address the identified issues.

相關內容

ChatGPT

關注 257

ChatGPT（全(quan)名：Chat Generative Pre-trained Transformer），美國OpenAI 研發(fa)(fa)的(de)聊(liao)天機器人(ren)程序 [1] ，于2022年(nian)11月30日(ri)發(fa)(fa)布(bu) 。ChatGPT是(shi)人(ren)工(gong)智能(neng)技術驅動的(de)自(zi)然(ran)語言處(chu)理(li)工(gong)具，它能(neng)夠通過學習和(he)理(li)解人(ren)類的(de)語言來進行對(dui)話，還(huan)能(neng)根據聊(liao)天的(de)上下文(wen)進行互動，真正(zheng)像人(ren)類一樣(yang)來聊(liao)天交流，甚至能(neng)完成撰寫郵件、視頻腳本、文(wen)案、翻(fan)譯、代碼，寫論(lun)文(wen)任務。 [1] //openai.com/blog/chatgpt/

語言模型化 · 標準差 · MoDELS · 優化器 · 極大 ·

2023 年 10 月 31 日

Vanishing Gradients in Reinforcement Finetuning of Language Models

Noam Razin,Hattie Zhou,Omid Saremi,Vimal Thilak,Arwen Bradley,Preetum Nakkiran,Joshua Susskind,Etai Littwin

Pretrained language models are commonly aligned with human preferences and downstream tasks via reinforcement finetuning (RFT), which entails maximizing a (possibly learned) reward function using policy gradient algorithms. This work highlights a fundamental optimization obstacle in RFT: we prove that the expected gradient for an input vanishes when its reward standard deviation under the model is small, even if the expected reward is far from optimal. Through experiments on an RFT benchmark and controlled environments, as well as a theoretical analysis, we then demonstrate that vanishing gradients due to small reward standard deviation are prevalent and detrimental, leading to extremely slow reward maximization. Lastly, we explore ways to overcome vanishing gradients in RFT. We find the common practice of an initial supervised finetuning (SFT) phase to be the most promising candidate, which sheds light on its importance in an RFT pipeline. Moreover, we show that a relatively small number of SFT optimization steps on as few as 1% of the input samples can suffice, indicating that the initial SFT phase need not be expensive in terms of compute and data labeling efforts. Overall, our results emphasize that being mindful for inputs whose expected gradient vanishes, as measured by the reward standard deviation, is crucial for successful execution of RFT.

變換 · Performer · 組合性 · MoDELS · dynamic programming ·

2023 年 10 月 31 日

Faith and Fate: Limits of Transformers on Compositionality

Nouha Dziri,Ximing Lu,Melanie Sclar,Xiang Lorraine Li,Liwei Jiang,Bill Yuchen Lin,Peter West,Chandra Bhagavatula,Ronan Le Bras,Jena D. Hwang,Soumya Sanyal,Sean Welleck,Xiang Ren,Allyson Ettinger,Zaid Harchaoui,Yejin Choi

from arxiv, 10 pages + appendix (40 pages)

Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on surprisingly trivial problems. This begs the question: Are these errors incidental, or do they signal more substantial limitations? In an attempt to demystify transformer LLMs, we investigate the limits of these models across three representative compositional tasks -- multi-digit multiplication, logic grid puzzles, and a classic dynamic programming problem. These tasks require breaking problems down into sub-steps and synthesizing these steps into a precise answer. We formulate compositional tasks as computation graphs to systematically quantify the level of complexity, and break down reasoning steps into intermediate sub-procedures. Our empirical findings suggest that transformer LLMs solve compositional tasks by reducing multi-step compositional reasoning into linearized subgraph matching, without necessarily developing systematic problem-solving skills. To round off our empirical study, we provide theoretical arguments on abstract multi-step reasoning problems that highlight how autoregressive generations' performance can rapidly decay with\,increased\,task\,complexity.

潛變量/隱變量 · 可辨認的 · 潛在 · MoDELS · 觀測變量 ·

2023 年 10 月 31 日

Identification of Nonlinear Latent Hierarchical Models

Lingjing Kong,Biwei Huang,Feng Xie,Eric Xing,Yuejie Chi,Kun Zhang

from arxiv, NeurIPS 2023

Identifying latent variables and causal structures from observational data is essential to many real-world applications involving biological data, medical data, and unstructured data such as images and languages. However, this task can be highly challenging, especially when observed variables are generated by causally related latent variables and the relationships are nonlinear. In this work, we investigate the identification problem for nonlinear latent hierarchical causal models in which observed variables are generated by a set of causally related latent variables, and some latent variables may not have observed children. We show that the identifiability of causal structures and latent variables (up to invertible transformations) can be achieved under mild assumptions: on causal structures, we allow for multiple paths between any pair of variables in the graph, which relaxes latent tree assumptions in prior work; on structural functions, we permit general nonlinearity and multi-dimensional continuous variables, alleviating existing work's parametric assumptions. Specifically, we first develop an identification criterion in the form of novel identifiability guarantees for an elementary latent variable model. Leveraging this criterion, we show that both causal structures and latent variables of the hierarchical model can be identified asymptotically by explicitly constructing an estimation procedure. To the best of our knowledge, our work is the first to establish identifiability guarantees for both causal structures and latent variables in nonlinear latent hierarchical models.

穩健性 · MoDELS · Lipschitz · Analysis · Machine Learning ·

2023 年 10 月 30 日

On the Robustness of Removal-Based Feature Attributions

Chris Lin,Ian Covert,Su-In Lee

from arxiv, NeurIPS camera-ready version

To explain predictions made by complex machine learning models, many feature attribution methods have been developed that assign importance scores to input features. Some recent work challenges the robustness of these methods by showing that they are sensitive to input and model perturbations, while other work addresses this issue by proposing robust attribution methods. However, previous work on attribution robustness has focused primarily on gradient-based feature attributions, whereas the robustness of removal-based attribution methods is not currently well understood. To bridge this gap, we theoretically characterize the robustness properties of removal-based feature attributions. Specifically, we provide a unified analysis of such methods and derive upper bounds for the difference between intact and perturbed attributions, under settings of both input and model perturbations. Our empirical results on synthetic and real-world data validate our theoretical results and demonstrate their practical implications, including the ability to increase attribution robustness by improving the model's Lipschitz regularity.

Analysis · SPL · 可約的 · Cognition · 情景 ·

2023 年 10 月 30 日

Comprehending Variability in Analysis Results of Software Product Lines

Rafael F. Toledo,Joanne M. Atlee,Rui Ming Xiong

Analyses of a software product line (SPL) typically report variable results that are annotated with logical expressions indicating the set of product variants for which the results hold. These expressions can get complicated and difficult to reason about when the SPL has lots of features and product variants. Previous work introduced a visualizer that supports filters for highlighting the analysis results that apply to product variants of interest, but this work was weakly evaluated. In this paper, we report on a controlled user study that evaluates the effectiveness of this new visualizer in helping the user search variable results and compare the results of multiple variants. Our findings indicate that the use of the new visualizer significantly improves the correctness and efficiency of the user's work and reduces the user's cognitive load in working with variable results.

Learning · INFORMS · 無監督 · 獎勵函數 · 泛函 ·

2023 年 10 月 30 日

Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills

Seongun Kim,Kyowoon Lee,Jaesik Choi

from arxiv, ICML 2023. First two authors contributed equally. Code at //github.com/seongun-kim/vcrl

Mutual information-based reinforcement learning (RL) has been proposed as a promising framework for retrieving complex skills autonomously without a task-oriented reward function through mutual information (MI) maximization or variational empowerment. However, learning complex skills is still challenging, due to the fact that the order of training skills can largely affect sample efficiency. Inspired by this, we recast variational empowerment as curriculum learning in goal-conditioned RL with an intrinsic reward function, which we name Variational Curriculum RL (VCRL). From this perspective, we propose a novel approach to unsupervised skill discovery based on information theory, called Value Uncertainty Variational Curriculum (VUVC). We prove that, under regularity conditions, VUVC accelerates the increase of entropy in the visited states compared to the uniform curriculum. We validate the effectiveness of our approach on complex navigation and robotic manipulation tasks in terms of sample efficiency and state coverage speed. We also demonstrate that the skills discovered by our method successfully complete a real-world robot navigation task in a zero-shot setup and that incorporating these skills with a global planner further increases the performance.

約束 · 最優化 · 值域 · MoDELS · Extensibility ·

2023 年 10 月 29 日

Neural Fields with Hard Constraints of Arbitrary Differential Order

Fangcheng Zhong,Kyle Fogarty,Param Hanji,Tianhao Wu,Alejandro Sztrajman,Andrew Spielberg,Andrea Tagliasacchi,Petra Bosilj,Cengiz Oztireli

from arxiv, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

While deep learning techniques have become extremely popular for solving a broad range of optimization problems, methods to enforce hard constraints during optimization, particularly on deep neural networks, remain underdeveloped. Inspired by the rich literature on meshless interpolation and its extension to spectral collocation methods in scientific computing, we develop a series of approaches for enforcing hard constraints on neural fields, which we refer to as Constrained Neural Fields (CNF). The constraints can be specified as a linear operator applied to the neural field and its derivatives. We also design specific model representations and training strategies for problems where standard models may encounter difficulties, such as conditioning of the system, memory consumption, and capacity of the network when being constrained. Our approaches are demonstrated in a wide range of real-world applications. Additionally, we develop a framework that enables highly efficient model and constraint specification, which can be readily applied to any downstream task where hard constraints need to be explicitly satisfied during optimization.

標注 · 相似度 · 列 · Engineering · 相似度度量 ·

2023 年 10 月 27 日

Matching of Descriptive Labels to Glossary Descriptions

Toshihiro Takahashi,Takaaki Tateishi,Michiaki Tatsubori

Semantic text similarity plays an important role in software engineering tasks in which engineers are requested to clarify the semantics of descriptive labels (e.g., business terms, table column names) that are often consists of too short or too generic words and appears in their IT systems. We formulate this type of problem as a task of matching descriptive labels to glossary descriptions. We then propose a framework to leverage an existing semantic text similarity measurement (STS) and augment it using semantic label enrichment and set-based collective contextualization where the former is a method to retrieve sentences relevant to a given label and the latter is a method to compute similarity between two contexts each of which is derived from a set of texts (e.g., column names in the same table). We performed an experiment on two datasets derived from publicly available data sources. The result indicated that the proposed methods helped the underlying STS correctly match more descriptive labels with the descriptions.

Automator · Learning · MoDELS · 模型評估 · 數據集 ·

2023 年 10 月 26 日

Systematic Evaluation of Deep Learning Models for Failure Prediction

Fatemeh Hadadi,Joshua H. Dawes,Donghwan Shin,Domenico Bianculli,Lionel Briand

With the increasing complexity and scope of software systems, their dependability is crucial. The analysis of log data recorded during system execution can enable engineers to automatically predict failures at run time. Several Machine Learning (ML) techniques, including traditional ML and Deep Learning (DL), have been proposed to automate such tasks. However, current empirical studies are limited in terms of covering all main DL types -- Recurrent Neural Network (RNN), Convolutional Neural network (CNN), and transformer -- as well as examining them on a wide range of diverse datasets. In this paper, we aim to address these issues by systematically investigating the combination of log data embedding strategies and DL types for failure prediction. To that end, we propose a modular architecture to accommodate various configurations of embedding strategies and DL-based encoders. To further investigate how dataset characteristics such as dataset size and failure percentage affect model accuracy, we synthesised 360 datasets, with varying characteristics, for three distinct system behavioral models, based on a systematic and automated generation approach. Using the F1 score metric, our results show that the best overall performing configuration is a CNN-based encoder with Logkey2vec. Additionally, we provide specific dataset conditions, namely a dataset size >350 or a failure percentage >7.5%, under which this configuration demonstrates high accuracy for failure prediction.

Machine Translation · NMT · Performer · state-of-the-art · 學成 ·

2018 年 6 月 1 日

A Survey of Domain Adaptation for Neural Machine Translation

Chenhui Chu,Rui Wang

from arxiv, COLING 2018, 16 pages, 9 figures

Neural machine translation (NMT) is a deep learning based approach for machine translation, which yields the state-of-the-art translation performance in scenarios where large-scale parallel corpora are available. Although the high-quality and domain-specific translation is crucial in the real world, domain-specific corpora are usually scarce or nonexistent, and thus vanilla NMT performs poorly in such scenarios. Domain adaptation that leverages both out-of-domain parallel corpora as well as monolingual corpora for in-domain translation, is very important for domain-specific translation. In this paper, we give a comprehensive survey of the state-of-the-art domain adaptation techniques for NMT.