亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<dir id='0axwj'><del id='0axwj'><del id='0axwj'></del><pre id='0axwj'><pre id='0axwj'><option id='0axwj'><address id='0axwj'></address><bdo id='0axwj'><tr id='0axwj'><acronym id='0axwj'><pre id='0axwj'></pre></acronym><div id='0axwj'></div></tr></bdo></option></pre><small id='0axwj'><address id='0axwj'><u id='0axwj'><legend id='0axwj'><option id='0axwj'><abbr id='0axwj'></abbr><li id='0axwj'><pre id='0axwj'></pre></li></option></legend><select id='0axwj'></select></u></address></small></pre></del><sup id='0axwj'></sup><blockquote id='0axwj'><dt id='0axwj'></dt></blockquote><blockquote id='0axwj'></blockquote></dir><tt id='0axwj'></tt><u id='0axwj'><tt id='0axwj'><form id='0axwj'></form></tt><td id='0axwj'><dt id='0axwj'></dt></td></u>

<code id='0axwj'><i id='0axwj'><q id='0axwj'><legend id='0axwj'><pre id='0axwj'><style id='0axwj'><acronym id='0axwj'><i id='0axwj'><form id='0axwj'><option id='0axwj'><center id='0axwj'></center></option></form></i></acronym></style><tt id='0axwj'></tt></pre></legend></q></i></code><center id='0axwj'></center>

<dd id='0axwj'></dd>

<style id='0axwj'></style><sub id='0axwj'><dfn id='0axwj'><abbr id='0axwj'><big id='0axwj'><bdo id='0axwj'></bdo></big></abbr></dfn></sub>_{<dir id='0axwj'></dir>}

·

優化器 · MoDELS · 可辨認的 · 語言模型化 · 可理解性 ·

2023 年 8 月 23 日

Diagnosing Infeasible Optimization Problems Using Large Language Models

Hao Chen,Gonzalo E. Constante-Flores,Can Li

Decision-making problems can be represented as mathematical optimization models, finding wide applications in fields such as economics, engineering and manufacturing, transportation, and health care. Optimization models are mathematical abstractions of the problem of making the best decision while satisfying a set of requirements or constraints. One of the primary barriers to deploying these models in practice is the challenge of helping practitioners understand and interpret such models, particularly when they are infeasible, meaning no decision satisfies all the constraints. Existing methods for diagnosing infeasible optimization models often rely on expert systems, necessitating significant background knowledge in optimization. In this paper, we introduce OptiChat, a first-of-its-kind natural language-based system equipped with a chatbot GUI for engaging in interactive conversations about infeasible optimization models. OptiChat can provide natural language descriptions of the optimization model itself, identify potential sources of infeasibility, and offer suggestions to make the model feasible. The implementation of OptiChat is built on GPT-4, which interfaces with an optimization solver to identify the minimal subset of constraints that render the entire optimization problem infeasible, also known as the Irreducible Infeasible Subset (IIS). We utilize few-shot learning, expert chain-of-thought, key-retrieve, and sentiment prompts to enhance OptiChat's reliability. Our experiments demonstrate that OptiChat assists both expert and non-expert users in improving their understanding of the optimization models, enabling them to quickly identify the sources of infeasibility.

相關內容

優化器

控制器 · 優化器 · Performer · 樣本 · 查準率/準確率 ·

2023 年 10 月 11 日

Controllable Data Generation Via Iterative Data-Property Mutual Mappings

Bo Pan,Muran Qin,Shiyu Wang,Yifei Zhang,Liang Zhao

Deep generative models have been widely used for their ability to generate realistic data samples in various areas, such as images, molecules, text, and speech. One major goal of data generation is controllability, namely to generate new data with desired properties. Despite growing interest in the area of controllable generation, significant challenges still remain, including 1) disentangling desired properties with unrelated latent variables, 2) out-of-distribution property control, and 3) objective optimization for out-of-distribution property control. To address these challenges, in this paper, we propose a general framework to enhance VAE-based data generators with property controllability and ensure disentanglement. Our proposed objective can be optimized on both data seen and unseen in the training set. We propose a training procedure to train the objective in a semi-supervised manner by iteratively conducting mutual mappings between the data and properties. The proposed framework is implemented on four VAE-based controllable generators to evaluate its performance on property error, disentanglement, generation quality, and training time. The results indicate that our proposed framework enables more precise control over the properties of generated samples in a short training time, ensuring the disentanglement and keeping the validity of the generated samples.

語言模型化 · MoDELS · Continuity · Performer · Better ·

2023 年 10 月 11 日

Evaluating Large Language Models at Evaluating Instruction Following

Zhiyuan Zeng,Jiatong Yu,Tianyu Gao,Yu Meng,Tanya Goyal,Danqi Chen

from arxiv, Under review

As research in large language models (LLMs) continues to accelerate, LLM-based evaluation has emerged as a scalable and cost-effective alternative to human evaluations for comparing the ever increasing list of models. This paper investigates the efficacy of these "LLM evaluators", particularly in using them to assess instruction following, a metric that gauges how closely generated text adheres to the given instruction. We introduce a challenging meta-evaluation benchmark, LLMBar, designed to test the ability of an LLM evaluator in discerning instruction-following outputs. The authors manually curated 419 pairs of outputs, one adhering to instructions while the other diverging, yet may possess deceptive qualities that mislead an LLM evaluator, e.g., a more engaging tone. Contrary to existing meta-evaluation, we discover that different evaluators (i.e., combinations of LLMs and prompts) exhibit distinct performance on LLMBar and even the highest-scoring ones have substantial room for improvement. We also present a novel suite of prompting strategies that further close the gap between LLM and human evaluators. With LLMBar, we hope to offer more insight into LLM evaluators and foster future research in developing better instruction-following models.

廣義函數 · 泛函 · INFORMS · 累積分布函數 · Extensibility ·

2023 年 10 月 10 日

Cumulative Information Generating Function and Generalized Gini Functions

Marco Capaldo,Antonio Di Crescenzo,Alessandra Meoli

from arxiv, 25 pages, 1 figure, revision submitted on September 19, 2023

We introduce and study the cumulative information generating function, which provides a unifying mathematical tool suitable to deal with classical and fractional entropies based on the cumulative distribution function and on the survival function. Specifically, after establishing its main properties and some bounds, we show that it is a variability measure itself that extends the Gini mean semi-difference. We also provide (i) an extension of such a measure, based on distortion functions, and (ii) a weighted version based on a mixture distribution. Furthermore, we explore some connections with the reliability of $k$-out-of-$n$ systems and with stress-strength models for multi-component systems. Also, we address the problem of extending the cumulative information generating function to higher dimensions.

核化 · 圖 · 估計/估計量 · 泛函 · Learning ·

2023 年 10 月 10 日

Universal Graph Random Features

Isaac Reid,Krzysztof Choromanski,Eli Berger,Adrian Weller

We propose a novel random walk-based algorithm for unbiased estimation of arbitrary functions of a weighted adjacency matrix, coined universal graph random features (u-GRFs). This includes many of the most popular examples of kernels defined on the nodes of a graph. Our algorithm enjoys subquadratic time complexity with respect to the number of nodes, overcoming the notoriously prohibitive cubic scaling of exact graph kernel evaluation. It can also be trivially distributed across machines, permitting learning on much larger networks. At the heart of the algorithm is a modulation function which upweights or downweights the contribution from different random walks depending on their lengths. We show that by parameterising it with a neural network we can obtain u-GRFs that give higher-quality kernel estimates or perform efficient, scalable kernel learning. We provide robust theoretical analysis and support our findings with experiments including pointwise estimation of fixed graph kernels, solving non-homogeneous graph ordinary differential equations, node clustering and kernel regression on triangular meshes.

Learning · binary · 二分類 · 在線 · 批量學習 ·

2023 年 10 月 10 日

Quantum Learning Theory Beyond Batch Binary Classification

Preetham Mohan,Ambuj Tewari

from arxiv, 26 pages, 2 figures, 2 tables; v3: tightens expected regret bounds in various settings under quantum online learning (Section 5); adds a figure to illustrate the quantum circuit used for reduction in Appendix A; incorporates more recent work (as compared to v2); provides explicit open problems in Sections 5.4 and 6;

Arunachalam and de Wolf (2018) showed that the sample complexity of quantum batch learning of boolean functions, in the realizable and agnostic settings, has the same form and order as the corresponding classical sample complexities. In this paper, we extend this, ostensibly surprising, message to batch multiclass learning, online boolean learning, and online multiclass learning. For our online learning results, we first consider an adaptive adversary variant of the classical model of Dawid and Tewari (2022). Then, we introduce the first (to the best of our knowledge) model of online learning with quantum examples.

Learning · SimPLe · 貢獻度分配問題 · Markov · INFORMS ·

2023 年 10 月 9 日

Swarm Reinforcement Learning For Adaptive Mesh Refinement

Niklas Freymuth,Philipp Dahlinger,Tobias Würth,Simon Reisch,Luise K?rger,Gerhard Neumann

from arxiv, Accepted at Neural Information Processing Systems (NeurIPS) 2023. Version 1 of this paper is a preliminary version that was accepted as a workshop paper in the International Conference on Learning Representations (ICLR) 2023 Workshop on Physics for Machine Learning

Adaptive Mesh Refinement (AMR) enhances the Finite Element Method, an important technique for simulating complex problems in engineering, by dynamically refining mesh regions, enabling a favorable trade-off between computational speed and simulation accuracy. Classical methods for AMR depend on heuristics or expensive error estimators, hindering their use for complex simulations. Recent learning-based AMR methods tackle these issues, but so far scale only to simple toy examples. We formulate AMR as a novel Adaptive Swarm Markov Decision Process in which a mesh is modeled as a system of simple collaborating agents that may split into multiple new agents. This framework allows for a spatial reward formulation that simplifies the credit assignment problem, which we combine with Message Passing Networks to propagate information between neighboring mesh elements. We experimentally validate our approach, Adaptive Swarm Mesh Refinement (ASMR), on challenging refinement tasks. Our approach learns reliable and efficient refinement strategies that can robustly generalize to different domains during inference. Additionally, it achieves a speedup of up to $2$ orders of magnitude compared to uniform refinements in more demanding simulations. We outperform learned baselines and heuristics, achieving a refinement quality that is on par with costly error-based oracle AMR strategies.

Metal · 前向 · 優化器 · 變換 · 可約的 ·

2023 年 10 月 6 日

Gradient Descent Provably Solves Nonlinear Tomographic Reconstruction

Sara Fridovich-Keil,Fabrizio Valdivia,Gordon Wetzstein,Benjamin Recht,Mahdi Soltanolkotabi

In computed tomography (CT), the forward model consists of a linear Radon transform followed by an exponential nonlinearity based on the attenuation of light according to the Beer-Lambert Law. Conventional reconstruction often involves inverting this nonlinearity as a preprocessing step and then solving a convex inverse problem. However, this nonlinear measurement preprocessing required to use the Radon transform is poorly conditioned in the vicinity of high-density materials, such as metal. This preprocessing makes CT reconstruction methods numerically sensitive and susceptible to artifacts near high-density regions. In this paper, we study a technique where the signal is directly reconstructed from raw measurements through the nonlinear forward model. Though this optimization is nonconvex, we show that gradient descent provably converges to the global optimum at a geometric rate, perfectly reconstructing the underlying signal with a near minimal number of random measurements. We also prove similar results in the under-determined setting where the number of measurements is significantly smaller than the dimension of the signal. This is achieved by enforcing prior structural information about the signal through constraints on the optimization variables. We illustrate the benefits of direct nonlinear CT reconstruction with cone-beam CT experiments on synthetic and real 3D volumes. We show that this approach reduces metal artifacts compared to a commercial reconstruction of a human skull with metal dental crowns.

Extensibility · INFORMS · INTERACT · Principle · prototype ·

2023 年 10 月 5 日

Trustworthy Formal Natural Language Specifications

Colin S. Gordon,Sergey Matskevich

from arxiv, arXiv admin note: substantial text overlap with arXiv:2205.07811

Interactive proof assistants are computer programs carefully constructed to check a human-designed proof of a mathematical claim with high confidence in the implementation. However, this only validates truth of a formal claim, which may have been mistranslated from a claim made in natural language. This is especially problematic when using proof assistants to formally verify the correctness of software with respect to a natural language specification. The translation from informal to formal remains a challenging, time-consuming process that is difficult to audit for correctness. This paper shows that it is possible to build support for specifications written in expressive subsets of natural language, within existing proof assistants, consistent with the principles used to establish trust and auditability in proof assistants themselves. We implement a means to provide specifications in a modularly extensible formal subset of English, and have them automatically translated into formal claims, entirely within the Lean proof assistant. Our approach is extensible (placing no permanent restrictions on grammatical structure), modular (allowing information about new words to be distributed alongside libraries), and produces proof certificates explaining how each word was interpreted and how the sentence's structure was used to compute the meaning. We apply our prototype to the translation of various English descriptions of formal specifications from a popular textbook into Lean formalizations; all can be translated correctly with a modest lexicon with only minor modifications related to lexicon size.

圖卷積神經網絡/圖卷積網絡 · 圖 · entity · 圖卷積 · 卷積 ·

2021 年 4 月 23 日

Knowledge Embedding Based Graph Convolutional Network

Donghan Yu,Yiming Yang,Ruohong Zhang,Yuexin Wu

from arxiv, WWW 2021

Recently, a considerable literature has grown up around the theme of Graph Convolutional Network (GCN). How to effectively leverage the rich structural information in complex graphs, such as knowledge graphs with heterogeneous types of entities and relations, is a primary open challenge in the field. Most GCN methods are either restricted to graphs with a homogeneous type of edges (e.g., citation links only), or focusing on representation learning for nodes only instead of jointly propagating and updating the embeddings of both nodes and edges for target-driven objectives. This paper addresses these limitations by proposing a novel framework, namely the Knowledge Embedding based Graph Convolutional Network (KE-GCN), which combines the power of GCNs in graph-based belief propagation and the strengths of advanced knowledge embedding (a.k.a. knowledge graph embedding) methods, and goes beyond. Our theoretical analysis shows that KE-GCN offers an elegant unification of several well-known GCN methods as specific cases, with a new perspective of graph convolution. Experimental results on benchmark datasets show the advantageous performance of KE-GCN over strong baseline methods in the tasks of knowledge graph alignment and entity classification.

長短期記憶網絡 · 命名實體識別 · MoDELS · Better · 門控 ·

2018 年 5 月 15 日

Chinese NER Using Lattice LSTM

Yue Zhang,Jie Yang

from arxiv, Accepted at ACL 2018 as Long paper

We investigate a lattice-structured LSTM model for Chinese NER, which encodes a sequence of input characters as well as all potential words that match a lexicon. Compared with character-based methods, our model explicitly leverages word and word sequence information. Compared with word-based methods, lattice LSTM does not suffer from segmentation errors. Gated recurrent cells allow our model to choose the most relevant characters and words from a sentence for better NER results. Experiments on various datasets show that lattice LSTM outperforms both word-based and character-based LSTM baselines, achieving the best results.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

語言模型化

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='MPemE'></tfoot>

<legend id='vuVbu'><style id='FPo0c'><dir id='k6hAD'><q id='wOI3X'></q></dir></style></legend>

<i id='K5kG1'><tr id='Ih29n'><dt id='Fn6vi'><q id='xI9yi'><span id='oUGNV'><b id='q2o3g'><form id='p9ao6'><ins id='Vd6F5'></ins><ul id='vZla6'></ul><sub id='DayBa'></sub></form><legend id='WbEsk'></legend><bdo id='1xGUK'><pre id='R4939'><center id='GnQAx'></center></pre></bdo></b><th id='tNDEX'></th></span></q></dt></tr></i><div id='iIaEU'><tfoot id='r2XBu'></tfoot><dl id='oggIU'><fieldset id='cFMu0'></fieldset></dl></div>

<li id='1wFR7'><abbr id='lVEtW'></abbr></li>