亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<dir id='eq2HA'><del id='rdGIy'><del id='KFAho'></del><pre id='hUwHg'><pre id='sRdDT'><option id='ZChUA'><address id='1fq75'></address><bdo id='EU9hp'><tr id='CLYkR'><acronym id='qNoFn'><pre id='zlJIU'></pre></acronym><div id='wnxXL'></div></tr></bdo></option></pre><small id='Z6VhH'><address id='EpEnU'><u id='xVosO'><legend id='R34WC'><option id='loMGa'><abbr id='OFRlH'></abbr><li id='rwkGz'><pre id='G1RMm'></pre></li></option></legend><select id='bjaU1'></select></u></address></small></pre></del><sup id='O7r5d'></sup><blockquote id='DuLvz'><dt id='Ksog3'></dt></blockquote><blockquote id='9HsQD'></blockquote></dir><tt id='aLS3h'></tt><u id='1lqJA'><tt id='co2Qw'><form id='SoXhd'></form></tt><td id='7UZgZ'><dt id='oMKUn'></dt></td></u>

<code id='qLrZN'><i id='LCEM0'><q id='vMjbz'><legend id='BhrYQ'><pre id='c0K1O'><style id='gYMws'><acronym id='X3r1t'><i id='HFVJt'><form id='grPxa'><option id='ESBZ3'><center id='RNSRk'></center></option></form></i></acronym></style><tt id='qbAlH'></tt></pre></legend></q></i></code><center id='X1ium'></center>

<dd id='ADf6e'></dd>

<style id='iH2rN'></style><sub id='bXiNh'><dfn id='TRDpS'><abbr id='92zni'><big id='pH49o'><bdo id='RdI64'></bdo></big></abbr></dfn></sub>_{<dir id='8pmTI'></dir>}

·

語言模型化 · 數據預處理 · MoDELS · 大語言模型 · GPT-4 ·

2024 年 10 月 27 日

Large Language Models as Data Preprocessors

Haochen Zhang,Yuyang Dong,Chuan Xiao,Masafumi Oyamada

from arxiv, TaDA 2024 (workshop in conjunction with VLDB 2024)

Large Language Models (LLMs), typified by OpenAI's GPT, have marked a significant advancement in artificial intelligence. Trained on vast amounts of text data, LLMs are capable of understanding and generating human-like text across a diverse range of topics. This study expands on the applications of LLMs, exploring their potential in data preprocessing, a critical stage in data mining and analytics applications. Aiming at tabular data, we delve into the applicability of state-of-the-art LLMs such as GPT-4 and GPT-4o for a series of preprocessing tasks, including error detection, data imputation, schema matching, and entity matching. Alongside showcasing the inherent capabilities of LLMs, we highlight their limitations, particularly in terms of computational expense and inefficiency. We propose an LLM-based framework for data preprocessing, which integrates cutting-edge prompt engineering techniques, coupled with traditional methods like contextualization and feature selection, to improve the performance and efficiency of these models. The effectiveness of LLMs in data preprocessing is evaluated through an experimental study spanning a variety of public datasets. GPT-4 emerged as a standout, achieving 100\% accuracy or F1 score on 4 of these datasets, suggesting LLMs' immense potential in these tasks. Despite certain limitations, our study underscores the promise of LLMs in this domain and anticipates future developments to overcome current hurdles.

相關內容

語言模型化

語言模型化

優化器 · AIM · REST · HER · INFORMS ·

2024 年 12 月 2 日

Optimal Strategies for Winning Certain Coset-Guessing Quantum Games

Michael Schleppy,Emina Soljanin,Nicolas Swanson

from arxiv, Submitted to JSAIT Special Issue on Quantum Error Correction and Fault Tolerance

In a recently introduced coset guessing game, Alice plays against Bob and Charlie, aiming to meet a joint winning condition. Bob and Charlie can only communicate before the game starts to devise a joint strategy. The game we consider begins with Alice preparing a 2m-qubit quantum state based on a random selection of three parameters. She sends the first m qubits to Bob and the rest to Charlie and then reveals to them her choice for one of the parameters. Bob is supposed to guess one of the hidden parameters, Charlie the other, and they win if both guesses are correct. From previous work, we know that the probability of Bob's and Charlie's guesses being simultaneously correct goes to zero exponentially as m increases. We derive a tight upper bound on this probability and show how Bob and Charlie can achieve it. While developing the optimal strategy, we devised an encoding circuit using only CNOT and Hadamard gates, which could be relevant for building efficient CSS-coded systems. We found that the role of quantum information that Alice communicates to Bob and Charlie is to make their responses correlated rather than improve their individual (marginal) correct guessing rates.

設計 · 泛函 · 代碼 · 語言模型化 · Processing（編程語言） ·

2024 年 12 月 2 日

Evaluating LLMs for Hardware Design and Test

Jason Blocklove,Siddharth Garg,Ramesh Karri,Hammond Pearce

Large Language Models (LLMs) have demonstrated capabilities for producing code in Hardware Description Languages (HDLs). However, most of the focus remains on their abilities to write functional code, not test code. The hardware design process consists of both design and test, and so eschewing validation and verification leaves considerable potential benefit unexplored, given that a design and test framework may allow for progress towards full automation of the digital design pipeline. In this work, we perform one of the first studies exploring how a LLM can both design and test hardware modules from provided specifications. Using a suite of 8 representative benchmarks, we examined the capabilities and limitations of the state-of-the-art conversational LLMs when producing Verilog for functional and verification purposes. We taped out the benchmarks on a Skywater 130nm shuttle and received the functional chip.

3D · 點云 · INFORMS · 表示 · 語言模型化 ·

2024 年 11 月 29 日

PerLA: Perceptive 3D Language Assistant

Guofeng Mei,Wei Lin,Luigi Riz,Yujiao Wu,Fabio Poiesi,Yiming Wang

Enabling Large Language Models (LLMs) to understand the 3D physical world is an emerging yet challenging research direction. Current strategies for processing point clouds typically downsample the scene or divide it into smaller parts for separate analysis. However, both approaches risk losing key local details or global contextual information. In this paper, we introduce PerLA, a 3D language assistant designed to be more perceptive to both details and context, making visual representations more informative for the LLM. PerLA captures high-resolution (local) details in parallel from different point cloud areas and integrates them with (global) context obtained from a lower-resolution whole point cloud. We present a novel algorithm that preserves point cloud locality through the Hilbert curve and effectively aggregates local-to-global information via cross-attention and a graph neural network. Lastly, we introduce a novel loss for local representation consensus to promote training stability. PerLA outperforms state-of-the-art 3D language assistants, with gains of up to +1.34 CiDEr on ScanQA for question answering, and +4.22 on ScanRefer and +3.88 on Nr3D for dense captioning.\url{//gfmei.github.io/PerLA/}

MCMC · 近似 · Performer · Markov · 馬爾可夫鏈 ·

2024 年 11 月 28 日

No Free Lunch for Approximate MCMC

James E. Johndrow,Natesh S. Pillai,Aaron Smith

It is widely known that the performance of Markov chain Monte Carlo (MCMC) can degrade quickly when targeting computationally expensive posterior distributions, such as when the sample size is large. This has motivated the search for MCMC variants that scale well to large datasets. One popular general approach has been to look at only a subsample of the data at every step. In this note, we point out that well-known MCMC convergence results often imply that these ``subsampling'' MCMC algorithms cannot greatly improve performance. We apply these abstract results to realistic statistical problems and proposed algorithms, and also discuss some design principles suggested by the results. Finally, we develop estimates for the singular values of random matrices bounds that may be of independent interest.

多峰值 · 語言模型化 · MoDELS · Performer · AIM ·

2023 年 6 月 23 日

A Survey on Multimodal Large Language Models

Shukang Yin,Chaoyou Fu,Sirui Zhao,Ke Li,Xing Sun,Tong Xu,Enhong Chen

from arxiv, Project page://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models

Multimodal Large Language Model (MLLM) recently has been a new rising research hotspot, which uses powerful Large Language Models (LLMs) as a brain to perform multimodal tasks. The surprising emergent capabilities of MLLM, such as writing stories based on images and OCR-free math reasoning, are rare in traditional methods, suggesting a potential path to artificial general intelligence. In this paper, we aim to trace and summarize the recent progress of MLLM. First of all, we present the formulation of MLLM and delineate its related concepts. Then, we discuss the key techniques and applications, including Multimodal Instruction Tuning (M-IT), Multimodal In-Context Learning (M-ICL), Multimodal Chain of Thought (M-CoT), and LLM-Aided Visual Reasoning (LAVR). Finally, we discuss existing challenges and point out promising research directions. In light of the fact that the era of MLLM has only just begun, we will keep updating this survey and hope it can inspire more research. An associated GitHub link collecting the latest papers is available at //github.com/BradyFU/Awesome-Multimodal-Large-Language-Models.

Capsule · 圖 · INFORMS · Networking · entity ·

2020 年 12 月 16 日

Hierarchical Graph Capsule Network

Jinyu Yang,Peilin Zhao,Yu Rong,Chaochao Yan,Chunyuan Li,Hehuan Ma,Junzhou Huang

from arxiv, AAAI 2021

Graph Neural Networks (GNNs) draw their strength from explicitly modeling the topological information of structured data. However, existing GNNs suffer from limited capability in capturing the hierarchical graph representation which plays an important role in graph classification. In this paper, we innovatively propose hierarchical graph capsule network (HGCN) that can jointly learn node embeddings and extract graph hierarchies. Specifically, disentangled graph capsules are established by identifying heterogeneous factors underlying each node, such that their instantiation parameters represent different properties of the same entity. To learn the hierarchical representation, HGCN characterizes the part-whole relationship between lower-level capsules (part) and higher-level capsules (whole) by explicitly considering the structure information among the parts. Experimental studies demonstrate the effectiveness of HGCN and the contribution of each component.

圖 · MoDELS · Continuity · 圖形處理器 · 隱藏層 ·

2020 年 6 月 7 日

Principal Neighbourhood Aggregation for Graph Nets

Gabriele Corso,Luca Cavalleri,Dominique Beaini,Pietro Liò,Petar Veli?kovi?

Graph Neural Networks (GNNs) have been shown to be effective models for different predictive tasks on graph-structured data. Recent work on their expressive power has focused on isomorphism tasks and countable feature spaces. We extend this theoretical framework to include continuous features - which occur regularly in real-world input domains and within the hidden layers of GNNs - and we demonstrate the requirement for multiple aggregation functions in this context. Accordingly, we propose Principal Neighbourhood Aggregation (PNA), a novel architecture combining multiple aggregators with degree-scalers (which generalize the sum aggregator). Finally, we compare the capacity of different models to capture and exploit the graph structure via a novel benchmark containing multiple tasks taken from classical graph theory, alongside existing benchmarks from real-world domains, all of which demonstrate the strength of our model. With this work, we hope to steer some of the GNN research towards new aggregation methods which we believe are essential in the search for powerful and robust models.

有向 · Better · Feel · 情景 · AI ·

2020 年 3 月 17 日

Directions for Explainable Knowledge-Enabled Systems

Shruthi Chari,Daniel M. Gruen,Oshani Seneviratne,Deborah L. McGuinness

from arxiv, S. Chari, D. M. Gruen, O. Seneviratne, D. L. McGuinness, "Directions for Explainable Knowledge-Enabled Systems". In: Ilaria Tiddi, Freddy Lecue, Pascal Hitzler (eds.), Knowledge Graphs for eXplainable AI -- Foundations, Applications and Challenges. Studies on the Semantic Web, IOS Press, Amsterdam, 2020, to appear

Interest in the field of Explainable Artificial Intelligence has been growing for decades and has accelerated recently. As Artificial Intelligence models have become more complex, and often more opaque, with the incorporation of complex machine learning techniques, explainability has become more critical. Recently, researchers have been investigating and tackling explainability with a user-centric focus, looking for explanations to consider trustworthiness, comprehensibility, explicit provenance, and context-awareness. In this chapter, we leverage our survey of explanation literature in Artificial Intelligence and closely related fields and use these past efforts to generate a set of explanation types that we feel reflect the expanded needs of explanation for today's artificial intelligence applications. We define each type and provide an example question that would motivate the need for this style of explanation. We believe this set of explanation types will help future system designers in their generation and prioritization of requirements and further help generate explanations that are better aligned to users' and situational needs.

標記傳播 · 圖卷積網絡 · 節點分類 · Weight · 結點 ·

2020 年 2 月 17 日

Unifying Graph Convolutional Neural Networks and Label Propagation

Hongwei Wang,Jure Leskovec

Label Propagation (LPA) and Graph Convolutional Neural Networks (GCN) are both message passing algorithms on graphs. Both solve the task of node classification but LPA propagates node label information across the edges of the graph, while GCN propagates and transforms node feature information. However, while conceptually similar, theoretical relation between LPA and GCN has not yet been investigated. Here we study the relationship between LPA and GCN in terms of two aspects: (1) feature/label smoothing where we analyze how the feature/label of one node is spread over its neighbors; And, (2) feature/label influence of how much the initial feature/label of one node influences the final feature/label of another node. Based on our theoretical analysis, we propose an end-to-end model that unifies GCN and LPA for node classification. In our unified model, edge weights are learnable, and the LPA serves as regularization to assist the GCN in learning proper edge weights that lead to improved classification performance. Our model can also be seen as learning attention weights based on node labels, which is more task-oriented than existing feature-based attention models. In a number of experiments on real-world graphs, our model shows superiority over state-of-the-art GCN-based methods in terms of node classification accuracy.

模式崩潰 · 對抗自編碼 · 自編碼器 · 峰值 · Better ·

2018 年 3 月 23 日

Generative Adversarial Autoencoder Networks

Ngoc-Trung Tran,Tuan-Anh Bui,Ngai-Man Cheung

We introduce an effective model to overcome the problem of mode collapse when training Generative Adversarial Networks (GAN). Firstly, we propose a new generator objective that finds it better to tackle mode collapse. And, we apply an independent Autoencoders (AE) to constrain the generator and consider its reconstructed samples as "real" samples to slow down the convergence of discriminator that enables to reduce the gradient vanishing problem and stabilize the model. Secondly, from mappings between latent and data spaces provided by AE, we further regularize AE by the relative distance between the latent and data samples to explicitly prevent the generator falling into mode collapse setting. This idea comes when we find a new way to visualize the mode collapse on MNIST dataset. To the best of our knowledge, our method is the first to propose and apply successfully the relative distance of latent and data samples for stabilizing GAN. Thirdly, our proposed model, namely Generative Adversarial Autoencoder Networks (GAAN), is stable and has suffered from neither gradient vanishing nor mode collapse issues, as empirically demonstrated on synthetic, MNIST, MNIST-1K, CelebA and CIFAR-10 datasets. Experimental results show that our method can approximate well multi-modal distribution and achieve better results than state-of-the-art methods on these benchmark datasets. Our model implementation is published here: //github.com/tntrung/gaan

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

語言模型化

數據預處理

大語言模型

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='aMFXQ'></tfoot>

<legend id='dAESq'><style id='Th9T6'><dir id='EjQLv'><q id='J5vIZ'></q></dir></style></legend>

<i id='353Xp'><tr id='Tg1Ll'><dt id='j8zRV'><q id='Cc0d7'><span id='vaQk9'><b id='oVyKW'><form id='8qTu9'><ins id='j3a0I'></ins><ul id='BDotW'></ul><sub id='abs4W'></sub></form><legend id='kmfsq'></legend><bdo id='3oSYa'><pre id='8SpZn'><center id='4nfSL'></center></pre></bdo></b><th id='KcaFE'></th></span></q></dt></tr></i><div id='hmJXy'><tfoot id='g1u2X'></tfoot><dl id='FoBmC'><fieldset id='TVaMQ'></fieldset></dl></div>