The extraction of a small number of relevant insights from vast amounts of data is a crucial component of data-driven decision-making. However, accomplishing this task requires considerable technical skills, domain expertise, and human labor. This study explores the potential of using Large Language Models (LLMs) to automate the discovery of insights in data, leveraging recent advances in reasoning and code generation techniques. We propose a new evaluation methodology based on a "capture the flag" principle, measuring the ability of such models to recognize meaningful and pertinent information (flags) in a dataset. We further propose two proof-of-concept agents, with different inner workings, and compare their ability to capture such flags in a real-world sales dataset. While the work reported here is preliminary, our results are sufficiently interesting to mandate future exploration by the community.

Related content

Large Language Models

Large language models are deep learning models trained on massive amounts of text data. They can not only generate natural-language text but also understand its meaning in depth and handle a variety of natural-language tasks such as summarization, question answering, and translation. In 2023, large language models and their applications in artificial intelligence became a focus of technology research worldwide. Their growth in scale was especially striking: parameter counts rose from a little over one billion initially to about one trillion today. More parameters let models capture the subtleties of human language more finely and understand its complexity more deeply. Over the past year, large language models improved markedly in absorbing new knowledge, decomposing complex tasks, and aligning images with text. As the technology matures, it will keep extending its range of applications, providing more intelligent and personalized services and further improving how people live and work.

Full · Identifiable · MoDELS · Statistic · Joint distribution ·

February 8, 2024

Full Law Identification under Missing Data in the Categorical Colluder Model

Santtu Tikka,Juha Karvanen

Missing data may be disastrous for the identifiability of causal and statistical estimands. In graphical missing data models, colluders are dependence structures that have a special importance for identification considerations. It has been shown that the presence of a colluder makes the full law, i.e., the joint distribution of variables and response indicators, non-parametrically non-identifiable. However, with additional mild assumptions regarding the variables involved with the colluder structure, identifiability is regained. We present a necessary and sufficient condition for the identification of the full law in the presence of a colluder structure with arbitrary categorical variables.

Controller · Extensibility · Modality · INFORMS · Guidance ·

February 8, 2024

TextFusion: Unveiling the Power of Textual Semantics for Controllable Image Fusion

Chunyang Cheng,Tianyang Xu,Xiao-Jun Wu,Hui Li,Xi Li,Zhangyong Tang,Josef Kittler

from arxiv, v2 version, 13 pages, 16 figures, with the code repository link

Advanced image fusion methods are devoted to generating the fusion results by aggregating the complementary information conveyed by the source images. However, the difference in the source-specific manifestation of the imaged scene content makes it difficult to design a robust and controllable fusion process. We argue that this issue can be alleviated with the help of higher-level semantics, conveyed by the text modality, which should enable us to generate fused images for different purposes, such as visualisation and downstream tasks, in a controllable way. This is achieved by exploiting a vision-and-language model to build a coarse-to-fine association mechanism between the text and image signals. With the guidance of the association maps, an affine fusion unit is embedded in the transformer network to fuse the text and vision modalities at the feature level. As another ingredient of this work, we propose the use of textual attention to adapt image quality assessment to the fusion task. To facilitate the implementation of the proposed text-guided fusion paradigm, and its adoption by the wider research community, we release a text-annotated image fusion dataset IVT. Extensive experiments demonstrate that our approach (TextFusion) consistently outperforms traditional appearance-based fusion methods. Our code and dataset will be publicly available at //github.com/AWCXV/TextFusion.

Networking · Neural Networks · DNN · Linear · MoDELS ·

February 8, 2024

NeuralMatrix: Compute the Entire Neural Networks with Linear Matrix Operations for Efficient Inference

Ruiqi Sun,Siwei Ye,Jie Zhao,Xin He,Yiran Li,An Zou

from arxiv, 11 pages, 6 figures, Submitted to the 41st International Conference on Machine Learning

The inherent diversity of computation types within individual Deep Neural Network (DNN) models imposes a corresponding need for a varied set of computation units within hardware processors. This diversity poses a significant constraint on computation efficiency during the execution of different neural networks. In this study, we present NeuralMatrix, a framework that transforms the computation of entire DNNs into linear matrix operations. This transformation seamlessly enables the execution of various DNN models using a single General-Purpose Matrix Multiplication (GEMM) accelerator. Extensive experimental results spanning different DNN models demonstrate that our approach preserves network accuracy while providing both generality and application-specific levels of computation efficiency. This allows a broad spectrum of DNN models to be executed using a single GEMM accelerator, eliminating the need for additional special function units.

Gaussian distribution · MoDELS · Linear · CASE · Transformation ·

February 8, 2024

Distribution-on-Distribution Regression with Wasserstein Metric: Multivariate Gaussian Case

Ryo Okano,Masaaki Imaizumi

from arxiv, 34 pages

Distribution data refers to a data set where each sample is represented as a probability distribution, a subject area receiving burgeoning interest in the field of statistics. Although several studies have developed distribution-to-distribution regression models for univariate variables, the multivariate scenario remains under-explored due to technical complexities. In this study, we introduce models for regression from one Gaussian distribution to another, utilizing the Wasserstein metric. These models are constructed using the geometry of the Wasserstein space, which enables the transformation of Gaussian distributions into components of a linear matrix space. Owing to their linear regression frameworks, our models are intuitively understandable, and their implementation is simplified because of the optimal transport problem's analytical solution between Gaussian distributions. We also explore a generalization of our models to encompass non-Gaussian scenarios. We establish the convergence rates of in-sample prediction errors for the empirical risk minimizations in our models. In comparative simulation experiments, our models demonstrate superior performance over a simpler alternative method that transforms Gaussian distributions into matrices. We present an application of our methodology using weather data for illustration purposes.
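The "analytical solution" mentioned in the abstract is the well-known closed form for the squared 2-Wasserstein distance between two Gaussians, $W_2^2 = \|m_1 - m_2\|^2 + \operatorname{tr}\big(\Sigma_1 + \Sigma_2 - 2(\Sigma_1^{1/2}\Sigma_2\Sigma_1^{1/2})^{1/2}\big)$. A minimal NumPy sketch of that formula follows; the function names `psd_sqrt` and `gaussian_w2_squared` are illustrative choices here, not taken from the authors' code, and the regression models themselves are not reproduced.

```python
import numpy as np


def psd_sqrt(mat):
    """Square root of a symmetric positive semi-definite matrix via eigendecomposition."""
    vals, vecs = np.linalg.eigh(mat)
    vals = np.clip(vals, 0.0, None)  # guard against tiny negative eigenvalues from round-off
    return (vecs * np.sqrt(vals)) @ vecs.T


def gaussian_w2_squared(m1, s1, m2, s2):
    """Squared 2-Wasserstein distance between N(m1, s1) and N(m2, s2)."""
    m1, m2 = np.asarray(m1, dtype=float), np.asarray(m2, dtype=float)
    s1, s2 = np.asarray(s1, dtype=float), np.asarray(s2, dtype=float)
    root1 = psd_sqrt(s1)
    cross = psd_sqrt(root1 @ s2 @ root1)  # (S1^{1/2} S2 S1^{1/2})^{1/2}
    mean_term = float(np.sum((m1 - m2) ** 2))
    cov_term = float(np.trace(s1 + s2 - 2.0 * cross))
    return mean_term + cov_term


if __name__ == "__main__":
    identity = np.eye(2)
    # Same covariance, means 5 apart: the distance reduces to the squared mean shift.
    print(gaussian_w2_squared([0.0, 0.0], identity, [3.0, 4.0], identity))
```

When the two covariances are equal, the covariance term vanishes and only the squared distance between the means remains, which is a quick sanity check on any implementation.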

Gaussian mixture (model) · Estimation/estimator · Functional · Loss function (machine learning) · Mixture-of-experts model ·

February 7, 2024

On Parameter Estimation in Deviated Gaussian Mixture of Experts

Huy Nguyen,Khai Nguyen,Nhat Ho

from arxiv, 34 pages, 3 figures

We consider the parameter estimation problem in the deviated Gaussian mixture of experts in which the data are generated from $(1 - \lambda^{\ast}) g_0(Y| X)+ \lambda^{\ast} \sum_{i = 1}^{k_{\ast}} p_{i}^{\ast} f(Y|(a_{i}^{\ast})^{\top}X+b_i^{\ast},\sigma_{i}^{\ast})$, where $X, Y$ are respectively a covariate vector and a response variable, $g_{0}(Y|X)$ is a known function, $\lambda^{\ast} \in [0, 1]$ is the true but unknown mixing proportion, and $(p_{i}^{\ast}, a_{i}^{\ast}, b_{i}^{\ast}, \sigma_{i}^{\ast})$ for $1 \leq i \leq k_{\ast}$ are unknown parameters of the Gaussian mixture of experts. This problem arises from the goodness-of-fit test when we would like to test whether the data are generated from $g_{0}(Y|X)$ (null hypothesis) or they are generated from the whole mixture (alternative hypothesis). Based on the algebraic structure of the expert functions and the distinguishability between $g_0$ and the mixture part, we construct novel Voronoi-based loss functions to capture the convergence rates of maximum likelihood estimation (MLE) for our models. We further demonstrate that our proposed loss functions characterize the local convergence rates of parameter estimation more accurately than the generalized Wasserstein, a loss function commonly used for estimating parameters in the Gaussian mixture of experts.
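To make the data-generating density in the abstract concrete, here is a small sketch that evaluates it at a point. Several things are assumptions for illustration only: the names `normal_pdf`, `deviated_moe_density`, and `standard_normal_g0` are hypothetical, $g_0$ is taken to be a standard normal density that ignores $X$, and $\sigma_i$ is treated as a variance (the abstract does not say whether it is a variance or a standard deviation).

```python
import math


def normal_pdf(y, mean, variance):
    """Density of a univariate Gaussian with the given mean and variance."""
    return math.exp(-((y - mean) ** 2) / (2.0 * variance)) / math.sqrt(2.0 * math.pi * variance)


def standard_normal_g0(y, x):
    """Illustrative choice of the known function g0(Y|X); it ignores x (assumption)."""
    return normal_pdf(y, 0.0, 1.0)


def deviated_moe_density(y, x, lam, g0, experts):
    """Evaluate (1 - lam) * g0(y|x) + lam * sum_i p_i * f(y | a_i^T x + b_i, sigma_i).

    experts is a list of (p_i, a_i, b_i, sigma_i) tuples whose weights p_i sum to 1;
    sigma_i is interpreted as a variance here (assumption).
    """
    mixture = 0.0
    for weight, slope, intercept, sigma in experts:
        mean = sum(a_j * x_j for a_j, x_j in zip(slope, x)) + intercept
        mixture += weight * normal_pdf(y, mean, sigma)
    return (1.0 - lam) * g0(y, x) + lam * mixture


if __name__ == "__main__":
    experts = [(0.4, [1.0, -1.0], 0.5, 1.0), (0.6, [0.0, 2.0], -1.0, 0.25)]
    print(deviated_moe_density(0.3, [1.0, 0.5], 0.5, standard_normal_g0, experts))
```

Setting `lam` to 0 recovers $g_0$ alone, which corresponds to the null hypothesis of the goodness-of-fit test described above.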

SGD · Optimizer · Server · Nonconvex · Computational learning theory ·

February 7, 2024

Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity

Alexander Tyurin,Marta Pozzi,Ivan Ilin,Peter Richtárik

We consider nonconvex stochastic optimization problems in the asynchronous centralized distributed setup where the communication times from workers to a server cannot be ignored, and the computation and communication times are potentially different for all workers. Using an unbiased compression technique, we develop a new method, Shadowheart SGD, that provably improves the time complexities of all previous centralized methods. Moreover, we show that the time complexity of Shadowheart SGD is optimal in the family of centralized methods with compressed communication. We also consider the bidirectional setup, where broadcasting from the server to the workers is non-negligible, and develop a corresponding method.
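For readers unfamiliar with the term, an unbiased compressor is a randomized map $C$ with $\mathbb{E}[C(x)] = x$. The abstract does not say which compressor Shadowheart SGD uses, so the sketch below shows one standard example, random-k sparsification (Rand-K), purely as an illustration; the function name `rand_k` is a hypothetical choice, and the method itself is not reproduced.

```python
import random


def rand_k(vector, k, rng):
    """Rand-K sparsifier: keep k random coordinates, scale them by d/k, zero the rest.

    Each coordinate is kept with probability k/d, so the d/k scaling makes the
    output equal to the input in expectation (unbiasedness).
    """
    d = len(vector)
    if not 1 <= k <= d:
        raise ValueError("k must satisfy 1 <= k <= len(vector)")
    kept = set(rng.sample(range(d), k))
    scale = d / k
    return [scale * value if index in kept else 0.0 for index, value in enumerate(vector)]


if __name__ == "__main__":
    rng = random.Random(0)
    gradient = [1.0, 2.0, 3.0, 4.0]
    trials = 20000
    mean = [0.0] * len(gradient)
    for _ in range(trials):
        compressed = rand_k(gradient, 2, rng)
        mean = [m + c / trials for m, c in zip(mean, compressed)]
    print(mean)  # the empirical mean should be close to the original vector
```

Only k of the d coordinates need to be sent per message, which is why such compressors reduce communication time at the cost of extra variance.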

Guidance · Noise · Optimization · Fidelity · Stride ·

February 7, 2024

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing

Hansam Cho,Jonghyun Lee,Seoung Bum Kim,Tae-Hyun Oh,Yonghyun Jeong

from arxiv, ICLR 2024

Text-guided diffusion models have become a popular tool in image synthesis, known for producing high-quality and diverse images. However, their application to editing real images often encounters hurdles primarily due to the text condition deteriorating the reconstruction quality and subsequently affecting editing fidelity. Null-text Inversion (NTI) has made strides in this area, but it fails to capture spatial context and requires computationally intensive per-timestep optimization. Addressing these challenges, we present Noise Map Guidance (NMG), an inversion method rich in a spatial context, tailored for real-image editing. Significantly, NMG achieves this without necessitating optimization, yet preserves the editing quality. Our empirical investigations highlight NMG's adaptability across various editing techniques and its robustness to variants of DDIM inversions.

Knowledge · Machine Learning · MoDELS · Learned · Conformer ·

May 10, 2022

Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey

Julian Wörmann,Daniel Bogdoll,Etienne Bührle,Han Chen,Evaristus Fuh Chuo,Kostadin Cvejoski,Ludger van Elst,Tobias Gleißner,Philip Gottschall,Stefan Griesche,Christian Hellert,Christian Hesels,Sebastian Houben,Tim Joseph,Niklas Keil,Johann Kelsch,Hendrik Königshof,Erwin Kraft,Leonie Kreuser,Kevin Krone,Tobias Latka,Denny Mattern,Stefan Matthes,Mohsin Munir,Moritz Nekolla,Adrian Paschke,Maximilian Alexander Pintz,Tianming Qiu,Faraz Qureishi,Syed Tahseen Raza Rizvi,Jörg Reichardt,Laura von Rueden,Stefan Rudolph,Alexander Sagel,Gerhard Schunk,Hao Shen,Hendrik Stapelbroek,Vera Stehr,Gurucharan Srinivas,Anh Tuan Tran,Abhishek Vivekanandan,Ya Wang,Florian Wasserrab,Tino Werner,Christian Wirth,Stefan Zwicklbauer

from arxiv, 93 pages

The existence of representative datasets is a prerequisite of many successful artificial intelligence and machine learning models. However, the subsequent application of these models often involves scenarios that are inadequately represented in the data used for training. The reasons for this are manifold and range from time and cost constraints to ethical considerations. As a consequence, the reliable use of these models, especially in safety-critical applications, is a huge challenge. Leveraging additional, already existing sources of knowledge is key to overcome the limitations of purely data-driven approaches, and eventually to increase the generalization capability of these models. Furthermore, predictions that conform with knowledge are crucial for making trustworthy and safe decisions even in underrepresented scenarios. This work provides an overview of existing techniques and methods in the literature that combine data-based models with existing knowledge. The identified approaches are structured according to the categories integration, extraction and conformity. Special attention is given to applications in the field of autonomous driving.

GPU · Graph · Neural Networks · Networking · Layer ·

May 24, 2020

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Zonghan Wu,Shirui Pan,Guodong Long,Jing Jiang,Xiaojun Chang,Chengqi Zhang

from arxiv, Accepted by KDD 2020

Modeling multivariate time series has long been a subject that has attracted researchers from a diverse range of fields including economics, finance, and traffic. A basic assumption behind multivariate time series forecasting is that its variables depend on one another but, upon looking closely, it is fair to say that existing methods fail to fully exploit latent spatial dependencies between pairs of variables. In recent years, meanwhile, graph neural networks (GNNs) have shown high capability in handling relational dependencies. GNNs require well-defined graph structures for information propagation which means they cannot be applied directly for multivariate time series where the dependencies are not known in advance. In this paper, we propose a general graph neural network framework designed specifically for multivariate time series data. Our approach automatically extracts the uni-directed relations among variables through a graph learning module, into which external knowledge like variable attributes can be easily integrated. A novel mix-hop propagation layer and a dilated inception layer are further proposed to capture the spatial and temporal dependencies within the time series. The graph learning, graph convolution, and temporal convolution modules are jointly learned in an end-to-end framework. Experimental results show that our proposed model outperforms the state-of-the-art baseline methods on 3 of 4 benchmark datasets and achieves on-par performance with other approaches on two traffic datasets which provide extra structural information.

Faster R-CNN · domain shift · R-CNN · Object detection · Reducible ·

March 8, 2018

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Yuhua Chen,Wen Li,Christos Sakaridis,Dengxin Dai,Luc Van Gool

from arxiv, Accepted to CVPR 2018

Object detection typically assumes that training and test data are drawn from an identical distribution, which, however, does not always hold in practice. Such a distribution mismatch will lead to a significant performance drop. In this work, we aim to improve the cross-domain robustness of object detection. We tackle the domain shift on two levels: 1) the image-level shift, such as image style, illumination, etc., and 2) the instance-level shift, such as object appearance, size, etc. We build our approach based on the recent state-of-the-art Faster R-CNN model, and design two domain adaptation components, on image level and instance level, to reduce the domain discrepancy. The two domain adaptation components are based on H-divergence theory, and are implemented by learning a domain classifier in an adversarial training manner. The domain classifiers on different levels are further reinforced with a consistency regularization to learn a domain-invariant region proposal network (RPN) in the Faster R-CNN model. We evaluate our newly proposed approach using multiple datasets including Cityscapes, KITTI, SIM10K, etc. The results demonstrate the effectiveness of our proposed approach for robust object detection in various domain shift scenarios.