
Deep autoencoders provide an effective tool for learning non-linear dimensionality reduction in an unsupervised way. Recently, they have been used for anomaly detection in the visual domain. The common belief is that a network optimized for reconstruction error on anomaly-free examples should fail to accurately reconstruct anomalous regions in the application phase. This goal is typically addressed by controlling the capacity of the network, either by reducing the size of the bottleneck layer or by enforcing sparsity constraints on the activations. However, neither of these techniques explicitly penalizes the reconstruction of anomalous signals, which often results in poor detection. We tackle this problem by adapting a self-supervised learning regime that allows the use of discriminative information during training but focuses on the data manifold of normal examples. We emphasize that our approach is very efficient during both training and prediction, requiring a single forward pass for each input image. Our experiments on the MVTec AD dataset demonstrate high detection and localization performance. On the texture subset, in particular, our approach consistently outperforms recent anomaly detection methods by a significant margin.
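
The reconstruction-error idea described in the abstract can be illustrated with a minimal sketch. It assumes a trained autoencoder already exists and only shows the scoring step; the toy image, the reconstruction, and the threshold value are hypothetical and are not taken from the paper.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel intensities


def anomaly_map(image: Image, reconstruction: Image) -> Image:
    """Per-pixel squared reconstruction error."""
    return [
        [(p - q) ** 2 for p, q in zip(row, rec_row)]
        for row, rec_row in zip(image, reconstruction)
    ]


def image_score(error_map: Image) -> float:
    """Image-level anomaly score: the largest per-pixel error."""
    return max(max(row) for row in error_map)


# Toy data (assumed): the hypothetical autoencoder reproduces normal texture
# but outputs normal texture at position (1, 1) instead of the defect.
image = [[0.5, 0.5, 0.5], [0.5, 1.0, 0.5], [0.5, 0.5, 0.5]]
reconstruction = [[0.5, 0.5, 0.5], [0.5, 0.5, 0.5], [0.5, 0.5, 0.5]]

errors = anomaly_map(image, reconstruction)
THRESHOLD = 0.1  # assumed value; usually tuned on held-out normal images
mask = [[e > THRESHOLD for e in row] for row in errors]
```

The mask localizes the defect, while `image_score(errors)` gives a single number for image-level detection. Taking the maximum is one possible aggregation; a mean or a percentile would also be reasonable.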

Related content

Anomaly detection


In data mining, anomaly detection is the identification of items, events, or observations that do not conform to an expected pattern or to the other items in a dataset. Anomalous items typically translate into problems such as bank fraud, structural defects, medical problems, or errors in text. Anomalies are also referred to as outliers, novelties, noise, deviations, and exceptions. In particular, when detecting abuse and network intrusion, the interesting objects are often not rare objects but unexpected bursts of activity. This pattern does not follow the usual statistical definition of an outlier as a rare object, so many anomaly detection methods (especially unsupervised ones) fail on such data unless it has been aggregated appropriately. Conversely, cluster analysis algorithms may be able to detect the micro-clusters formed by these patterns. There are three broad categories of anomaly detection methods. [1]
Under the assumption that the majority of instances in the dataset are normal, unsupervised anomaly detection methods detect anomalies in unlabeled test data by looking for the instances that fit the rest of the data least well. Supervised anomaly detection methods require a dataset labeled as "normal" and "abnormal" and involve training a classifier (the key difference from many other statistical classification problems is the inherent class imbalance of anomaly detection). Semi-supervised anomaly detection methods build a model representing normal behavior from a given normal training dataset and then test the likelihood that a test instance was generated by the learned model.
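
The semi-supervised variant can be sketched in a few lines. This is a minimal illustration that assumes a one-dimensional Gaussian as the model of normal behavior; the training values and the three-standard-deviation cut-off are made up for the example.

```python
import math
import statistics
from typing import List, Tuple


def fit_normal_model(train: List[float]) -> Tuple[float, float]:
    """Estimate mean and sample standard deviation from normal-only data."""
    return statistics.fmean(train), statistics.stdev(train)


def log_likelihood(x: float, mean: float, std: float) -> float:
    """Log-density of x under the fitted Gaussian."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))


# Training set that contains only normal measurements (assumed toy data).
normal_train = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3]
mean, std = fit_normal_model(normal_train)

test_points = [10.05, 9.9, 14.0]
scores = [log_likelihood(x, mean, std) for x in test_points]
# Flag instances more than three standard deviations away (assumed cut-off).
flags = [abs(x - mean) / std > 3.0 for x in test_points]
```

Only the last test point is flagged, and it also has the lowest log-likelihood. Richer models of normal behavior follow the same pattern: fit on normal data only, then score how plausible each test instance is.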

Identifiable · Learning · Representation learning · Representation · Scenario ·

November 7, 2023

Multi-View Causal Representation Learning with Partial Observability

Dingling Yao,Danru Xu,Sébastien Lachapelle,Sara Magliacane,Perouz Taslakian,Georg Martius,Julius von Kügelgen,Francesco Locatello

from arxiv, 28 pages, 10 figures, 10 tables

We present a unified framework for studying the identifiability of representations learned from simultaneously observed views, such as different data modalities. We allow a partially observed setting in which each view constitutes a nonlinear mixture of a subset of underlying latent variables, which can be causally related. We prove that the information shared across all subsets of any number of views can be learned up to a smooth bijection using contrastive learning and a single encoder per view. We also provide graphical criteria indicating which latent variables can be identified through a simple set of rules, which we refer to as identifiability algebra. Our general framework and theoretical results unify and extend several previous works on multi-view nonlinear ICA, disentanglement, and causal representation learning. We experimentally validate our claims on numerical, image, and multi-modal data sets. Further, we demonstrate that the performance of prior methods is recovered in different special cases of our setup. Overall, we find that access to multiple partial views enables us to identify a more fine-grained representation, under the generally milder assumption of partial observability.
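
As a rough illustration of contrastive learning across two views, the sketch below computes an InfoNCE-style loss on already-encoded vectors. InfoNCE is a commonly used contrastive objective and is assumed here for illustration; the abstract does not state the exact loss, and the toy embeddings and temperature are hypothetical.

```python
import math
from typing import List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def normalize(v: Vector) -> Vector:
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


def info_nce(view_a: List[Vector], view_b: List[Vector],
             temperature: float = 0.1) -> float:
    """Mean InfoNCE loss; row i of view_b is the positive for row i of view_a."""
    a = [normalize(v) for v in view_a]
    b = [normalize(v) for v in view_b]
    total = 0.0
    for i, anchor in enumerate(a):
        logits = [dot(anchor, other) / temperature for other in b]
        top = max(logits)  # subtract the maximum for numerical stability
        log_denominator = top + math.log(sum(math.exp(l - top) for l in logits))
        total += log_denominator - logits[i]
    return total / len(a)


# Toy encoder outputs (assumed): each row pair describes the same sample.
view_a = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
view_b = [[0.9, 0.1], [0.1, 0.9], [-0.9, 0.1]]
shuffled_b = view_b[1:] + view_b[:1]  # break the pairing

aligned_loss = info_nce(view_a, view_b)
shuffled_loss = info_nce(view_a, shuffled_b)
```

The loss is low when paired rows agree and high when the pairing is broken, which is the signal that pushes the per-view encoders toward the information the views share.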

MoDELS · Language modeling · Spanned subspace · Model evaluation · Knowledge ·

November 6, 2023

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

Yuzhen Huang,Yuzhuo Bai,Zhihao Zhu,Junlei Zhang,Jinghan Zhang,Tangjun Su,Junteng Liu,Chuancheng Lv,Yikai Zhang,Jiayi Lei,Yao Fu,Maosong Sun,Junxian He

from arxiv, NeurIPS 2023. Website: //cevalbenchmark.com

New NLP benchmarks are urgently needed to align with the rapid development of large language models (LLMs). We present C-Eval, the first comprehensive Chinese evaluation suite designed to assess advanced knowledge and reasoning abilities of foundation models in a Chinese context. C-Eval comprises multiple-choice questions across four difficulty levels: middle school, high school, college, and professional. The questions span 52 diverse disciplines, ranging from humanities to science and engineering. C-Eval is accompanied by C-Eval Hard, a subset of very challenging subjects in C-Eval that requires advanced reasoning abilities to solve. We conduct a comprehensive evaluation of the most advanced LLMs on C-Eval, including both English- and Chinese-oriented models. Results indicate that only GPT-4 could achieve an average accuracy of over 60%, suggesting that there is still significant room for improvement for current LLMs. We anticipate C-Eval will help analyze important strengths and shortcomings of foundation models, and foster their development and growth for Chinese users.

Learning · Representation learning · Representation · Controller · Reinforcement learning ·

November 5, 2023

For SALE: State-Action Representation Learning for Deep Reinforcement Learning

Scott Fujimoto,Wei-Di Chang,Edward J. Smith,Shixiang Shane Gu,Doina Precup,David Meger

from arxiv, NeurIPS 2023

In the field of reinforcement learning (RL), representation learning is a proven tool for complex image-based tasks, but is often overlooked for environments with low-level states, such as physical control problems. This paper introduces SALE, a novel approach for learning embeddings that model the nuanced interaction between state and action, enabling effective representation learning from low-level states. We extensively study the design space of these embeddings and highlight important design considerations. We integrate SALE and an adaptation of checkpoints for RL into TD3 to form the TD7 algorithm, which significantly outperforms existing continuous control algorithms. On OpenAI gym benchmark tasks, TD7 has an average performance gain of 276.7% and 50.7% over TD3 at 300k and 5M time steps, respectively, and works in both the online and offline settings.

Integration · Maximal · Constraints · Dynamics simulation · Robustness ·

November 5, 2023

Variational Integrators and Graph-Based Solvers for Multibody Dynamics in Maximal Coordinates

Jan Brüdigam,Stefan Sosnowski,Zachary Manchester,Sandra Hirche

Multibody dynamics simulators are an important tool in many fields, including learning and control for robotics. However, many existing dynamics simulators suffer from inaccuracies when dealing with constrained mechanical systems due to unsuitable integrators with bad energy behavior and problematic constraint violations, for example in contact interactions. Variational integrators are numerical discretization methods that can reduce physical inaccuracies when simulating mechanical systems, and formulating the dynamics in maximal coordinates allows for easy and numerically robust incorporation of constraints such as kinematic loops or contacts. Therefore, this article derives a variational integrator for mechanical systems with equality and inequality constraints in maximal coordinates. Additionally, efficient graph-based sparsity-exploiting algorithms for solving the integrator are provided and implemented as an open-source simulator. The evaluation of the simulator shows improved physical accuracy due to the variational integrator and the advantages of the sparse solvers. Comparisons to minimal-coordinate algorithms show improved numerical robustness, and application examples of a walking robot and an exoskeleton with explicit constraints demonstrate the necessity and capabilities of maximal coordinates.
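
The energy behavior mentioned in the abstract can be seen on a much smaller problem. The sketch below compares explicit Euler with Störmer-Verlet, a classic variational integrator, on an unconstrained harmonic oscillator with unit mass and stiffness. It is not the integrator derived in the article, and the step size and step count are arbitrary choices.

```python
from typing import Tuple


def explicit_euler(q: float, p: float, h: float, steps: int) -> Tuple[float, float]:
    """Explicit Euler for dq/dt = p, dp/dt = -q."""
    for _ in range(steps):
        q, p = q + h * p, p - h * q
    return q, p


def stormer_verlet(q: float, p: float, h: float, steps: int) -> Tuple[float, float]:
    """Stormer-Verlet (kick, drift, kick) for the same system."""
    for _ in range(steps):
        p_half = p - 0.5 * h * q   # half step in momentum, force = -q
        q = q + h * p_half         # full step in position
        p = p_half - 0.5 * h * q   # second half step with the updated force
    return q, p


def energy(q: float, p: float) -> float:
    return 0.5 * (p * p + q * q)


q0, p0, h, steps = 1.0, 0.0, 0.1, 1000
initial_energy = energy(q0, p0)
euler_energy = energy(*explicit_euler(q0, p0, h, steps))
verlet_energy = energy(*stormer_verlet(q0, p0, h, steps))
```

Explicit Euler multiplies the energy by `1 + h**2` at every step, so it grows without bound, while the Verlet energy stays within a small band around its initial value over the whole run.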

Data augmentation · Few-shot learning · Seven · Reducible · False negatives ·

November 3, 2023

Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection

Gretel Liz De la Peña Sarracén,Paolo Rosso,Robert Litschko,Goran Glavaš,Simone Paolo Ponzetto

from arxiv, Accepted at EMNLP 2023 (Main Conference)

Cross-lingual transfer learning from high-resource to medium and low-resource languages has shown encouraging results. However, the scarcity of resources in target languages remains a challenge. In this work, we resort to data augmentation and continual pre-training for domain adaptation to improve cross-lingual abusive language detection. For data augmentation, we analyze two existing techniques based on vicinal risk minimization and propose MIXAG, a novel data augmentation method which interpolates pairs of instances based on the angle of their representations. Our experiments involve seven languages typologically distinct from English and three different domains. The results reveal that the data augmentation strategies can enhance few-shot cross-lingual abusive language detection. Specifically, we observe that, consistently across all target languages, MIXAG improves significantly in multidomain and multilingual environments. Finally, we show through an error analysis how the domain adaptation can favour the class of abusive texts (reducing false negatives) but, at the same time, reduces the precision of the abusive language detection model.

Learning · Greedy · Optimizer · Internal node · binary ·

November 3, 2023

GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent

Sascha Marton,Stefan Lüdtke,Christian Bartelt,Heiner Stuckenschmidt

Decision Trees (DTs) are commonly used for many machine learning tasks due to their high degree of interpretability. However, learning a DT from data is a difficult optimization problem, as it is non-convex and non-differentiable. Therefore, common approaches learn DTs using a greedy growth algorithm that minimizes the impurity locally at each internal node. Unfortunately, this greedy procedure can lead to inaccurate trees. In this paper, we present a novel approach for learning hard, axis-aligned DTs with gradient descent. The proposed method uses backpropagation with a straight-through operator on a dense DT representation, to jointly optimize all tree parameters. Our approach outperforms existing methods on binary classification benchmarks and achieves competitive results for multi-class tasks. The method is available under: //github.com/s-marton/GradTree
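
The straight-through idea can be shown on a single split. The sketch below is a toy, hand-derived example and not the GradTree implementation: the forward pass uses a hard threshold decision, while the backward pass substitutes the gradient of a sigmoid. The data, learning rate, and initial threshold are assumptions.

```python
import math
from typing import List


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def predict(x: float, threshold: float) -> float:
    """Hard, axis-aligned split: predict 1 if x > threshold, else 0."""
    return 1.0 if x > threshold else 0.0


def train_threshold(xs: List[float], ys: List[float], threshold: float = 0.2,
                    lr: float = 0.1, steps: int = 500) -> float:
    """Gradient descent on squared error with a straight-through gradient."""
    for _ in range(steps):
        grad = 0.0
        for x, y in zip(xs, ys):
            hard = predict(x, threshold)          # forward: hard decision
            s = sigmoid(x - threshold)
            d_hard_d_threshold = -s * (1.0 - s)   # backward: sigmoid surrogate
            grad += 2.0 * (hard - y) * d_hard_d_threshold
        threshold -= lr * grad / len(xs)
    return threshold


# Toy data (assumed): the true split lies just above x = 0.6.
xs = [i / 20 for i in range(21)]
ys = [1.0 if x > 0.6 else 0.0 for x in xs]

learned = train_threshold(xs, ys)
errors = sum(predict(x, learned) != y for x, y in zip(xs, ys))
```

The hard step function has zero gradient almost everywhere, so without the surrogate the threshold would never move. With it, every misclassified point pushes the threshold toward the correct side, and the updates stop once all points are classified correctly.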

Networking · Latent · Neural Networks · Networks · Composite data ·

November 3, 2023

Physics-Informed Generator-Encoder Adversarial Networks with Latent Space Matching for Stochastic Differential Equations

Ruisong Gao,Min Yang,Jin Zhang

from arxiv, 24 pages

We propose a new class of physics-informed neural networks, called Physics-Informed Generator-Encoder Adversarial Networks, to effectively address the challenges posed by forward, inverse, and mixed problems in stochastic differential equations. In these scenarios, while the governing equations are known, the available data consist of only a limited set of snapshots for system parameters. Our model consists of two key components: the generator and the encoder, both updated alternately by gradient descent. In contrast to previous approaches of directly matching the approximated solutions with real snapshots, we employ an indirect matching that operates within the lower-dimensional latent feature space. This method circumvents challenges associated with high-dimensional inputs and complex data distributions, while yielding more accurate solutions compared to existing neural network solvers. In addition, the approach also mitigates the training instability issues encountered in previous adversarial frameworks in an efficient manner. Numerical results provide compelling evidence of the effectiveness of the proposed method in solving different types of stochastic differential equations.

Data augmentation · Taxonomy · Text classification · Machine Learning · Training data ·

July 7, 2021

A Survey on Data Augmentation for Text Classification

Markus Bayer,Marc-André Kaufhold,Christian Reuter

from arxiv, 35 pages, 6 figures, 8 tables

Data augmentation, the artificial creation of training data for machine learning by transformations, is a widely studied research field across machine learning disciplines. While it is useful for increasing the generalization capabilities of a model, it can also address many other challenges and problems, from overcoming a limited amount of training data, through regularizing the objective, to limiting the amount of data used in order to protect privacy. Based on a precise description of the goals and applications of data augmentation (C1) and a taxonomy for existing works (C2), this survey is concerned with data augmentation methods for textual classification and aims to achieve a concise and comprehensive overview for researchers and practitioners (C3). Derived from the taxonomy, we divided more than 100 methods into 12 different groupings and provide state-of-the-art references expounding which methods are highly promising (C4). Finally, research perspectives that may constitute a building block for future work are given (C5).

state-of-the-art · LISA · Performer · MoDELS · Annotation ·

August 28, 2018

Linguistically-Informed Self-Attention for Semantic Role Labeling

Emma Strubell,Patrick Verga,Daniel Andor,David Weiss,Andrew McCallum

from arxiv, In Conference on Empirical Methods in Natural Language Processing (EMNLP). Brussels, Belgium. October 2018

Current state-of-the-art semantic role labeling (SRL) uses a deep neural network with no explicit linguistic features. However, prior work has shown that gold syntax trees can dramatically improve SRL decoding, suggesting the possibility of increased accuracy from explicit modeling of syntax. In this work, we present linguistically-informed self-attention (LISA): a neural network model that combines multi-head self-attention with multi-task learning across dependency parsing, part-of-speech tagging, predicate detection and SRL. Unlike previous models which require significant pre-processing to prepare linguistic features, LISA can incorporate syntax using merely raw tokens as input, encoding the sequence only once to simultaneously perform parsing, predicate detection and role labeling for all predicates. Syntax is incorporated by training one attention head to attend to syntactic parents for each token. Moreover, if a high-quality syntactic parse is already available, it can be beneficially injected at test time without re-training our SRL model. In experiments on CoNLL-2005 SRL, LISA achieves new state-of-the-art performance for a model using predicted predicates and standard word embeddings, attaining 2.5 F1 absolute higher than the previous state-of-the-art on newswire and more than 3.5 F1 on out-of-domain data, nearly 10% reduction in error. On CoNLL-2012 English SRL we also show an improvement of more than 2.5 F1. LISA also outperforms the state-of-the-art with contextually-encoded (ELMo) word representations, by nearly 1.0 F1 on news and more than 2.0 F1 on out-of-domain text.

Graph · Learned · Extensibility · Knowledge graph · Smoothing ·

May 31, 2018

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Michael Kampffmeyer,Yinbo Chen,Xiaodan Liang,Hao Wang,Yujia Zhang,Eric P. Xing

from arxiv, The first two authors contributed equally. Code at //github.com/cyvius96/adgpm

The potential of graph convolutional neural networks for the task of zero-shot learning has been demonstrated recently. These models are highly sample efficient as related concepts in the graph structure share statistical strength allowing generalization to new classes when faced with a lack of data. However, knowledge from distant nodes can get diluted when propagating through intermediate nodes, because current approaches to zero-shot learning use graph propagation schemes that perform Laplacian smoothing at each layer. We show that extensive smoothing does not help the task of regressing classifier weights in zero-shot learning. In order to still incorporate information from distant nodes and utilize the graph structure, we propose an Attentive Dense Graph Propagation Module (ADGPM). ADGPM allows us to exploit the hierarchical graph structure of the knowledge graph through additional connections. These connections are added based on a node's relationship to its ancestors and descendants and an attention scheme is further used to weigh their contribution depending on the distance to the node. Finally, we illustrate that finetuning of the feature representation after training the ADGPM leads to considerable improvements. Our method achieves competitive results, outperforming previous zero-shot learning approaches.
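
The dilution effect described in the abstract can be reproduced on a four-node chain. The sketch below assumes a simple smoothing layer that averages each node with its neighbours (self-loop included) and a uniformly weighted shortcut edge; ADGPM itself adds ancestor and descendant connections with distance-dependent attention weights, which this toy example does not model.

```python
from typing import List

Matrix = List[List[int]]


def path_graph(n: int) -> Matrix:
    """Adjacency matrix of the chain 0 - 1 - ... - (n - 1)."""
    return [[1 if abs(i - j) == 1 else 0 for j in range(n)] for i in range(n)]


def propagate(adjacency: Matrix, signal: List[float]) -> List[float]:
    """One smoothing layer: average each node with its neighbours and itself."""
    n = len(signal)
    smoothed = []
    for i in range(n):
        neighbours = [j for j in range(n) if adjacency[i][j] or i == j]
        smoothed.append(sum(signal[j] for j in neighbours) / len(neighbours))
    return smoothed


one_hot = [0.0, 0.0, 0.0, 1.0]  # knowledge that lives only at node 3

# Chain: the signal needs three layers to reach node 0.
chain = path_graph(4)
signal = one_hot
for _ in range(3):
    signal = propagate(chain, signal)
diluted = signal[0]

# Dense variant (assumed): add a direct edge between node 0 and node 3.
dense = path_graph(4)
dense[0][3] = dense[3][0] = 1
direct = propagate(dense, one_hot)[0]
```

After three smoothing layers only 1/18 of the original signal arrives at node 0, whereas a single layer over the shortcut edge delivers 1/3, which is the motivation for connecting distant nodes directly.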