顾美玲国产一区二区三区_国产欧美日韩精品A在线播放_日韩专区欧美专区亚洲精品_免费人成网站在线看_午夜一级生片A国产一级毛片_国产一区二区精品夜夜嗨_黄色网站男人免费看

Anomaly detection is crucial to the advanced identification of product defects such as incorrect parts, misaligned components, and damages in industrial manufacturing. Due to the rare observations and unknown types of defects, anomaly detection is considered to be challenging in machine learning. To overcome this difficulty, recent approaches utilize the common visual representations pre-trained from natural image datasets and distill the relevant features. However, existing approaches still have the discrepancy between the pre-trained feature and the target data, or require the input augmentation which should be carefully designed, particularly for the industrial dataset. In this paper, we introduce ReConPatch, which constructs discriminative features for anomaly detection by training a linear modulation of patch features extracted from the pre-trained model. ReConPatch employs contrastive representation learning to collect and distribute features in a way that produces a target-oriented and easily separable representation. To address the absence of labeled pairs for the contrastive learning, we utilize two similarity measures between data representations, pairwise and contextual similarities, as pseudo-labels. Our method achieves the state-of-the-art anomaly detection performance (99.72%) for the widely used and challenging MVTec AD dataset. Additionally, we achieved a state-of-the-art anomaly detection performance (95.8%) for the BTAD dataset.

相關內容

異常檢測

關注 102

在(zai)數(shu)據(ju)(ju)(ju)(ju)挖掘中(zhong)，異常(chang)(chang)檢(jian)(jian)測(ce)(ce)(ce)(ce)（英語(yu)：anomaly detection）對(dui)不(bu)符合預(yu)期模(mo)(mo)式或(huo)數(shu)據(ju)(ju)(ju)(ju)集(ji)中(zhong)其(qi)他項目的(de)(de)(de)項目、事件或(huo)觀(guan)測(ce)(ce)(ce)(ce)值的(de)(de)(de)識別。通常(chang)(chang)異常(chang)(chang)項目會(hui)轉變成銀行(xing)(xing)欺詐、結構缺陷、醫療問題(ti)、文本(ben)錯(cuo)誤等類(lei)(lei)型的(de)(de)(de)問題(ti)。異常(chang)(chang)也被稱為(wei)離群值、新(xin)奇、噪聲、偏差和例(li)外。特別是(shi)(shi)在(zai)檢(jian)(jian)測(ce)(ce)(ce)(ce)濫用與網絡入侵(qin)時，有趣性對(dui)象(xiang)往(wang)往(wang)不(bu)是(shi)(shi)罕見對(dui)象(xiang)，但卻(que)是(shi)(shi)超出(chu)預(yu)料的(de)(de)(de)突發活(huo)動。這種模(mo)(mo)式不(bu)遵(zun)循通常(chang)(chang)統(tong)計定義中(zhong)把異常(chang)(chang)點(dian)看(kan)作是(shi)(shi)罕見對(dui)象(xiang)，于是(shi)(shi)許(xu)多(duo)(duo)異常(chang)(chang)檢(jian)(jian)測(ce)(ce)(ce)(ce)方(fang)法(fa)(fa)（特別是(shi)(shi)無監督(du)的(de)(de)(de)方(fang)法(fa)(fa)）將對(dui)此類(lei)(lei)數(shu)據(ju)(ju)(ju)(ju)失效，除非進(jin)行(xing)(xing)了(le)合適的(de)(de)(de)聚集(ji)。相反，聚類(lei)(lei)分(fen)析(xi)算法(fa)(fa)可能可以(yi)檢(jian)(jian)測(ce)(ce)(ce)(ce)出(chu)這些模(mo)(mo)式形成的(de)(de)(de)微(wei)聚類(lei)(lei)。有三大類(lei)(lei)異常(chang)(chang)檢(jian)(jian)測(ce)(ce)(ce)(ce)方(fang)法(fa)(fa)。[1] 在(zai)假設數(shu)據(ju)(ju)(ju)(ju)集(ji)中(zhong)大多(duo)(duo)數(shu)實例(li)都(dou)是(shi)(shi)正(zheng)常(chang)(chang)的(de)(de)(de)前提下，無監督(du)異常(chang)(chang)檢(jian)(jian)測(ce)(ce)(ce)(ce)方(fang)法(fa)(fa)能通過尋找(zhao)與其(qi)他數(shu)據(ju)(ju)(ju)(ju)最不(bu)匹配(pei)的(de)(de)(de)實例(li)來(lai)檢(jian)(jian)測(ce)(ce)(ce)(ce)出(chu)未標(biao)記測(ce)(ce)(ce)(ce)試數(shu)據(ju)(ju)(ju)(ju)的(de)(de)(de)異常(chang)(chang)。監督(du)式異常(chang)(chang)檢(jian)(jian)測(ce)(ce)(ce)(ce)方(fang)法(fa)(fa)需要一個(ge)已經被標(biao)記“正(zheng)常(chang)(chang)”與“異常(chang)(chang)”的(de)(de)(de)數(shu)據(ju)(ju)(ju)(ju)集(ji)，并涉(she)及到(dao)訓練分(fen)類(lei)(lei)器(qi)（與許(xu)多(duo)(duo)其(qi)他的(de)(de)(de)統(tong)計分(fen)類(lei)(lei)問題(ti)的(de)(de)(de)關鍵區(qu)別是(shi)(shi)異常(chang)(chang)檢(jian)(jian)測(ce)(ce)(ce)(ce)的(de)(de)(de)內在(zai)不(bu)均衡性）。半(ban)監督(du)式異常(chang)(chang)檢(jian)(jian)測(ce)(ce)(ce)(ce)方(fang)法(fa)(fa)根據(ju)(ju)(ju)(ju)一個(ge)給定的(de)(de)(de)正(zheng)常(chang)(chang)訓練數(shu)據(ju)(ju)(ju)(ju)集(ji)創建一個(ge)表示正(zheng)常(chang)(chang)行(xing)(xing)為(wei)的(de)(de)(de)模(mo)(mo)型，然后(hou)檢(jian)(jian)測(ce)(ce)(ce)(ce)由學(xue)習模(mo)(mo)型生成的(de)(de)(de)測(ce)(ce)(ce)(ce)試實例(li)的(de)(de)(de)可能性。

INTERACT · Learning · 多峰值 · 機器人 · MoDELS ·

2024 年 2 月 22 日

MOSAIC: Learning Unified Multi-Sensory Object Property Representations for Robot Learning via Interactive Perception

Gyan Tatiya,Jonathan Francis,Ho-Hsiang Wu,Yonatan Bisk,Jivko Sinapov

from arxiv, Accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA), May 13 to 17, 2024; Yokohama, Japan

A holistic understanding of object properties across diverse sensory modalities (e.g., visual, audio, and haptic) is essential for tasks ranging from object categorization to complex manipulation. Drawing inspiration from cognitive science studies that emphasize the significance of multi-sensory integration in human perception, we introduce MOSAIC (Multimodal Object property learning with Self-Attention and Interactive Comprehension), a novel framework designed to facilitate the learning of unified multi-sensory object property representations. While it is undeniable that visual information plays a prominent role, we acknowledge that many fundamental object properties extend beyond the visual domain to encompass attributes like texture, mass distribution, or sounds, which significantly influence how we interact with objects. In MOSAIC, we leverage this profound insight by distilling knowledge from multimodal foundation models and aligning these representations not only across vision but also haptic and auditory sensory modalities. Through extensive experiments on a dataset where a humanoid robot interacts with 100 objects across 10 exploratory behaviors, we demonstrate the versatility of MOSAIC in two task families: object categorization and object-fetching tasks. Our results underscore the efficacy of MOSAIC's unified representations, showing competitive performance in category recognition through a simple linear probe setup and excelling in the fetch object task under zero-shot transfer conditions. This work pioneers the application of sensory grounding in foundation models for robotics, promising a significant leap in multi-sensory perception capabilities for autonomous systems. We have released the code, datasets, and additional results: //github.com/gtatiya/MOSAIC.

可約的 · MoDELS · CASES · Performer · motivation ·

2024 年 2 月 22 日

Dissenting Explanations: Leveraging Disagreement to Reduce Model Overreliance

Omer Reingold,Judy Hanwen Shen,Aditi Talati

from arxiv, V2: AAAI 2024 V1: AI & HCI Workshop at ICML 2023

While explainability is a desirable characteristic of increasingly complex black-box models, modern explanation methods have been shown to be inconsistent and contradictory. The semantics of explanations is not always fully understood - to what extent do explanations "explain" a decision and to what extent do they merely advocate for a decision? Can we help humans gain insights from explanations accompanying correct predictions and not over-rely on incorrect predictions advocated for by explanations? With this perspective in mind, we introduce the notion of dissenting explanations: conflicting predictions with accompanying explanations. We first explore the advantage of dissenting explanations in the setting of model multiplicity, where multiple models with similar performance may have different predictions. In such cases, providing dissenting explanations could be done by invoking the explanations of disagreeing models. Through a pilot study, we demonstrate that dissenting explanations reduce overreliance on model predictions, without reducing overall accuracy. Motivated by the utility of dissenting explanations we present both global and local methods for their generation.

語音增強 · 卷積 · 狀態空間 · MoDELS · 上采樣 ·

2024 年 2 月 22 日

SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques

Changjiang Zhao,Shulin He,Xueliang Zhang

Speech enhancement aims to improve speech quality and intelligibility, especially in noisy environments where background noise degrades speech signals. Currently, deep learning methods achieve great success in speech enhancement, e.g. the representative convolutional recurrent neural network (CRN) and its variants. However, CRN typically employs consecutive downsampling and upsampling convolution for frequency modeling, which destroys the inherent structure of the signal over frequency. Additionally, convolutional layers lacks of temporal modelling abilities. To address these issues, we propose an innovative module combing a State space model and Inplace Convolution (SIC), and to replace the conventional convolution in CRN, called SICRN. Specifically, a dual-path multidimensional State space model captures the global frequencies dependency and long-term temporal dependencies. Meanwhile, the 2D-inplace convolution is used to capture the local structure, which abandons the downsampling and upsampling. Systematic evaluations on the public INTERSPEECH 2020 DNS challenge dataset demonstrate SICRN's efficacy. Compared to strong baselines, SICRN achieves performance close to state-of-the-art while having advantages in model parameters, computations, and algorithmic delay. The proposed SICRN shows great promise for improved speech enhancement.

AIM · Automator · 可約的 · 情景 · 可辨認的 ·

2024 年 2 月 21 日

AIM: Automated Input Set Minimization for Metamorphic Security Testing

Nazanin Bayati Chaleshtari,Yoann Marquer,Fabrizio Pastore,Lionel C. Briand

Although the security testing of Web systems can be automated by generating crafted inputs, solutions to automate the test oracle, i.e., distinguishing correct from incorrect outputs, remain preliminary. Specifically, previous work has demonstrated the potential of metamorphic testing; indeed, security failures can be determined by metamorphic relations that turn valid inputs into malicious inputs. However, without further guidance, metamorphic relations are typically executed on a large set of inputs, which is time-consuming and thus makes metamorphic testing impractical. We propose AIM, an approach that automatically selects inputs to reduce testing costs while preserving vulnerability detection capabilities. AIM includes a clustering-based black box approach, to identify similar inputs based on their security properties. It also relies on a novel genetic algorithm able to efficiently select diverse inputs while minimizing their total cost. Further, it contains a problem-reduction component to reduce the search space and speed up the minimization process. We evaluated the effectiveness of AIM on two well-known Web systems, Jenkins and Joomla, with documented vulnerabilities. We compared AIM's results with four baselines. Overall, AIM reduced metamorphic testing time by 84% for Jenkins and 82% for Joomla, while preserving vulnerability detection. Furthermore, AIM outperformed all the considered baselines regarding vulnerability coverage.

MoDELS · 模型評估 · 優化器 · Performer · INFORMS ·

2024 年 2 月 21 日

AI-Powered Predictions for Electricity Load in Prosumer Communities

Aleksei Kychkin,Georgios C. Chasparis

from arxiv, It has been presented in the 18. Symposium Energieinnovation (14.-16.02.2024). Further information can be found at: //www.tugraz.at/events/eninnov2024/home

The flexibility in electricity consumption and production in communities of residential buildings, including those with renewable energy sources and energy storage (a.k.a., prosumers), can effectively be utilized through the advancement of short-term demand response mechanisms. It is known that flexibility can further be increased if demand response is performed at the level of communities of prosumers, since aggregated groups can better coordinate electricity consumption. However, the effectiveness of such short-term optimization is highly dependent on the accuracy of electricity load forecasts both for each building as well as for the whole community. Structural variations in the electricity load profile can be associated with different exogenous factors, such as weather conditions, calendar information and day of the week, as well as user behavior. In this paper, we review a wide range of electricity load forecasting techniques, that can provide significant assistance in optimizing load consumption in prosumer communities. We present and test artificial intelligence (AI) powered short-term load forecasting methodologies that operate with black-box time series models, such as Facebook's Prophet and Long Short-term Memory (LSTM) models; season-based SARIMA and smoothing Holt-Winters models; and empirical regression-based models that utilize domain knowledge. The integration of weather forecasts into data-driven time series forecasts is also tested. Results show that the combination of persistent and regression terms (adapted to the load forecasting task) achieves the best forecast accuracy.

數據集 · GROUP · Elevate · 評論員 · 生物特征識別 ·

2022 年 11 月 3 日

Expanding Accurate Person Recognition to New Altitudes and Ranges: The BRIAR Dataset

David Cornett III,Joel Brogan,Nell Barber,Deniz Aykac,Seth Baird,Nick Burchfield,Carl Dukes,Andrew Duncan,Regina Ferrell,Jim Goddard,Gavin Jager,Matt Larson,Bart Murphy,Christi Johnson,Ian Shelley,Nisha Srinivas,Brandon Stockwell,Leanne Thompson,Matt Yohe,Robert Zhang,Scott Dolvin,Hector J. Santos-Villalobos,David S. Bolme

Face recognition technology has advanced significantly in recent years due largely to the availability of large and increasingly complex training datasets for use in deep learning models. These datasets, however, typically comprise images scraped from news sites or social media platforms and, therefore, have limited utility in more advanced security, forensics, and military applications. These applications require lower resolution, longer ranges, and elevated viewpoints. To meet these critical needs, we collected and curated the first and second subsets of a large multi-modal biometric dataset designed for use in the research and development (R&D) of biometric recognition technologies under extremely challenging conditions. Thus far, the dataset includes more than 350,000 still images and over 1,300 hours of video footage of approximately 1,000 subjects. To collect this data, we used Nikon DSLR cameras, a variety of commercial surveillance cameras, specialized long-rage R&D cameras, and Group 1 and Group 2 UAV platforms. The goal is to support the development of algorithms capable of accurately recognizing people at ranges up to 1,000 m and from high angles of elevation. These advances will include improvements to the state of the art in face recognition and will support new research in the area of whole-body recognition using methods based on gait and anthropometry. This paper describes methods used to collect and curate the dataset, and the dataset's characteristics at the current stage.

可理解性 · MoDELS · INFORMS · 穩健性 · 黑盒 ·

2022 年 4 月 30 日

ExSum: From Local Explanations to Model Understanding

Yilun Zhou,Marco Tulio Ribeiro,Julie Shah

from arxiv, NAACL 2022. The project website is at //yilunzhou.github.io/exsum/

Interpretability methods are developed to understand the working mechanisms of black-box models, which is crucial to their responsible deployment. Fulfilling this goal requires both that the explanations generated by these methods are correct and that people can easily and reliably understand them. While the former has been addressed in prior work, the latter is often overlooked, resulting in informal model understanding derived from a handful of local explanations. In this paper, we introduce explanation summary (ExSum), a mathematical framework for quantifying model understanding, and propose metrics for its quality assessment. On two domains, ExSum highlights various limitations in the current practice, helps develop accurate model understanding, and reveals easily overlooked properties of the model. We also connect understandability to other properties of explanations such as human alignment, robustness, and counterfactual minimality and plausibility.

MoDELS · Taxonomy · INFORMS · 可理解性 · IR ·

2019 年 8 月 15 日

Explainable Recommendation: A Survey and New Perspectives

Yongfeng Zhang,Xu Chen

from arxiv, 88 pages

Explainable recommendation attempts to develop models that generate not only high-quality recommendations but also intuitive explanations. The explanations may either be post-hoc or directly come from an explainable model (also called interpretable or transparent model in some context). Explainable recommendation tries to address the problem of why: by providing explanations to users or system designers, it helps humans to understand why certain items are recommended by the algorithm, where the human can either be users or system designers. Explainable recommendation helps to improve the transparency, persuasiveness, effectiveness, trustworthiness, and satisfaction of recommendation systems. In this survey, we review works on explainable recommendation in or before the year of 2019. We first highlight the position of explainable recommendation in recommender system research by categorizing recommendation problems into the 5W, i.e., what, when, who, where, and why. We then conduct a comprehensive survey of explainable recommendation on three perspectives: 1) We provide a chronological research timeline of explainable recommendation, including user study approaches in the early years and more recent model-based approaches. 2) We provide a two-dimensional taxonomy to classify existing explainable recommendation research: one dimension is the information source (or display style) of the explanations, and the other dimension is the algorithmic mechanism to generate explainable recommendations. 3) We summarize how explainable recommendation applies to different recommendation tasks, such as product recommendation, social recommendation, and POI recommendation. We also devote a section to discuss the explanation perspectives in broader IR and AI/ML research. We end the survey by discussing potential future directions to promote the explainable recommendation research area and beyond.

圖卷積神經網絡/圖卷積網絡 · AdaBoost · 圖卷積 · 圖 · Networking ·

2019 年 8 月 14 日

AdaGCN: Adaboosting Graph Convolutional Networks into Deep Models

Ke Sun,Zhouchen Lin,Zhanxing Zhu

The design of deep graph models still remains to be investigated and the crucial part is how to explore and exploit the knowledge from different hops of neighbors in an efficient way. In this paper, we propose a novel RNN-like deep graph neural network architecture by incorporating AdaBoost into the computation of network; and the proposed graph convolutional network called AdaGCN~(AdaBoosting Graph Convolutional Network) has the ability to efficiently extract knowledge from high-order neighbors and integrate knowledge from different hops of neighbors into the network in an AdaBoost way. We also present the architectural difference between AdaGCN and existing graph convolutional methods to show the benefits of our proposal. Finally, extensive experiments demonstrate the state-of-the-art prediction performance and the computational advantage of our approach AdaGCN.

state-of-the-art · 可理解性 · BERT · 去噪自編碼器 · Performer ·

2019 年 6 月 19 日

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang,Zihang Dai,Yiming Yang,Jaime Carbonell,Ruslan Salakhutdinov,Quoc V. Le

from arxiv, Pretrained models and code are available at //github.com/zihangdai/xlnet

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling. However, relying on corrupting the input with masks, BERT neglects dependency between the masked positions and suffers from a pretrain-finetune discrepancy. In light of these pros and cons, we propose XLNet, a generalized autoregressive pretraining method that (1) enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and (2) overcomes the limitations of BERT thanks to its autoregressive formulation. Furthermore, XLNet integrates ideas from Transformer-XL, the state-of-the-art autoregressive model, into pretraining. Empirically, XLNet outperforms BERT on 20 tasks, often by a large margin, and achieves state-of-the-art results on 18 tasks including question answering, natural language inference, sentiment analysis, and document ranking.