Anomaly detection is the task of identifying abnormal behavior of a system. Anomaly detection in computational workflows is of special interest because of its wide implications in various domains such as cybersecurity, finance, and social networks. However, anomaly detection in computational workflows (often modeled as graphs) is a relatively unexplored problem and poses distinct challenges. For instance, when anomaly detection is performed on graph data, the complex interdependency of nodes and edges, the heterogeneity of node attributes, and edge types must be accounted for. Although the use of graph neural networks can help capture complex interdependencies, the scarcity of labeled anomalous examples from workflow executions is still a significant challenge. To address this problem, we introduce an autoencoder-driven self-supervised learning (SSL) approach that learns a summary statistic from unlabeled workflow data and estimates the normal behavior of the computational workflow in the latent space. In this approach, we combine generative and contrastive learning objectives to detect outliers in the summary statistics. We demonstrate that by estimating the distribution of normal behavior in the latent space, we can outperform state-of-the-art anomaly detection methods on our benchmark datasets.

Related content

Anomaly detection

In data mining, anomaly detection is the identification of items, events, or observations that do not conform to an expected pattern or to the other items in a dataset. Anomalous items typically translate into problems such as bank fraud, structural defects, medical problems, or errors in text. Anomalies are also referred to as outliers, novelties, noise, deviations, and exceptions. In particular, when detecting abuse and network intrusion, the interesting objects are often not rare objects but unexpected bursts of activity. This pattern does not follow the usual statistical definition of an outlier as a rare object, so many anomaly detection methods (unsupervised methods in particular) fail on such data unless it has been aggregated appropriately. Instead, a cluster analysis algorithm may be able to detect the micro-clusters formed by these patterns. There are three broad categories of anomaly detection methods. [1]
Unsupervised anomaly detection methods detect anomalies in unlabeled test data, under the assumption that most instances in the dataset are normal, by looking for the instances that fit the rest of the data least well. Supervised anomaly detection methods require a dataset that has been labeled as "normal" and "abnormal" and involve training a classifier (the key difference from many other statistical classification problems is the inherent class imbalance of anomaly detection). Semi-supervised anomaly detection methods build a model of normal behavior from a given normal training dataset, and then test the likelihood that a test instance was generated by the learned model.
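
The semi-supervised setting described above can be sketched in a few lines. This is only an illustrative sketch, assuming a one-dimensional feature and a Gaussian model of normal behavior; the function names, the sample values, and the threshold of 3 standard deviations are all hypothetical choices, not taken from any of the works listed here.

```python
import statistics

def fit_normal_model(normal_samples):
    """Estimate mean and standard deviation from data assumed to be normal."""
    return statistics.fmean(normal_samples), statistics.stdev(normal_samples)

def anomaly_score(x, model):
    """Absolute z-score: large values mean x is unlikely under the model."""
    mean, std = model
    return abs(x - mean) / std

def is_anomaly(x, model, threshold=3.0):
    """Flag x when it lies more than `threshold` standard deviations away."""
    return anomaly_score(x, model) > threshold

# Hypothetical "normal" training data (e.g. task runtimes in seconds).
normal_runtimes = [10.1, 9.8, 10.3, 10.0, 9.9, 10.2, 10.1, 9.7]
model = fit_normal_model(normal_runtimes)

# Score two unseen instances: one typical, one far from the training data.
flags = [is_anomaly(x, model) for x in (10.0, 14.5)]
print(flags)  # [False, True]
```

A fixed z-score threshold is the simplest possible decision rule; in practice the threshold would be tuned on held-out data, and the Gaussian assumption would need to be checked.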

Text classification · Prompt · tuning · Continuity · Networking

November 17, 2023

Prompt Tuning on Graph-augmented Low-resource Text Classification

Zhihao Wen,Yuan Fang

from arxiv, 26 pages, journal under review. arXiv admin note: substantial text overlap with arXiv:2305.03324

Text classification is a fundamental problem in information retrieval with many real-world applications, such as predicting the topics of online articles and the categories of e-commerce product descriptions. However, low-resource text classification, with no or few labeled samples, presents a serious concern for supervised learning. Meanwhile, many text data are inherently grounded on a network structure, such as a hyperlink/citation network for online articles, and a user-item purchase network for e-commerce products. These graph structures capture rich semantic relationships, which can potentially augment low-resource text classification. In this paper, we propose a novel model called Graph-Grounded Pre-training and Prompting (G2P2) to address low-resource text classification in a two-pronged approach. During pre-training, we propose three graph interaction-based contrastive strategies to jointly pre-train a graph-text model; during downstream classification, we explore handcrafted discrete prompts and continuous prompt tuning for the jointly pre-trained model to achieve zero- and few-shot classification, respectively. Besides, for generalizing continuous prompts to unseen classes, we propose conditional prompt tuning on graphs (G2P2$^*$). Extensive experiments on four real-world datasets demonstrate the strength of G2P2 in zero- and few-shot low-resource text classification tasks, and illustrate the advantage of G2P2$^*$ in dealing with unseen classes.

MoDELS · Mathematics · Path · Performer · state-of-the-art

November 16, 2023

Outcome-supervised Verifiers for Planning in Mathematical Reasoning

Fei Yu,Anningzhe Gao,Benyou Wang

from arxiv, //github.com/FreedomIntelligence/OVM

Large language models (LLMs) often struggle with maintaining accuracy across a sequence of intermediate reasoning steps in mathematical reasoning, leading to error propagation that undermines the final result. The current methodology to mitigate this issue primarily involves using a verifier model to assess the correctness of generated solution candidates, focusing either on the overall reasoning path or on an incomplete reasoning path. By rethinking this approach, we argue that assessing the potential of incomplete reasoning paths could be more advantageous as it guides towards correct final answers, transforming the task into a planning problem. Our proposed verifier, the Outcome-supervision Value Model (OVM), employs outcome supervision for training, offering an efficient and intuitive method for planning by prioritizing steps that lead to accurate conclusions over mere per-step correctness. Furthermore, the OVM eschews the need for labor-intensive annotations on step-level correctness, enhancing its scalability. Our experiments on two multi-step mathematical reasoning datasets, GSM8K and Game of 24, demonstrate the superior performance of the OVM model. Notably, in GSM8K, our OVM-7B model achieves state-of-the-art results among LLMs up to 13B parameters, and it does so without using GPT-4 or code execution. These findings offer a novel perspective on the role of outcome supervision in training verifiers for multi-step reasoning tasks and provide theoretical justification for its advantage in value estimation for planning.

Operation · Model evaluation · Approximation · Optimizer · MoDELS

November 16, 2023

Residual-Based Error Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems

Prashant K. Jha

from arxiv, 36 pages, 14 figures, 3 tables

This work focuses on developing methods for approximating the solution operators of a class of parametric partial differential equations via neural operators. Neural operators have several challenges, including the issue of generating appropriate training data, cost-accuracy trade-offs, and nontrivial hyperparameter tuning. The unpredictability of the accuracy of neural operators impacts their applications in downstream problems of inference, optimization, and control. Building on earlier work in JCP 486 (2023) 112104, we consider a framework based on a linear variational problem that gives the correction to the prediction furnished by neural operators. The operator associated with the corrector problem, called the Residual-based Error Corrector Operator or simply the Corrector Operator, is analyzed further. Numerical results involving a nonlinear reaction-diffusion model in two dimensions with PCANet-type neural operators show an increase of almost two orders of magnitude in the accuracy of approximations when neural operators are corrected using the correction scheme. Further, topology optimization involving a nonlinear reaction-diffusion model is considered to highlight the limitations of neural operators and the efficacy of the correction scheme. Optimizers with neural operator surrogates are seen to make significant errors (as high as 80 percent). However, the errors are much lower (below 7 percent) when neural operators are corrected.

Reservoir computing · Processing (programming language) · PULSE · Model evaluation · Width

November 15, 2023

Biomembrane-based Memcapacitive Reservoir Computing System for Energy Efficient Temporal Data Processing

Md Razuan Hossain,Ahmed Salah Mohamed,Nicholas Xavier Armendarez,Joseph S. Najem,Md Sakib Hasan

from arxiv, Supplementary information is attached under the main text

Reservoir computing is a highly efficient machine learning framework for processing temporal data by extracting features from the input signal and mapping them into higher dimensional spaces. Physical reservoir layers have been realized using spintronic oscillators, atomic switch networks, silicon photonic modules, ferroelectric transistors, and volatile memristors. However, these devices are intrinsically energy-dissipative due to their resistive nature, which leads to increased power consumption. Therefore, capacitive memory devices can provide a more energy-efficient approach. Here, we leverage volatile biomembrane-based memcapacitors that closely mimic certain short-term synaptic plasticity functions as reservoirs to solve classification tasks and analyze time-series data in simulation and experimentally. Our system achieves a 99.6% accuracy rate for spoken digit classification and a normalized mean square error of 7.81*10^{-4} in a second-order non-linear regression task. Furthermore, to showcase the device's real-time temporal data processing capability, we achieve 100% accuracy for a real-time epilepsy detection problem from an inputted electroencephalography (EEG) signal. Most importantly, we demonstrate that each memcapacitor consumes an average of 41.5 fJ of energy per spike, regardless of the selected input voltage pulse width, while maintaining an average power of 415 fW for a pulse width of 100 ms. These values are orders of magnitude lower than those achieved by state-of-the-art memristors used as reservoirs. Lastly, we believe the biocompatible, soft nature of our memcapacitor makes it highly suitable for computing and signal-processing applications in biological environments.
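
The two energy figures quoted in the abstract above can be cross-checked with a short calculation. The sketch below assumes that the average power is simply the energy per spike divided by the pulse width; that relation is a reading of the abstract, not something it states explicitly, and the variable names are illustrative.

```python
FEMTO = 1e-15

energy_per_spike_j = 41.5 * FEMTO  # 41.5 fJ per spike, as quoted in the abstract
pulse_width_s = 100e-3             # 100 ms pulse width, as quoted in the abstract

# Assumed relation: average power = energy per spike / pulse width.
average_power_w = energy_per_spike_j / pulse_width_s
average_power_fw = average_power_w / FEMTO

print(f"{average_power_fw:.1f} fW")  # 415.0 fW
```

Under this assumption the result matches the 415 fW reported for a 100 ms pulse width.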

Optimizer · Moment · Estimation/Estimator · SimPLe · Functional

November 15, 2023

Resetting the Optimizer in Deep RL: An Empirical Study

Kavosh Asadi,Rasool Fakoor,Shoham Sabach

from arxiv, Accepted at Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

We focus on the task of approximating the optimal value function in deep reinforcement learning. This iterative process consists of solving a sequence of optimization problems where the loss function changes per iteration. The common approach to solving this sequence of problems is to employ modern variants of the stochastic gradient descent algorithm such as Adam. These optimizers maintain their own internal parameters such as estimates of the first-order and the second-order moments of the gradient, and update them over time. Therefore, information obtained in previous iterations is used to solve the optimization problem in the current iteration. We demonstrate that this can contaminate the moment estimates because the optimization landscape can change arbitrarily from one iteration to the next. To hedge against this negative effect, a simple idea is to reset the internal parameters of the optimizer when starting a new iteration. We empirically investigate this resetting idea by employing various optimizers in conjunction with the Rainbow algorithm. We demonstrate that this simple modification significantly improves the performance of deep RL on the Atari benchmark.
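
The resetting idea in the abstract above can be illustrated with a toy example. The sketch below is not the authors' implementation: it uses a hand-written scalar Adam (standard update rule with bias correction) on a hypothetical sequence of quadratic losses `(param - target) ** 2`, standing in for the changing loss of each iteration. The class, the function `solve_sequence`, and all hyperparameter values are assumptions made for illustration.

```python
import math

class Adam:
    """Scalar Adam optimizer with an explicit reset of its moment estimates."""

    def __init__(self, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.reset()

    def reset(self):
        """Discard the first and second moment estimates and the step count."""
        self.m = 0.0
        self.v = 0.0
        self.t = 0

    def step(self, param, grad):
        """Apply one Adam update and return the new parameter value."""
        self.t += 1
        self.m = self.beta1 * self.m + (1 - self.beta1) * grad
        self.v = self.beta2 * self.v + (1 - self.beta2) * grad * grad
        m_hat = self.m / (1 - self.beta1 ** self.t)  # bias-corrected first moment
        v_hat = self.v / (1 - self.beta2 ** self.t)  # bias-corrected second moment
        return param - self.lr * m_hat / (math.sqrt(v_hat) + self.eps)

def solve_sequence(targets, steps_per_iteration=200, reset=True):
    """Minimize (param - target)^2 for each target in turn (toy stand-in)."""
    opt = Adam()
    param = 0.0
    for target in targets:
        if reset:
            opt.reset()  # start each new optimization problem with fresh moments
        for _ in range(steps_per_iteration):
            grad = 2.0 * (param - target)
            param = opt.step(param, grad)
    return param

with_reset = solve_sequence([5.0, -3.0, 1.0], reset=True)
without_reset = solve_sequence([5.0, -3.0, 1.0], reset=False)
```

This toy problem only shows where the reset would sit in a training loop; it says nothing about whether resetting helps, which is the empirical question the paper studies on Atari.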

Next · Integration · Directed · Controller · Continuity

March 5, 2022

AI for Next Generation Computing: Emerging Trends and Future Directions

Sukhpal Singh Gill,Minxian Xu,Carlo Ottaviani,Panos Patros,Rami Bahsoon,Arash Shaghaghi,Muhammed Golec,Vlado Stankovski,Huaming Wu,Ajith Abraham,Manmeet Singh,Harshit Mehta,Soumya K. Ghosh,Thar Baker,Ajith Kumar Parlikad,Hanan Lutfiyya,Salil S. Kanhere,Rizos Sakellariou,Schahram Dustdar,Omer Rana,Ivona Brandic,Steve Uhlig

from arxiv, Accepted for Publication in Elsevier IoT Journal, 2022

Autonomic computing investigates how systems can achieve (user) specified control outcomes on their own, without the intervention of a human operator. Autonomic computing fundamentals have been substantially influenced by those of control theory for closed and open-loop systems. In practice, complex systems may exhibit a number of concurrent and inter-dependent control loops. Despite research into autonomic models for managing computer resources, ranging from individual resources (e.g., web servers) to a resource ensemble (e.g., multiple resources within a data center), research into integrating Artificial Intelligence (AI) and Machine Learning (ML) to improve resource autonomy and performance at scale continues to be a fundamental challenge. The integration of AI/ML to achieve such autonomic and self-management of systems can be achieved at different levels of granularity, from full to human-in-the-loop automation. In this article, leading academics, researchers, practitioners, engineers, and scientists in the fields of cloud computing, AI/ML, and quantum computing join to discuss current research and potential future directions for these fields. Further, we discuss challenges and opportunities for leveraging AI and ML in next generation computing for emerging computing paradigms, including cloud, fog, edge, serverless and quantum computing environments.

Example · End-to-end · Transformation · MoDELS · Understandability

March 24, 2021

End-to-End Video Instance Segmentation with Transformers

Yuqing Wang,Zhaoliang Xu,Xinlong Wang,Chunhua Shen,Baoshan Cheng,Hao Shen,Huaxia Xia

from arxiv, CVPR2021 Oral

Video instance segmentation (VIS) is the task that requires simultaneously classifying, segmenting and tracking object instances of interest in video. Recent methods typically develop sophisticated pipelines to tackle this task. Here, we propose a new video instance segmentation framework built upon Transformers, termed VisTR, which views the VIS task as a direct end-to-end parallel sequence decoding/prediction problem. Given a video clip consisting of multiple image frames as input, VisTR outputs the sequence of masks for each instance in the video in order directly. At the core is a new, effective instance sequence matching and segmentation strategy, which supervises and segments instances at the sequence level as a whole. VisTR frames the instance segmentation and tracking in the same perspective of similarity learning, thus considerably simplifying the overall pipeline and is significantly different from existing approaches. Without bells and whistles, VisTR achieves the highest speed among all existing VIS models, and achieves the best result among methods using single model on the YouTube-VIS dataset. For the first time, we demonstrate a much simpler and faster video instance segmentation framework built upon Transformers, achieving competitive accuracy. We hope that VisTR can motivate future research for more video understanding tasks.

Object recognition · MoDELS · Backbone · Extensibility · Learned

March 31, 2020

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Mohan Zhou,Yalong Bai,Wei Zhang,Tiejun Zhao,Tao Mei

from arxiv, 10 pages, 7 figures, accepted by CVPR 2020

Most object recognition approaches predominantly focus on learning discriminative visual patterns while overlooking the holistic object structure. Though important, structure modeling usually requires significant manual annotations and therefore is labor-intensive. In this paper, we propose to "look into object" (explicitly yet intrinsically model the object structure) through incorporating self-supervisions into the traditional framework. We show the recognition backbone can be substantially enhanced for more robust representation learning, without any cost of extra annotation and inference speed. Specifically, we first propose an object-extent learning module for localizing the object according to the visual patterns shared among the instances in the same category. We then design a spatial context learning module for modeling the internal structures of the object, through predicting the relative positions within the extent. These two modules can be easily plugged into any backbone networks during training and detached at inference time. Extensive experiments show that our look-into-object approach (LIO) achieves large performance gain on a number of benchmarks, including generic object recognition (ImageNet) and fine-grained object recognition tasks (CUB, Cars, Aircraft). We also show that this learning paradigm is highly generalizable to other tasks such as object detection and segmentation (MS COCO). Project page: //github.com/JDAI-CV/LIO.

Faster R-CNN · domain shift · R-CNN · Object detection · Reducible

March 8, 2018

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Yuhua Chen,Wen Li,Christos Sakaridis,Dengxin Dai,Luc Van Gool

from arxiv, Accepted to CVPR 2018

Object detection typically assumes that training and test data are drawn from an identical distribution, which, however, does not always hold in practice. Such a distribution mismatch will lead to a significant performance drop. In this work, we aim to improve the cross-domain robustness of object detection. We tackle the domain shift on two levels: 1) the image-level shift, such as image style, illumination, etc., and 2) the instance-level shift, such as object appearance, size, etc. We build our approach based on the recent state-of-the-art Faster R-CNN model, and design two domain adaptation components, on image level and instance level, to reduce the domain discrepancy. The two domain adaptation components are based on H-divergence theory, and are implemented by learning a domain classifier in an adversarial training manner. The domain classifiers on different levels are further reinforced with a consistency regularization to learn a domain-invariant region proposal network (RPN) in the Faster R-CNN model. We evaluate our newly proposed approach using multiple datasets including Cityscapes, KITTI, SIM10K, etc. The results demonstrate the effectiveness of our proposed approach for robust object detection in various domain shift scenarios.

Smoothing · Attention mechanism · Backpropagation · Viterbi algorithm · Regularization term

February 20, 2018

Differentiable Dynamic Programming for Structured Prediction and Attention

Arthur Mensch,Mathieu Blondel

Dynamic programming (DP) solves a variety of structured combinatorial problems by iteratively breaking them down into smaller subproblems. In spite of their versatility, DP algorithms are usually non-differentiable, which hampers their use as a layer in neural networks trained by backpropagation. To address this issue, we propose to smooth the max operator in the dynamic programming recursion, using a strongly convex regularizer. This allows us to relax both the optimal value and solution of the original combinatorial problem, and turns a broad class of DP algorithms into differentiable operators. Theoretically, we provide a new probabilistic perspective on backpropagating through these DP operators, and relate them to inference in graphical models. We derive two particular instantiations of our framework, a smoothed Viterbi algorithm for sequence prediction and a smoothed DTW algorithm for time-series alignment. We showcase these instantiations on two structured prediction tasks and on structured and sparse attention for neural machine translation.
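
To make the idea of smoothing the max operator concrete, the sketch below shows one well-known instance: with a negative-entropy regularizer scaled by `gamma`, the smoothed max becomes the log-sum-exp function and its gradient becomes the softmax. The abstract does not say which regularizer is used, so this choice, along with the function names and the `gamma` parameter, is an assumption made for illustration.

```python
import math

def smoothed_max(values, gamma=1.0):
    """Entropy-regularized max: gamma * log(sum(exp(v / gamma)))."""
    m = max(values)  # subtract the true max for numerical stability
    return m + gamma * math.log(sum(math.exp((v - m) / gamma) for v in values))

def smoothed_argmax(values, gamma=1.0):
    """Gradient of smoothed_max with respect to values: softmax(values / gamma)."""
    m = max(values)
    weights = [math.exp((v - m) / gamma) for v in values]
    total = sum(weights)
    return [w / total for w in weights]

scores = [1.0, 2.0, 3.0]
soft_value = smoothed_max(scores, gamma=1.0)    # slightly above the hard max of 3.0
soft_choice = smoothed_argmax(scores, gamma=1.0)  # a distribution, not a one-hot vector
near_hard = smoothed_max(scores, gamma=1e-3)    # approaches the hard max as gamma -> 0
```

Replacing every hard `max` in a DP recursion with such a smoothed operator yields a value that is differentiable in the inputs, and a small `gamma` recovers the original recursion in the limit.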