Deep neural networks usually require large labeled datasets for training to achieve state-of-the-art performance in many tasks, such as image classification and natural language processing. Although active Internet users create a large amount of data each day, most of these data are unlabeled and are vulnerable to data poisoning attacks. In this paper, we develop an efficient active learning method that requires fewer labeled instances and incorporates the technique of adversarial retraining, in which additional labeled artificial data are generated without increasing the labeling budget. The generated adversarial examples also provide a way to measure the vulnerability of the model. To check the performance of the proposed method under an adversarial setting, i.e., malicious mislabeling and data poisoning attacks, we perform an extensive evaluation on the reduced CIFAR-10 dataset, which contains only two classes: airplane and frog. Our experimental results demonstrate that the proposed active learning method is efficient for defending against malicious mislabeling and data poisoning attacks. Specifically, whereas the baseline active learning method based on the random sampling strategy performs poorly (about 50% accuracy) under a malicious mislabeling attack, the proposed active learning method can achieve the desired accuracy of 89% using only one-third of the dataset on average.
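The abstract contrasts its method with a random-sampling baseline but does not say which selection strategy it uses. As a minimal sketch, assuming uncertainty sampling (a common active learning strategy, not necessarily the authors' choice), the query step for a two-class problem could look like this. The function name and the probability values are hypothetical.

```python
def select_most_uncertain(probabilities, budget):
    """Return the indices of the `budget` pool items whose predicted
    positive-class probability is closest to 0.5 (least confident)."""
    ranked = sorted(range(len(probabilities)),
                    key=lambda i: abs(probabilities[i] - 0.5))
    return ranked[:budget]


# Hypothetical model outputs P(class = airplane) for six unlabeled images.
pool_probs = [0.97, 0.52, 0.10, 0.45, 0.80, 0.03]

# Ask the labeler about the two images the model is least sure about.
query_indices = select_most_uncertain(pool_probs, budget=2)
print(query_indices)
```

A random-sampling baseline would instead draw the indices uniformly from the pool, spending part of the labeling budget on images the model already classifies confidently.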

Related content

Active learning

Active learning is a subfield of machine learning (and, more generally, of artificial intelligence); in statistics it is also called query learning or optimal experimental design. The "learning module" and the "selection strategy" are the two basic and important modules of an active learning algorithm. Active learning is "a method of learning in which students participate actively or experientially in the learning process, and in which there are different levels of active learning depending on the degree of student involvement." (Bonwell & Eison 1991) Bonwell & Eison (1991) state: "Students engage in other activities besides passively listening to lectures." In a report for the Association for the Study of Higher Education (ASHE), the authors discuss a variety of methods for promoting active learning. They cite literature indicating that, in order to learn, students must do more than just listen: they must read, write, discuss, and take part in solving problems. This process involves three learning domains, namely knowledge, skills, and attitudes (KSA). This taxonomy of learning behaviors can be regarded as "the goals of the learning process". In particular, students must engage in higher-order thinking tasks such as analysis, synthesis, and evaluation.

Speech recognition · CRAFT · Performer · Automatic speech recognition · MoDELS

October 25, 2021

VenoMave: Targeted Poisoning Against Speech Recognition

Hojjat Aghakhani,Lea Schönherr,Thorsten Eisenhofer,Dorothea Kolossa,Thorsten Holz,Christopher Kruegel,Giovanni Vigna

The wide adoption of Automatic Speech Recognition (ASR) has remarkably enhanced human-machine interaction. Prior research has demonstrated that modern ASR systems are susceptible to adversarial examples, i.e., malicious audio inputs that lead to misclassification by the victim's model at run time. The research question of whether ASR systems are also vulnerable to data-poisoning attacks is still unanswered. In such an attack, a manipulation happens during the training phase: an adversary injects malicious inputs into the training set to compromise the neural network's integrity and performance. Prior work in the image domain demonstrated several types of data-poisoning attacks, but these results cannot directly be applied to the audio domain. In this paper, we present the first data-poisoning attack against ASR, called VenoMave. We evaluate our attack on an ASR system that detects sequences of digits. When poisoning only 0.17% of the dataset on average, we achieve an attack success rate of 86.67%. To demonstrate the practical feasibility of our attack, we also evaluate whether the target audio waveform can be played over the air via simulated room transmissions. In this more realistic threat model, VenoMave still maintains a success rate of up to 73.33%. We further extend our evaluation to the Speech Commands corpus and demonstrate the scalability of VenoMave to a larger vocabulary. During a transcription test with human listeners, we verify that more than 85% of the original text of poisons can be correctly transcribed. We conclude that data-poisoning attacks against ASR represent a real threat, and we are able to perform poisoning for arbitrary target input files while the crafted poison samples remain inconspicuous.

Reducible · Estimate/estimator · Estimation error · CRAFT · AIM

February 18, 2021

Data Poisoning Attacks and Defenses to Crowdsourcing Systems

Minghong Fang,Minghao Sun,Qi Li,Neil Zhenqiang Gong,Jin Tian,Jia Liu

from arxiv, To appear in the Web Conference 2021 (WWW '21)

A key challenge of big data analytics is how to collect a large volume of (labeled) data. Crowdsourcing aims to address this challenge via aggregating and estimating high-quality data (e.g., sentiment label for text) from pervasive clients/users. Existing studies on crowdsourcing focus on designing new methods to improve the aggregated data quality from unreliable/noisy clients. However, the security aspects of such crowdsourcing systems remain under-explored to date. We aim to bridge this gap in this work. Specifically, we show that crowdsourcing is vulnerable to data poisoning attacks, in which malicious clients provide carefully crafted data to corrupt the aggregated data. We formulate our proposed data poisoning attacks as an optimization problem that maximizes the error of the aggregated data. Our evaluation results on one synthetic and two real-world benchmark datasets demonstrate that the proposed attacks can substantially increase the estimation errors of the aggregated data. We also propose two defenses to reduce the impact of malicious clients. Our empirical results show that the proposed defenses can substantially reduce the estimation errors of the data poisoning attacks.

Suboptimal · ML · Minimum point · state-of-the-art · MoDELS

December 10, 2020

Composite Adversarial Attacks

Xiaofeng Mao,Yuefeng Chen,Shuhui Wang,Hang Su,Yuan He,Hui Xue

from arxiv, To appear in AAAI 2021, code will be released later

Adversarial attack is a technique for deceiving Machine Learning (ML) models, which provides a way to evaluate adversarial robustness. In practice, attack algorithms are selected and tuned by human experts to break an ML system. However, manual selection of attackers tends to be sub-optimal, leading to a mistaken assessment of model security. In this paper, a new procedure called Composite Adversarial Attack (CAA) is proposed for automatically searching for the best combination of attack algorithms and their hyper-parameters from a candidate pool of 32 base attackers. We design a search space where an attack policy is represented as an attacking sequence, i.e., the output of the previous attacker is used as the initialization input for its successors. The multi-objective NSGA-II genetic algorithm is adopted for finding the strongest attack policy with minimum complexity. The experimental results show that CAA beats 10 top attackers on 11 diverse defenses with less elapsed time (6 $\times$ faster than AutoAttack), and achieves the new state-of-the-art on $l_{\infty}$, $l_{2}$ and unrestricted adversarial attacks.
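The idea of an attacking sequence, where each attacker starts from its predecessor's output, can be illustrated with a small sketch. This is not the CAA implementation: the two toy attackers, the function names, and the choice to re-project onto an $l_{\infty}$ ball after every step are assumptions made for illustration, and the NSGA-II policy search is not shown.

```python
def clip_to_linf_ball(x_adv, x_orig, eps):
    """Project x_adv back into the l_inf ball of radius eps around x_orig."""
    return [min(max(a, o - eps), o + eps) for a, o in zip(x_adv, x_orig)]


def run_attack_sequence(x_orig, attackers, eps):
    """Chain attackers: each one is initialized with the previous output."""
    x_adv = list(x_orig)
    for attack in attackers:
        x_adv = clip_to_linf_ball(attack(x_adv), x_orig, eps)
    return x_adv


# Two hypothetical toy attackers standing in for real attack algorithms.
def small_step(x):
    return [v + 0.05 for v in x]


def large_step(x):
    return [v + 0.1 for v in x]


result = run_attack_sequence([0.0, 0.5], [small_step, large_step], eps=0.1)
print(result)
```

In this toy run the second attacker pushes the input past the allowed perturbation, and the projection pulls it back to the boundary of the ball around the original input.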

Robustness · MoDELS · Continuity · Taxonomy · Federated learning

December 7, 2020

Privacy and Robustness in Federated Learning: Attacks and Defenses

Lingjuan Lyu,Han Yu,Xingjun Ma,Lichao Sun,Jun Zhao,Qiang Yang,Philip S. Yu

from arxiv, arXiv admin note: text overlap with arXiv:2003.02133; text overlap with arXiv:1911.11815 by other authors

As data are increasingly being stored in different silos and societies become more aware of data privacy issues, the traditional centralized training of artificial intelligence (AI) models is facing efficiency and privacy challenges. Recently, federated learning (FL) has emerged as an alternative solution and continues to thrive in this new reality. Existing FL protocol designs have been shown to be vulnerable to adversaries within or outside of the system, compromising data privacy and system robustness. Besides training powerful global models, it is of paramount importance to design FL systems that have privacy guarantees and are resistant to different types of adversaries. In this paper, we conduct the first comprehensive survey on this topic. Through a concise introduction to the concept of FL, and a unique taxonomy covering: 1) threat models; 2) poisoning attacks and defenses against robustness; 3) inference attacks and defenses against privacy, we provide an accessible review of this important topic. We highlight the intuitions, key techniques as well as fundamental assumptions adopted by various attacks and defenses. Finally, we discuss promising future research directions towards robust and privacy-preserving federated learning.

Weight · MoDELS · surge · Ripple · Sentiment classification

April 14, 2020

Weight Poisoning Attacks on Pre-trained Models

Keita Kurita,Paul Michel,Graham Neubig

from arxiv, Published as a long paper at ACL 2020

Recently, NLP has seen a surge in the usage of large pre-trained models. Users download weights of models pre-trained on large datasets, then fine-tune the weights on a task of their choice. This raises the question of whether downloading untrusted pre-trained weights can pose a security threat. In this paper, we show that it is possible to construct "weight poisoning" attacks where pre-trained weights are injected with vulnerabilities that expose "backdoors" after fine-tuning, enabling the attacker to manipulate the model prediction simply by injecting an arbitrary keyword. We show that by applying a regularization method, which we call RIPPLe, and an initialization procedure, which we call Embedding Surgery, such attacks are possible even with limited knowledge of the dataset and fine-tuning procedure. Our experiments on sentiment classification, toxicity detection, and spam detection show that this attack is widely applicable and poses a serious threat. Finally, we outline practical defenses against such attacks. Code to reproduce our experiments is available at //github.com/neulab/RIPPLe.

Performer · Networking · Capsule · state-of-the-art · Category

February 18, 2020

Deflecting Adversarial Attacks

Yao Qin,Nicholas Frosst,Colin Raffel,Garrison Cottrell,Geoffrey Hinton

There has been an ongoing cycle where stronger defenses against adversarial attacks are subsequently broken by a more advanced defense-aware attack. We present a new approach towards ending this cycle where we "deflect" adversarial attacks by causing the attacker to produce an input that semantically resembles the attack's target class. To this end, we first propose a stronger defense based on Capsule Networks that combines three detection mechanisms to achieve state-of-the-art detection performance on both standard and defense-aware attacks. We then show that undetected attacks against our defense often perceptually resemble the adversarial target class by performing a human study where participants are asked to label images produced by the attack. These attack images can no longer be called "adversarial" because our network classifies them the same way as humans do.

MoDELS · Federated learning · Learned · CASES · Robustness

November 26, 2019

Local Model Poisoning Attacks to Byzantine-Robust Federated Learning

Minghong Fang,Xiaoyu Cao,Jinyuan Jia,Neil Zhenqiang Gong

from arxiv, The paper was submitted to Usenix Security Symposium in February 2019 and will appear in Usenix Security Symposium 2020

In federated learning, multiple client devices jointly learn a machine learning model: each client device maintains a local model for its local training dataset, while a master device maintains a global model via aggregating the local models from the client devices. The machine learning community recently proposed several federated learning methods that were claimed to be robust against Byzantine failures (e.g., system failures, adversarial manipulations) of certain client devices. In this work, we perform the first systematic study on local model poisoning attacks to federated learning. We assume an attacker has compromised some client devices, and the attacker manipulates the local model parameters on the compromised client devices during the learning process such that the global model has a large testing error rate. We formulate our attacks as optimization problems and apply our attacks to four recent Byzantine-robust federated learning methods. Our empirical results on four real-world datasets show that our attacks can substantially increase the error rates of the models learnt by the federated learning methods that were claimed to be robust against Byzantine failures of some client devices. We generalize two defenses for data poisoning attacks to defend against our local model poisoning attacks. Our evaluation results show that one defense can effectively defend against our attacks in some cases, but the defenses are not effective enough in other cases, highlighting the need for new defenses against our local model poisoning attacks to federated learning.
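The aggregation step described above, and the reason a single manipulated local model matters, can be shown with a toy example. The sketch below assumes plain coordinate-wise averaging as the non-robust rule and the coordinate-wise median as one example of a Byzantine-robust rule; the abstract does not name the four methods it attacks, and all parameter values are hypothetical. It shows naive poisoning only, not the optimized attacks from the paper, which are designed to degrade robust rules as well.

```python
from statistics import mean, median


def aggregate(local_models, rule):
    """Combine local parameter vectors coordinate by coordinate."""
    return [rule(coords) for coords in zip(*local_models)]


# Hypothetical local models from four honest clients (two parameters each).
honest = [[1.0, 2.0], [1.2, 1.8], [0.9, 2.1], [1.1, 2.0]]
# One compromised client submits arbitrarily manipulated parameters.
poisoned = [[-50.0, 60.0]]
clients = honest + poisoned

mean_model = aggregate(clients, mean)      # dragged far from the honest values
median_model = aggregate(clients, median)  # stays close to the honest values
print(mean_model, median_model)
```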

Robustness · Discriminator · MoDELS · Log-likelihood · Naive Bayes

July 9, 2018

Are Generative Classifiers More Robust to Adversarial Attacks?

Yingzhen Li,John Bradshaw,Yash Sharma

from arxiv, Presented as an oral talk at the ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models

There is a rising interest in studying the robustness of deep neural network classifiers against adversaries, with both advanced attack and defence techniques being actively developed. However, most recent work focuses on discriminative classifiers, which only model the conditional distribution of the labels given the inputs. In this paper we propose the deep Bayes classifier, which improves classical naive Bayes with conditional deep generative models. We further develop detection methods for adversarial examples, which reject inputs that have negative log-likelihood under the generative model exceeding a threshold pre-specified using training data. Experimental results suggest that deep Bayes classifiers are more robust than deep discriminative classifiers, and the proposed detection methods achieve high detection rates against many recently proposed attacks.
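The detection rule in the abstract, rejecting inputs whose negative log-likelihood under the generative model exceeds a threshold set from training data, can be sketched with a one-dimensional Gaussian standing in for the deep generative model. The Gaussian model, the data, and the choice of the largest training NLL as the threshold are all assumptions for illustration; the paper's own models and thresholding rule may differ.

```python
import math
from statistics import mean, pstdev


def gaussian_nll(x, mu, sigma):
    """Negative log-likelihood of x under a 1-D Gaussian N(mu, sigma^2)."""
    return 0.5 * math.log(2 * math.pi * sigma ** 2) + (x - mu) ** 2 / (2 * sigma ** 2)


# Hypothetical training data for a single class.
train = [0.9, 1.1, 1.0, 0.8, 1.2, 1.0, 0.95, 1.05]
mu, sigma = mean(train), pstdev(train)

# Assumed rule: the threshold is the largest NLL seen on the training data.
threshold = max(gaussian_nll(x, mu, sigma) for x in train)


def is_rejected(x):
    """Flag x as suspicious if its NLL exceeds the training threshold."""
    return gaussian_nll(x, mu, sigma) > threshold


print(is_rejected(1.0), is_rejected(3.0))
```

An input near the training data is accepted, while one far from it receives a much higher negative log-likelihood and is rejected.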

Networking · Learned · CARS · Integration · DNN

July 5, 2018

Sequential Attacks on Agents for Long-Term Adversarial Goals

Edgar Tretschk,Seong Joon Oh,Mario Fritz

Reinforcement learning (RL) has advanced greatly in the past few years with the employment of effective deep neural networks (DNNs) as policy networks. With this effectiveness came a serious vulnerability of DNNs: small adversarial perturbations on the input can change the output of the network. Several works have pointed out that learned agents with a DNN policy network can be kept from achieving their original task through a sequence of small perturbations on the input states. In this paper, we further demonstrate that it is also possible to impose an arbitrary adversarial reward on the victim policy network through a sequence of attacks. Our method involves the latest adversarial attack technique, Adversarial Transformer Network (ATN), which learns to generate the attack and is easy to integrate into the policy network. As a result of our attack, the victim agent is misguided to optimise for the adversarial reward over time. Our results expose serious security threats for RL applications in safety-critical systems including drones, medical analysis, and self-driving cars.

Example · Similarity · Speech recognition · End-to-end · Transcription

January 5, 2018

Audio Adversarial Examples: Targeted Attacks on Speech-to-Text

Nicholas Carlini,David Wagner

We construct targeted audio adversarial examples on automatic speech recognition. Given any audio waveform, we can produce another that is over 99.9% similar, but transcribes as any phrase we choose (at a rate of up to 50 characters per second). We apply our iterative optimization-based attack to Mozilla's implementation DeepSpeech end-to-end, and show it has a 100% success rate. The feasibility of this attack introduces a new domain to study adversarial examples.