QUICK REVIEW

[论文解读] Transferable Clean-Label Poisoning Attacks on Deep Neural Nets

Chen Zhu, Wei Huang|arXiv (Cornell University)|May 15, 2019

Adversarial Robustness in Machine Learning参考文献 30被引用 136

一句话总结

本文提出一种可转移的 clean-label 投毒方法，Convex Polytope Attack，在特征空间围绕目标以诱导错分类，在污染约 1% 的训练数据的同时实现超过 50% 的成功率。

ABSTRACT

Clean-label poisoning attacks inject innocuous looking (and "correctly" labeled) poison images into training data, causing a model to misclassify a targeted image after being trained on this data. We consider transferable poisoning attacks that succeed without access to the victim network's outputs, architecture, or (in some cases) training data. To achieve this, we propose a new "polytope attack" in which poison images are designed to surround the targeted image in feature space. We also demonstrate that using Dropout during poison creation helps to enhance transferability of this attack. We achieve transferable attack success rates of over 50% while poisoning only 1% of the training set.

研究动机与目标

Highlight the security risk of clean-label data poisoning when data is scraped from the web.
Develop a model-agnostic poisoning strategy that transfers to unknown victim networks without access to outputs or architecture.
Improve attack transferability via a convex polytope in feature space.
Enhance transferability by using Dropout to simulate an ensemble of substitute models.
Explore the effectiveness of multi-layer and end-to-end training scenarios for the attack.

提出的方法

Define a threat model with no access to the victim's outputs or parameters; assume the attacker can train substitute models on a similar distribution.
Propose Convex Polytope Attack that enforces the target's feature vector to lie inside the convex hull of poison features across substitute models.
Formulate an optimization that minimizes the distance between the target features and its convex combination of poison features, with constraints on perturbation size.
Solve the non-convex problem via an alternating method using forward-backward splitting for coefficients and a gradient step for poison images.
Enhance transferability by (a) applying Dropout during poison crafting to simulate an ensemble, and (b) enforcing the polytope objective across multiple network layers.]
research_questions:[]
key_findings:[]
table_headers:[]
table_rows:[]}]}』{
title

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。

[论文解读] Transferable Clean-Label Poisoning Attacks on Deep Neural Nets

研究动机与目标

提出的方法

相关论文

更好的研究，从现在开始