Skip to main content
QUICK REVIEW

[论文解读] Recent Advances in Adversarial Training for Adversarial Robustness

Tao Bai, Jinqi Luo|arXiv (Cornell University)|Feb 2, 2021
Adversarial Robustness in Machine Learning参考文献 79被引用 43
一句话总结

本综述回顾最近在对抗性训练(AT)方面的进展,提出一种新颖的分类法,讨论泛化挑战,并勾画未来方向。

ABSTRACT

Adversarial training is one of the most effective approaches defending against adversarial examples for deep learning models. Unlike other defense strategies, adversarial training aims to promote the robustness of models intrinsically. During the last few years, adversarial training has been studied and discussed from various aspects. A variety of improvements and developments of adversarial training are proposed, which were, however, neglected in existing surveys. For the first time in this survey, we systematically review the recent progress on adversarial training for adversarial robustness with a novel taxonomy. Then we discuss the generalization problems in adversarial training from three perspectives. Finally, we highlight the challenges which are not fully tackled and present potential future directions.

研究动机与目标

  • 提供关于对抗性训练方法及其提升鲁棒性的最新概览。
  • 介绍一种新颖的 AT 方法分类法,并将其与鲁棒性提升联系起来。
  • 讨论 AT 的泛化缺口,明确挑战与未来研究方向。

提出的方法

  • 评述并将近期的 AT 方法归入一个结构化的分类法中(对抗性正则化、课程学习、集成、自适应 epsilon、半监督/无监督、高效训练及其他变体)。
  • 总结来自 Table 1 的实验结果,以比较不同方法和数据集上的鲁棒性-性能权衡。
  • 讨论在标准准确率、对抗鲁棒性以及未知攻击上的泛化问题。
  • 强调最小-最大优化及泛化方面的理论与实践挑战,并提出超越 AT 的方向。

实验结果

研究问题

  • RQ1对抗性训练方法的关键家族有哪些?它们在公式化和目标上有何不同?
  • RQ2近期的 AT 方法在不同数据集和攻击上的表现如何?仍存在哪些泛化差距?
  • RQ3当前 AT 方法的主要局限性(例如,最小-最大优化、过拟合、未知攻击)及潜在的超越 AT 的方向有哪些?

主要发现

出版物模型架构攻击ε数据集准确度
Adversarial RegularizationQin et al. (2019)ResNet-152PGD 504/255ImageNet47.00%
Zhang et al. (2019b)Wide ResNetCW 100.031/1CIFAR-1084.03%
Wang et al. (2020)ResNet-18PGD 208/255CIFAR-1055.45%
Kannan et al. (2018)InceptionV3PGD 1016/255ImageNet27.90%
Mao et al. (2019)Wide ResNetPGD 208/255CIFAR-1050.03%
Zhang et al. (2020)Wide ResNetPGD 2016/255CIFAR-1049.86%
Cai et al. (2018)DenseNet-161PGD 78/255CIFAR-1069.27%
Wang et al. (2019)8-Layer ConvNetPGD 208/255CIFAR-1042.40%
Pang et al. (2019)Wide ResNetPGD 100.005CIFAR-10032.10%
Kariyappa and Qureshi (2019)ResNet-20PGD 300.09/1CIFAR-1046.30%
Yang et al. (2020a)ResNet-20PGD 200.01/1CIFAR-1052.4%
Balaji et al. (2019)ResNet-152PGD 10008/255ImageNet59.28%
Ding et al. (2020)Wide ResNetPGD 1008/255CIFAR-1047.18%
Cheng et al. (2020)Wide ResNetPGD 208/255CIFAR-1073.38%
Alayrac et al. (2019)Wide ResNetFGSM8/255CIFAR-1062.18%
Carmon et al. (2019)Wide ResNetPGD 108/255CIFAR-1063.10%
Zhai et al. (2019)Customized ResNetPGD 78/255CIFAR-1042.48%
Hendrycks et al. (2019)Wide ResNetPGD 200.3/1ImageNet50.40%
Shafahi et al. (2019)Wide ResNetPGD 1008/255CIFAR-1046.19%
Wong et al. (2020)ResNet-50PGD 402/255ImageNet43.43%
Andriushchenko and Flammarion (2020)ResNet-50PGD 502/255ImageNet41.40%
Kim et al. (2021)PreActResNet-18FGSM8/255CIFAR-1050.50%
Vivek and Babu (2020b)Wide ResNetPGD 408/255MNIST88.51%
Song et al. (2019)Customized ConvNetPGD 204/255CIFAR-1058.10%
Vivek and Babu (2020a)Wide ResNetPGD 1000.3/1MNIST90.03%
Huang et al. (2020)Wide ResNetPGD 208/255CIFAR-1045.80%
Zhang et al. (2019a)Wide ResNetPGD 208/255CIFAR-1047.98%
Dong et al. (2020)Wide ResNetPGD 208/255CIFAR-10029.40%
Wang and Zhang (2019)Wide ResNetCW 2004/255CIFAR-1060.30%
Zhang and Wang (2019)Wide ResNetPGD 208/255CIFAR-10047.20%
Pang et al. (2020b)Wide ResNetPGD 5008/255CIFAR-1060.75%
Lee et al. (2020)PreActResNet-18PGD 208/255Tiny ImageNet20.31%
Zhang and Xu (2020)Wide ResNetPGD 208/255CIFAR-1045.11%
Madry et al. (2018)ResNet-50PGD 208/255CIFAR-1045.80%
Wang and Zhang (2019)Wide ResNetCW 2004/255CIFAR-1060.30%
Zhang and Xu (2020)Wide ResNetPGD 208/255CIFAR-1045.11%
Pang et al. (2020a)Wide ResNetPGD 5008/255CIFAR-1060.75%
  • 对抗性训练仍是最有效的防御,但在对抗性评估下的准确性在许多数据集上仍显著低于干净准确率。
  • 存在大量的 AT 方法(正则化、课程学习、集成、自适应 epsilon、半监督/无监督、高效训练),在鲁棒性与标准准确性之间有不同的权衡。
  • 泛化差距(对抗性鲁棒泛化和对未知攻击的泛化)依然存在,尚未被当前的 AT 技术完整解决。
  • 当前做法常依赖基于 PGD 的内部优化,这并不提供正式的鲁棒性证明,且计算成本可能很高。
  • 半监督/无监督数据可以缩小样本复杂性差距并提升鲁棒性,尽管保证仍然有限。
  • 加速 AT 的努力(如 Free-AT、FAST-AT、YOPO)有助于降低训练时间,但若未加以缓解,可能引入灾难性过拟合等问题。

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。