QUICK REVIEW

[论文解读] Analysis of universal adversarial perturbations

Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi|arXiv (Cornell University)|May 26, 2017

Adversarial Robustness in Machine Learning被引用 91

一句话总结

本文首次对深度神经网络在通用对抗扰动下的鲁棒性进行了定量分析，将模型的脆弱性与决策边界的正曲率联系起来。论文证明，决策边界几何中共享的正曲率方向使得小而通用的扰动成为可能，并提出了一种基于几何的计算方法来生成这些扰动。

ABSTRACT

Deep networks have recently been shown to be vulnerable to universal perturbations: there exist very small image-agnostic perturbations that cause most natural images to be misclassified by such classifiers. In this paper, we propose the first quantitative analysis of the robustness of classifiers to universal perturbations, and draw a formal link between the robustness to universal perturbations, and the geometry of the decision boundary. Specifically, we establish theoretical bounds on the robustness of classifiers under two decision boundary models (flat and curved models). We show in particular that the robustness of deep networks to universal perturbations is driven by a key property of their curvature: there exists shared directions along which the decision boundary of deep networks is systematically positively curved. Under such conditions, we prove the existence of small universal perturbations. Our analysis further provides a novel geometric method for computing universal perturbations, in addition to explaining their properties.

研究动机与目标

对深度分类器在通用对抗扰动下的鲁棒性进行形式化分析。
建立通用扰动脆弱性与决策边界几何特性的理论联系。
将决策边界建模为平坦或弯曲形式，并在每种模型下推导鲁棒性边界。
识别曲率作为通用扰动存在的关键驱动因素。
基于决策边界曲率开发一种新型几何方法来计算通用扰动。

提出的方法

对两种决策边界模型（平坦与弯曲）下的鲁棒性进行理论分析。
基于决策边界的曲率特性，形式化推导鲁棒性的边界。
识别决策边界在其中表现出系统性正曲率的共享方向。
开发一种基于曲率信息的几何算法，用于计算通用扰动。
利用基于曲率的不变性，解释扰动在自然图像间泛化的原因。
数学证明：在共享方向上存在正曲率，意味着小规模通用扰动的存在。

实验结果

研究问题

RQ1决策边界的哪些几何特性使深度网络对通用扰动产生脆弱性？
RQ2决策边界的曲率如何影响通用扰动的存在性与大小？
RQ3能否基于决策边界几何特性，形式化界定分类器对通用扰动的鲁棒性？
RQ4共享的正曲率方向在促成通用扰动中起到什么作用？
RQ5能否基于曲率推导出一种几何方法来计算通用扰动？

主要发现

深度网络在其决策边界的共享方向上表现出系统性的正曲率，这使得小规模通用扰动成为可能。
理论边界表明，具有正曲率决策边界的分类器本质上对通用扰动具有脆弱性。
所提出的几何方法通过利用决策边界中的曲率信息，成功计算出通用扰动。
该分析解释了为何通用扰动能在多样化的自然图像间泛化，尽管其与图像无关。
正曲率被确定为驱动通用对抗样本形成的结构性关键属性。
本研究为理解与预测通用扰动脆弱性建立了正式的几何基础。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。