QUICK REVIEW

[论文解读] Boundary thickness and robustness in learning models

Yaoqing Yang, Rajiv Khanna|arXiv (Cornell University)|Jul 9, 2020

Adversarial Robustness in Machine Learning被引用 5

一句话总结

本文将边界厚度引入为分类器鲁棒性的关键度量指标，表明更厚的边界可减少过拟合并提升模型在对抗样本和分布外（OOD）输入上的性能。通过将边界厚度与混合训练（mixup training）相联系，并利用噪声增强的混合训练进一步提升边界厚度，该方法在多个基准测试中显著增强了鲁棒泛化能力和对抗鲁棒性。

ABSTRACT

Robustness of machine learning models to various adversarial and non-adversarial corruptions continues to be of interest. In this paper, we introduce the notion of the boundary thickness of a classifier, and we describe its connection with and usefulness for model robustness. Thick decision boundaries lead to improved performance, while thin decision boundaries lead to overfitting (e.g., measured by the robust generalization gap between training and testing) and lower robustness. We show that a thicker boundary helps improve robustness against adversarial examples (e.g., improving the robust test accuracy of adversarial training) as well as so-called out-of-distribution (OOD) transforms, and we show that many commonly-used regularization and data augmentation procedures can increase boundary thickness. On the theoretical side, we establish that maximizing boundary thickness during training is akin to the so-called mixup training. Using these observations, we show that noise-augmentation on mixup training further increases boundary thickness, thereby combating vulnerability to various forms of adversarial attacks and OOD transforms. We can also show that the performance improvement in several lines of recent work happens in conjunction with a thicker boundary.

研究动机与目标

形式化并量化机器学习分类器中决策边界厚度的概念。
建立边界厚度与对抗样本及分布外分布鲁棒性之间理论与实证上的联系。
证明正则化与数据增强技术通过增加边界厚度来提升鲁棒性。
表明噪声增强的混合训练可进一步增厚决策边界，从而提升鲁棒泛化能力。
通过边界厚度增厚的视角，解释近期鲁棒训练方法的性能提升。

提出的方法

将边界厚度定义为特征空间中决策边界到最近训练样本的最小距离的几何度量。
理论上将边界厚度最大化与混合训练的目标相联系，后者通过插值训练样本实现。
提出噪声增强的混合训练方法，即在混合插值过程中添加随机噪声，以进一步增厚边界。
通过实证评估比较不同训练方式（包括标准训练、对抗训练和数据增强训练）下的边界厚度。
通过对抗扰动和分布外扰动下的鲁棒泛化差距与鲁棒测试准确率来衡量鲁棒性。
在多个数据集和网络架构上分析边界厚度与模型性能之间的关系。

实验结果

研究问题

RQ1决策边界厚度与模型在对抗样本和分布外样本上的鲁棒性之间有何关系？
RQ2常见的正则化与数据增强技术在多大程度上增加了边界厚度？
RQ3与标准混合训练相比，能否证明噪声增强的混合训练可进一步增厚决策边界？
RQ4近期鲁棒训练方法的性能提升是否可归因于边界厚度的增加？
RQ5边界厚度最大化与混合训练目标之间存在何种理论联系？

主要发现

更厚的决策边界与改进的鲁棒泛化能力密切相关，表现为训练与测试性能之间的鲁棒泛化差距显著减小。
采用噪声增强混合训练的模型表现出显著更厚的边界，并在对抗攻击下实现了比标准混合训练更高的鲁棒测试准确率。
多种正则化与数据增强技术（如CutMix和Cutout）均能增厚边界，且其增厚程度与鲁棒性提升呈正相关。
理论分析证实，最大化边界厚度等价于混合训练的目标函数，为混合训练提供了几何解释。
近期鲁棒训练方法（如使用对抗训练或一致性正则化的模型）的性能提升与可测量的边界厚度增加高度一致。
边界厚度为多种训练策略下提升鲁棒性的机制提供了一个统一的几何解释。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。