Skip to main content
QUICK REVIEW

[論文レビュー] Towards Stable and Efficient Training of Verifiably Robust Neural Networks

Huan Zhang, Hongge Chen|arXiv (Cornell University)|Jun 14, 2019
Adversarial Robustness in Machine Learning参考文献 51被引用数 39
ひとこと要約

tldr: CROWN-IBP を提案します。これは interval bound propagation (IBP) と 緊密な線形緩和境界 (CROWN) を組み合わせて、検証可能にロバストなニューラルネットワークを効率的に訓練する認定済みロバストトレーニング手法です。

ABSTRACT

Training neural networks with verifiable robustness guarantees is challenging. Several existing approaches utilize linear relaxation based neural network output bounds under perturbation, but they can slow down training by a factor of hundreds depending on the underlying network architectures. Meanwhile, interval bound propagation (IBP) based training is efficient and significantly outperforms linear relaxation based methods on many tasks, yet it may suffer from stability issues since the bounds are much looser especially at the beginning of training. In this paper, we propose a new certified adversarial training method, CROWN-IBP, by combining the fast IBP bounds in a forward bounding pass and a tight linear relaxation based bound, CROWN, in a backward bounding pass. CROWN-IBP is computationally efficient and consistently outperforms IBP baselines on training verifiably robust neural networks. We conduct large scale experiments on MNIST and CIFAR datasets, and outperform all previous linear relaxation and bound propagation based certified defenses in $\ell_\infty$ robustness. Notably, we achieve 7.02% verified test error on MNIST at $ε=0.3$, and 66.94% on CIFAR-10 with $ε=8/255$. Code is available at https://github.com/deepmind/interval-bound-propagation (TensorFlow) and https://github.com/huanzhang12/CROWN-IBP (PyTorch).

研究の動機と目的

  • Motivate the need for verifiable robustness guarantees in DNNs and analyze limitations of existing linear-relaxation and IBP methods.
  • Introduce CROWN-IBP as a hybrid certified training method that blends IBP and CROWN bounds.
  • Demonstrate that CROWN-IBP is both computationally efficient and yields tighter robustness bounds during training.
  • Show substantial improvements over prior IBP and linear-relaxation baselines on MNIST and CIFAR-10 under l-infinity attacks.

提案手法

  • Develop a training objective that combines natural cross-entropy loss with a robust loss based on a bound that is a weighted combination of IBP and CROWN-IBP margins.
  • Perform a forward bound pass using IBP to obtain tight per-layer z bounds; apply a ReLU relaxation to create linear upper/lower bounds (Eq. 10–11).
  • Perform a backward bound pass in a CROWN-like manner starting from the last layer to obtain a tight lower bound (Eq. 12–13).
  • Merge the forward IBP bounds and the backward CROWN-IBP bounds to compute a composite margin bound underline{m}(x,ε) used in the robust loss.
  • Allow a tunable parameter β to balance IBP and CROWN-IBP contributions, and a κ parameter to trade off standard versus verified accuracy ( Eq. 9 ).
  • Achieve improved scalability by avoiding full-CROWN complexity and leveraging the small output dimension n_L for efficient backward propagation.]
  • research_questions: ["Can a hybrid bound combining IBP and CROWN-IBP provide both stability during training and tighter verifiable robustness bounds?","Does CROWN-IBP enable significantly better verified error rates than pure IBP or linear-relaxation methods across MNIST and CIFAR-10 for common l_infinity budgets?","What is the impact of κ and β scheduling on the trade-off between standard accuracy and verifiable robustness?","Is the proposed method scalable to larger networks without prohibitive computational or memory overhead?"]
  • key_findings: ["CROWN-IBP yields lower verified error than IBP alone across MNIST and CIFAR-10 for tested ε settings (e.g., MNIST at ε=0.3: 7.02% verified error vs 8.21% for IBP; CIFAR-10 at ε=2/255: 46.03% verified error vs 55.88% for IBP).","CROWN-IBP outperforms pure IBP in both standard (clean) and verified accuracy, and achieves state-of-the-art results among IBP- and linear-relaxation-based methods on the evaluated datasets.","The method maintains favorable efficiency, with backward bound propagation complexity scaled by the output size n_L rather than the network width, improving scalability over traditional CROWN/Convex Adversarial Polytope approaches.","CROWN-IBP enables flexible trading between robustness and accuracy via κ and β schedules, demonstrating improved Pareto fronts for standard vs verified accuracy.","The study reports verifiable bounds for CIFAR-10 at ε=16/255, showing the method's applicability to larger perturbation budgets and datasets."]
  • table_headers: []
  • table_rows: []

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。