QUICK REVIEW

[論文レビュー] Technical Report on the CleverHans v2.1.0 Adversarial Examples Library

Nicolas Papernot, Fartash Faghri|arXiv (Cornell University)|Oct 3, 2016

Adversarial Robustness in Machine Learning参考文献 2被引用数 404

ひとこと要約

CleverHans v2.1.0 は、堅牢なモデルのベンチマークと敵対的トレーニングを可能にする、敵対的攻撃と防御の標準化された TensorFlow リファレンス実装を提供します。

ABSTRACT

CleverHans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. The library may be used to develop more robust machine learning models and to provide standardized benchmarks of models' performance in the adversarial setting. Benchmarks constructed without a standardized implementation of adversarial example construction are not comparable to each other, because a good result may indicate a robust model or it may merely indicate a weak implementation of the adversarial example construction procedure. This technical report is structured as follows. Section 1 provides an overview of adversarial examples in machine learning and of the CleverHans software. Section 2 presents the core functionalities of the library: namely the attacks based on adversarial examples and defenses to improve the robustness of machine learning models to these attacks. Section 3 describes how to report benchmark results using the library. Section 4 describes the versioning system.

研究の動機と目的

敵対例作成技術の標準化されたリファレンス実装を提供する。
敵対的トレーニングと堅牢なモデル開発を可能にする。
共通の攻撃ライブラリを使用してベンチマーク結果を比較可能にする。
ベンチマークの構造、バージョニング、報告ガイドラインを説明する。

提案手法

Attack インターフェースを備えた攻撃モジュールを実装し、複数の特定の攻撃アルゴリズムを提供する。
Keras、TensorFlow など、さまざまなフレームワークと統合する Model 抽象化を提供する。
防御として敵対的訓練をトレーニングパイプライン内に組み込む。
再現可能なベンチマークとバージョン管理された結果を確保する報告ガイドラインを提供する。
セマンティックバージョニングを採用し、ベンチマークの比較可能性のためにリリースノートを文書化する。

実験結果

リサーチクエスチョン

RQ1標準化された攻撃実装は、敵対的堅牢性ベンチマークの比較可能性をどう向上させるか？
RQ2CleverHans v2.1.0 で実装されている攻撃と防御は何で、さまざまなモデルフレームワークとどのように相互作用するか？
RQ3研究者はバージョン間で再現性を確保するためにベンチマーク結果をどのように報告すべきか？

主な発見

攻撃ライブラリには FGSM、BIM、PGD、CW、EAD、JSMA、DeepFool、SPSA などが含まれ、対応する生成関数を備える。
防御は主にトレーニングパイプライン内で実装された敵対的訓練に焦点を当てる。
バージョニングの指針は互換性、再現性、および攻撃手法と設定の明示的な報告を強調する。
ライブラリはオープンソースで、TensorFlow ベース、そしてインターフェースレベルでフレームワーク非依存になるよう設計されている。
ベンチマークの報告には CleverHans のバージョンと攻撃設定を明記する必要があり、比較可能性を確保する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。