QUICK REVIEW

[論文レビュー] Data Augmentation Revisited: Rethinking the Distribution Gap between Clean and Augmented Data

Zhuoxun He, Lingxi Xie|arXiv (Cornell University)|Sep 19, 2019

Advanced Neural Network Applications参考文献 47被引用数 35

ひとこと要約

この論文はデータ拡張を正則化として解析し、期待リスクの上界を導出し、分布ギャップを埋めるために訓練の最後に拡張を少なくするまたは拡張を終了することで拡張を洗練させ、分類で一貫した利得を得て検出への転移を改善することを提案する。

ABSTRACT

Data augmentation has been widely applied as an effective methodology to improve generalization in particular when training deep neural networks. Recently, researchers proposed a few intensive data augmentation techniques, which indeed improved accuracy, yet we notice that these methods augment data have also caused a considerable gap between clean and augmented data. In this paper, we revisit this problem from an analytical perspective, for which we estimate the upper-bound of expected risk using two terms, namely, empirical risk and generalization error, respectively. We develop an understanding of data augmentation as regularization, which highlights the major features. As a result, data augmentation significantly reduces the generalization error, but meanwhile leads to a slightly higher empirical risk. On the assumption that data augmentation helps models converge to a better region, the model can benefit from a lower empirical risk achieved by a simple method, i.e., using less-augmented data to refine the model trained on fully-augmented data. Our approach achieves consistent accuracy gain on a few standard image classification benchmarks, and the gain transfers to object detection.

研究の動機と目的

intensive data augmentation がクリーンなデータと拡張データの間に分布ギャップを作る理由を動機付ける
augmentation 下での経験的リスクと一般化誤差を分離して期待リスクの上界を再定式化する
augmentation が主要特徴を保持しつつ minor features に正則化効果を及ぼすことを説明する
訓練の終盤で拡張を減らす refined な訓練戦略を提案する
標準ベンチマークと検出タスクへの転移で改善を実証する

提案手法

データ拡張を用いた期待リスクを定式化し、P と P_aug を区別する
major features と minor features への特徴空間分解を用いて intensive augmentation の経験的リスクの収束を分析する
微小特徴を正則化し対応する重みを制約する方法をテイラー展開で正当化する
refined なデータ拡張を提案する： refinement 中は標準拡張を継続するが訓練終盤には intensive augmentation を減少または停止する
CIFAR-10/100、Tiny-ImageNet、ImageNet、Pascal VOC with Faster R-CNN を用いて分類と検出転移を検証する

実験結果

リサーチクエスチョン

RQ1データ拡張は経験的リスクと一般化誤差を介して期待リスクの上界にどのような影響を与えるのか？
RQ2大きな分布ギャップが存在する場合、 intensive augmentation は経験的リスクの収束とどのように矛盾し得るのか？
RQ3 訓練スケジュールで終盤に拡張を減らすと最終的な一般化と検出への転移は改善されるのか？
RQ4 refined な拡張の標準ベンチマークと検出タスクでの実用的な利得はどの程度か？

主な発見

intensive augmentation は一般化誤差を低減するが、拡張データ上での経験的リスクを増加させる可能性がある。
refined な拡張、すなわち終盤に拡張を終了することは経験的リスクをさらに低減し、テスト性能を改善する。
拡張はモデルをパラメータ空間のより良い領域へ収束させ、 refinement は CIFAR、Tiny-ImageNet、ImageNet での利得を維持または向上させる。
refinement は物体検出への転移を改善し、訓練終における拡張が位置情報に敏感な特徴を回復するのを助けることを示唆する。
Mixup、Manifold Mixup、CutMix、AutoAugment を含むさまざまな拡張タイプでも利点が持続し、 refine によって拡張データへの過学習を緩和する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。