QUICK REVIEW

[论文解读] AutoAugment: Learning Augmentation Policies from Data

Ekin D. Cubuk, Barret Zoph|arXiv (Cornell University)|May 24, 2018

Advanced Neural Network Applications参考文献 65被引用 732

一句话总结

AutoAugment 自动搜索有效数据增强策略，使用控制器 RNN 与强化学习，在多个数据集上取得了最先进的结果，并能在数据集之间迁移策略。

ABSTRACT

Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment to automatically search for improved data augmentation policies. In our implementation, we have designed a search space where a policy consists of many sub-policies, one of which is randomly chosen for each image in each mini-batch. A sub-policy consists of two operations, each operation being an image processing function such as translation, rotation, or shearing, and the probabilities and magnitudes with which the functions are applied. We use a search algorithm to find the best policy such that the neural network yields the highest validation accuracy on a target dataset. Our method achieves state-of-the-art accuracy on CIFAR-10, CIFAR-100, SVHN, and ImageNet (without additional data). On ImageNet, we attain a Top-1 accuracy of 83.5% which is 0.4% better than the previous record of 83.1%. On CIFAR-10, we achieve an error rate of 1.5%, which is 0.6% better than the previous state-of-the-art. Augmentation policies we find are transferable between datasets. The policy learned on ImageNet transfers well to achieve significant improvements on other datasets, such as Oxford Flowers, Caltech-101, Oxford-IIT Pets, FGVC Aircraft, and Stanford Cars.

研究动机与目标

促使自动发现增强策略以提高图像分类器的泛化能力。
为策略定义一个搜索空间，该搜索空间由包含顺序图像操作的子策略组成。
使用控制器 RNN 和策略梯度强化学习来优化验证准确率。
展示学习策略的直接应用（AutoAugment-direct）和跨数据集迁移（AutoAugment-transfer）。

提出的方法

将一个策略定义为 5 个子策略，每个子策略按顺序包含两个图像操作。
每个操作有一个应用概率和一个大小值（强度）。
使用控制器 RNN 来采样策略，并训练一个子模型以将验证准确性作为奖励进行度量。
使用来自奖励的策略梯度，通过近端策略优化（PPO）来优化控制器。
将最好的 5 个策略连接成一个最终的 25 子策略策略，用于训练数据集。

实验结果

研究问题

RQ1自动学习的增强策略是否能在性能上超越手工设计的增强方法？
RQ2在一个数据集上发现的策略是否可以迁移到其他数据集和架构？
RQ3策略多样性和随机应用对泛化有什么影响？
RQ4在准确性和可迁移性方面，AutoAugment 与先前的自动化增强方法相比如何？

主要发现

数据集	GPU 小时（估计）	最优公开结果	我们的结果
CIFAR-10	5000	2.1	1.5
CIFAR-100	0	12.2	10.7
SVHN	1000	1.3	1.0
Stanford Cars	0	5.9	5.2
ImageNet	15000	3.9	3.5

AutoAugment 在 CIFAR-10 (1.5%)、CIFAR-100 (10.7%) 和 SVHN (1.0%) 上实现了最先进的错误率，以及 ImageNet 上的 Top-1 准确率 83.5%。
在 ImageNet 学到的策略可迁移到 FGVC 数据集，并在无需额外数据的情况下提升泛化。
在 CIFAR-10 上，AutoAugment 在多种架构上对比先前方法取得了显著提升。
迁移自 ImageNet 的策略分别使 Stanford Cars 和 FGVC Aircraft 的性能提升了 1.2% 和 1.8%。
直接在简化的 CIFAR-10 上发现的策略也提升了完整 CIFAR-10/100 与 SVHN 的结果，显示了对不同尺度的鲁棒性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。