QUICK REVIEW

[论文解读] Data Augmentation by Pairing Samples for Images Classification

Hiroshi Inoue|arXiv (Cornell University)|Jan 9, 2018

Machine Learning and Data Classification参考文献 11被引用 338

一句话总结

SamplePairing 将两个随机选择的训练图像叠加，以合成一个新的训练样本，使用第一个图像的标签，在多个数据集上取得显著的准确率提升，特别是在数据有限时。

ABSTRACT

Data augmentation is a widely used technique in many machine learning tasks, such as image classification, to virtually enlarge the training dataset size and avoid overfitting. Traditional data augmentation techniques for image classification tasks create new samples from the original training data by, for example, flipping, distorting, adding a small amount of noise to, or cropping a patch from an original image. In this paper, we introduce a simple but surprisingly effective data augmentation technique for image classification tasks. With our technique, named SamplePairing, we synthesize a new sample from one image by overlaying another image randomly chosen from the training data (i.e., taking an average of two images for each pixel). By using two images randomly selected from the training set, we can generate $N^2$ new samples from $N$ training samples. This simple data augmentation technique significantly improved classification accuracy for all the tested datasets; for example, the top-1 error rate was reduced from 33.5% to 29.0% for the ILSVRC 2012 dataset with GoogLeNet and from 8.22% to 6.93% in the CIFAR-10 dataset. We also show that our SamplePairing technique largely improved accuracy when the number of samples in the training set was very small. Therefore, our technique is more valuable for tasks with a limited amount of training data, such as medical imaging tasks.

研究动机与目标

推动在有限训练数据下提升图像分类的泛化能力。
引入一种简单的数据增强方法，通过配对样本扩大有效训练集。
在标准基准上评估 SamplePairing，并分析其与其他增强方法的交互影响。

提出的方法

在每个训练实例中通过平均两个随机选择的训练图像来合成新图像。
将混合图像的标签设为第一个图像的标签，忽略第二个图像的标签。
将 SamplePairing 与现有增广方法整合在一起，同时不修改网络结构。
在分阶段训练中评估，在初始若干轮后启用 SamplePairing，并在训练过程中定期禁用以进行微调。
与相关方法（如 mixup、跨类学习）进行比较，以明确等权重配对在不进行标签混合时的优势。

实验结果

研究问题

RQ1与基线增强相比，SamplePairing 是否在各数据集（ILSVRC、CIFAR-10/100、SVHN）上改进泛化能力？
RQ2从训练数据内选择叠加图像与从训练数据外部选择对性能有何影响？
RQ3等权平均（不进行标签混合）是否与混合标签或使用加权混合的方法同样有效？
RQ4当训练集较小或不平衡时，SamplePairing 的性能如何？
RQ5在训练过程中间歇性禁用 SamplePairing 对最终性能有什么影响？

主要发现

与基线相比，SamplePairing 降低了在 ILSVRC、CIFAR-10、CIFAR-100 和 SVHN 的验证错误率。
使用 GoogLeNet 的 ILSVRC 的 top-1 错误从 33.5% 降低到 29.0%。
在一个简单的6层网络下，CIFAR-10 的错误率从 8.22% 降至 6.93%。
在受限训练数据条件下（如 CIFAR-10 每类 100 个样本），准确率提升更大（如从 43.1% 提升到 31.0%）。
从训练集外部叠加图像可带来改进，但不如使用训练集内的叠加效果明显。
仅使用一个图像的标签（不进行标签混合）所得到的性能与混合标签的方法相当。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。