QUICK REVIEW

[论文解读] Label Refinery: Improving ImageNet Classification through Label Progression

Hessam Bagherinezhad, Maxwell Horton|arXiv (Cornell University)|May 7, 2018

Machine Learning and Data Classification参考文献 24被引用 155

一句话总结

论文提出 Label Refinery，一种迭代方法，将作物级标签细化为软性、动态目标，在 ImageNet 上对多种架构显著提升准确率。

ABSTRACT

Among the three main components (data, labels, and models) of any supervised learning system, data and models have been the main subjects of active research. However, studying labels and their properties has received very little attention. Current principles and paradigms of labeling impose several challenges to machine learning algorithms. Labels are often incomplete, ambiguous, and redundant. In this paper we study the effects of various properties of labels and introduce the Label Refinery: an iterative procedure that updates the ground truth labels after examining the entire dataset. We show significant gain using refined labels across a wide range of models. Using a Label Refinery improves the state-of-the-art top-1 accuracy of (1) AlexNet from 59.3 to 67.2, (2) MobileNet from 70.6 to 73.39, (3) MobileNet-0.25 from 50.6 to 55.59, (4) VGG19 from 72.7 to 75.46, and (5) Darknet19 from 72.9 to 74.47.

研究动机与目标

识别图像分类中传统单标签标注的局限性。
提出一种迭代的、数据驱动的标签细化过程，产出软性、动态且集体化的标签。
在 ImageNet 上的多种架构中展示持续的准确性提升。
分析影响 refinery 质量的因素（架构、损失、动态性和对抗性标注）。

提出的方法

使用顺序的 Label Refinery 网络对数据集中的裁剪进行标签增强的形式化定义。
用标准交叉熵对齐真实标签训练第一个 refinery Cθ1。
通过最小化当前 refinery 输出与前一 refinery 的软标签之间的 KL 散度来训练后续的 refinery Cθt (t>1)，在迭代中有效地传播信息。
可选地通过修改输入以最大化相邻 refiners 之间的 KL 散度来使用对抗性抖动。
对不同的 refinery 架构进行实验，以研究跨架构的收益和过拟合行为。
比较图像级与类别级标签细化，以评估在哪些方面细化能带来最显著的收益。

实验结果

研究问题

RQ1使用模型生成的软标注方案进行逐步的裁剪级标签细化，是否能够在多种架构上改善 ImageNet 分类？
RQ2动态的（图像特定）与软标签（概率型）对性能提升有何贡献？
RQ3在一种架构上训练的 refinery 是否能有效改善另一种架构（跨架构细化）？
RQ4在 refinery 过程中使用对抗性扰动裁剪的影响如何？

主要发现

基于 refinery 的训练在各架构上持续提升 Top-1 准确率（例如 AlexNet、VGG19、ResNet50、MobileNet、Darknet19）。
软动态标签和逐裁剪的标签细化提供互补收益，联合使用可带来更进一步的提升。
跨架构细化（例如使用 ResNet-50 作为其他网络的 refinery）带来显著的准确性提升，特别是对于像 MobileNet 这样的紧凑模型。
自我细化和顺序 refinery 减少过拟合，使训练和验证性能趋于一致并提升泛化能力。
对抗性生成的标签（对抗性抖动）在与细化结合使用时，进一步提升较小模型的准确性。
基于类别层级分类法的细化并不优于图像级细化，表明视觉相似性对提升的驱动大于语义分类。
refinery 架构的选择很重要；较少过拟合的 refinery 模型（即使验证精度较低）也能产生更高质量的裁剪标签和更好的下游性能。
使用 KL 散度作为细化损失可以超越简单的交叉熵，并支持各种损失配置。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。