QUICK REVIEW

[论文解读] Adversarial Dropout Regularization

Kuniaki Saito, Yoshitaka Ushiku|arXiv (Cornell University)|Nov 5, 2017

Domain Adaptation and Few-Shot Learning参考文献 25被引用 50

一句话总结

ADR 引入基于 dropout 的对抗性 critic 以实现边界感知特征对齐，用于未监督域自适应，提升目标特征在分类和分割任务中的判别性，并实现半监督 GAN 训练。

ABSTRACT

We present a method for transferring neural representations from label-rich source domains to unlabeled target domains. Recent adversarial methods proposed for this task learn to align features across domains by fooling a special domain critic network. However, a drawback of this approach is that the critic simply labels the generated features as in-domain or not, without considering the boundaries between classes. This can lead to ambiguous features being generated near class boundaries, reducing target classification accuracy. We propose a novel approach, Adversarial Dropout Regularization (ADR), to encourage the generator to output more discriminative features for the target domain. Our key idea is to replace the critic with one that detects non-discriminative features, using dropout on the classifier network. The generator then learns to avoid these areas of the feature space and thus creates better features. We apply our ADR approach to the problem of unsupervised domain adaptation for image classification and semantic segmentation tasks, and demonstrate significant improvement over the state of the art. We also show that our approach can be used to train Generative Adversarial Networks for semi-supervised learning.

研究动机与目标

将带标签的源域神经表示迁移到未标签的目标域。
解决在对齐过程中忽略类别边界的传统域 critics 的局限性。
通过边界感知的对抗 dropout 开发 ADR，使目标特征更加具备判别性。
展示 ADR 在半监督学习和 GANs 中的适用性，超越域自适应。

提出的方法

用分类器 C 取代域 critic，作为分类器和边界敏感 critic（通过 dropout）。
使用两个 dropout 引入的分类器 C1 和 C2 来测量 p(y|x) 对边界扰动的敏感性；在目标样本上最大化这种敏感性，而在生成器输出上最小化。
训练 G（特征生成器）以使目标特征远离决策边界，最小化对目标数据的 dropout 敏感性。
训练 C 以最大化目标敏感性，同时在源数据上保持判别能力，并使用源监督项。
融入条件熵项，鼓励目标特征的类别分布更加均匀。
将 ADR 扩展到半监督 GAN 训练，让 G 生成的图像欺骗 C，同时对齐真实未标记数据与生成数据。

实验结果

研究问题

RQ1基于 dropout 的边界感知 critic 是否可以在没有目标标签的情况下改善源、目标特征的对齐？
RQ2最大化对 dropout 引起的扰动在目标样本上的敏感性，是否会导致具备判别性、无边界的目标表征？
RQ3ADR 在域移位下对图像分类和语义分割是否都有效？
RQ4ADR 能否通过将生成样本置于决策边界之外，同时使生成数据与真实数据相似，提升半监督 GAN 的训练效果？

主要发现

与以往的分布匹配方法相比，ADR 提高了数字数据集的未监督域自适应（MNIST/SVHN/USPS）和图像分割任务的性能。
在 SVHN 到 MNIST 的实验中，ADR 显著优于以往方法的准确率。
ADR 降低目标熵，将目标样本从决策边界移开，而在不敏感边界的变体分类器 C′ 训练仍具竞争力。
ADR 在 VisDA2017 分类（合成到真实）上对 ResNet101 和 ResNeXt 微调实现了最先进的结果。
在分割任务（GTA5 到 Cityscapes）中，ADR 相对“仅源数据”及其他方法提升了平均 IoU。
ADR 可用于训练半监督学习的 GAN，在 SVHN 和 CIFAR-10 上获得有竞争力的结果。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。