QUICK REVIEW

[论文解读] Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation

Yogesh Balaji, Rama Chellappa|arXiv (Cornell University)|Oct 12, 2020

Infrastructure Maintenance and Monitoring被引用 12

一句话总结

本文提出了一种计算高效的鲁棒最优传输（OT）对偶公式，通过放松边缘约束来减轻对异常值的敏感性，从而在深度学习中实现稳定训练。该方法在含噪声数据集上实现了最先进的 GAN 性能，并通过学习反映生成难度的样本特定权重，提升了域自适应的准确性。

ABSTRACT

Optimal Transport (OT) distances such as Wasserstein have been used in several areas such as GANs and domain adaptation. OT, however, is very sensitive to outliers (samples with large noise) in the data since in its objective function, every sample, including outliers, is weighed similarly due to the marginal constraints. To remedy this issue, robust formulations of OT with unbalanced marginal constraints have previously been proposed. However, employing these methods in deep learning problems such as GANs and domain adaptation is challenging due to the instability of their dual optimization solvers. In this paper, we resolve these issues by deriving a computationally-efficient dual form of the robust OT optimization that is amenable to modern deep learning applications. We demonstrate the effectiveness of our formulation in two applications of GANs and domain adaptation. Our approach can train state-of-the-art GAN models on noisy datasets corrupted with outlier distributions. In particular, our optimization computes weights for training samples reflecting how difficult it is for those samples to be generated in the model. In domain adaptation, our robust OT formulation leads to improved accuracy compared to the standard adversarial adaptation methods. Our code is available at this https URL.

研究动机与目标

解决标准最优传输对数据分布中噪声或异常值样本的敏感性问题。
开发一种适用于深度学习应用的稳定且可扩展的鲁棒 OT 对偶优化公式。
在受异常值分布污染的数据集上实现有效的生成建模。
通过学习更鲁棒的源域到目标域对齐方式，提升域自适应性能。
提供一种实用的、可微分的 OT 框架，兼容现代深度学习训练流水线。

提出的方法

推导出具有非平衡边缘约束的鲁棒 OT 对偶形式，以减少异常值的影响。
提出一种可微分且稳定的优化方案，适用于深度神经网络中的反向传播。
根据样本生成难度为其分配自适应权重，体现对噪声的鲁棒性。
采用松弛的边缘约束，在分布对齐过程中降低异常值的权重。
将鲁棒 OT 损失集成到 GAN 和域自适应框架中，作为可微分目标函数。
使用计算高效的求解器，避免对偶优化中的不稳定性问题。

实验结果

研究问题

RQ1是否可以将具有非平衡边缘的鲁棒最优传输以对偶形式公式化，使其在深度学习中稳定且可微分？
RQ2所提出的鲁棒 OT 公式在含异常值分布的数据集上如何改善 GAN 训练？
RQ3与标准对抗自适应方法相比，该方法在多大程度上提升了域自适应的准确性？
RQ4该方法能否学习到有意义的样本权重，以反映生成建模中的生成难度？
RQ5鲁棒 OT 损失是否能带来更稳定可靠的深度生成模型训练？

主要发现

所提出的鲁棒 OT 公式在受异常值污染数据集上的 GAN 训练中实现了最先进性能。
该方法学习到样本特定的权重，能有效识别并降低训练过程中困难或噪声样本的影响。
在域自适应中，鲁棒 OT 方法优于标准对抗自适应基线方法。
对偶优化方案稳定且可扩展，支持深度学习框架中的端到端训练。
该方法在无需数据预清洗的情况下，表现出对数据噪声的显著鲁棒性。
代码已公开，支持可复现性，并可集成到现有深度学习流水线中。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。