QUICK REVIEW

[论文解读] Extended T: Learning with Mixed Closed-set and Open-set Noisy Labels

Xiaobo Xia, Tongliang Liu|arXiv (Cornell University)|Dec 2, 2020

Machine Learning and Data Classification参考文献 68被引用 24

一句话总结

本文提出 Extended T，一种依赖聚类的扩展转移矩阵，通过引入一个用于开放集类别的元类别来建模混合的闭集和开放集标签噪声，并从噪声数据中估计特定聚类的转移矩阵。该方法在具有噪声标签的真实世界人脸识别数据集上实现了最先进水平的鲁棒性，在高噪声率下准确率优于先前方法。

ABSTRACT

The label noise transition matrix $T$, reflecting the probabilities that true labels flip into noisy ones, is of vital importance to model label noise and design statistically consistent classifiers. The traditional transition matrix is limited to model closed-set label noise, where noisy training data has true class labels within the noisy label set. It is unfitted to employ such a transition matrix to model open-set label noise, where some true class labels are outside the noisy label set. Thus when considering a more realistic situation, i.e., both closed-set and open-set label noise occurs, existing methods will undesirably give biased solutions. Besides, the traditional transition matrix is limited to model instance-independent label noise, which may not perform well in practice. In this paper, we focus on learning under the mixed closed-set and open-set label noise. We address the aforementioned issues by extending the traditional transition matrix to be able to model mixed label noise, and further to the cluster-dependent transition matrix to better approximate the instance-dependent label noise in real-world applications. We term the proposed transition matrix as the cluster-dependent extended transition matrix. An unbiased estimator (i.e., extended $T$-estimator) has been designed to estimate the cluster-dependent extended transition matrix by only exploiting the noisy data. Comprehensive synthetic and real experiments validate that our method can better model the mixed label noise, following its more robust performance than the prior state-of-the-art label-noise learning methods.

研究动机与目标

填补现有标签噪声学习方法无法处理闭集和开放集标签噪声共现的问题。
克服传统转移矩阵不适用于开放集标签噪声且在两类噪声共现时产生偏差的局限性。
通过引入依赖聚类的转移矩阵而非依赖类别的转移矩阵，改进对实例相关标签噪声的近似。
仅使用噪声数据，基于锚点构建扩展转移矩阵的无偏估计器，用于元类别和真实类别。
在合成数据集和真实世界数据集上验证该方法，以证明在现实混合标签噪声条件下的鲁棒性。

提出的方法

引入一个元类别以表示所有开放集类别，使传统转移矩阵能够通过将元类别视为并列实体来建模开放集噪声。
在每个聚类中识别真实类别（闭集）和元类别（开放集）的锚点，以确保转移概率的可靠估计。
将传统转移矩阵扩展为依赖聚类的扩展转移矩阵，其中每个聚类拥有独立的转移矩阵以捕捉局部噪声模式。
基于锚点分布，仅使用噪声训练数据设计依赖聚类的扩展转移矩阵的无偏估计器。
使用估计的依赖聚类扩展转移矩阵重新加权训练损失，实现在混合标签噪声下的统计一致性学习。
通过将估计的转移矩阵整合到训练目标中，将该方法应用于深度学习模型以校正标签噪声。

实验结果

研究问题

RQ1统一的转移矩阵框架能否同时建模闭集和开放集标签噪声？
RQ2如何仅使用噪声数据构建扩展转移矩阵的无偏估计器？
RQ3通过依赖聚类的转移矩阵建模实例相关标签噪声，与依赖类别的近似相比，其鲁棒性提升程度如何？
RQ4在现实混合标签噪声设置下，所提出方法是否优于最先进标签噪声学习方法？
RQ5聚类数量对建模复杂真实世界标签噪声性能的影响如何？

主要发现

在 MS1MV0 数据集上，Extended T 达到 91.51% 的平均准确率，优于第二名方法 Extended T-2（91.38%）和所有基线方法。
在 VggFace-2 上，Extended T 达到 91.30% 的平均准确率，超过 Co-teaching+（89.40%）和 T-Revision（90.99%），表明即使在低噪声率下也具有鲁棒性。
依赖聚类的转移矩阵带来显著性能提升，尤其在高噪声的 MS1MV0 上，显示出对复杂实例相关噪声的更好建模能力。
Extended T 始终优于 NLNL，后者因在复杂真实世界噪声中难以选择可靠的互补标签而表现欠佳。
聚类数量增加时性能下降极小，表明方法在低噪声设置下对超参数选择具有稳定性和鲁棒性。
实证结果证实，所提出的扩展转移矩阵估计器是无偏的，且能仅从噪声数据中有效估计噪声模式。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。