Skip to main content
QUICK REVIEW

[論文レビュー] Confidence Regularized Self-Training

Yang Zou, Zhiding Yu|arXiv (Cornell University)|Aug 26, 2019
Domain Adaptation and Few-Shot Learning参考文献 63被引用数 33
ひとこと要約

CRSTはソフト疑似ラベルと出力平滑化正則化を用いることで、教師なしドメイン適応を改善し、複数のベンチマークで最先端の結果を達成します。

ABSTRACT

Recent advances in domain adaptation show that deep self-training presents a powerful means for unsupervised domain adaptation. These methods often involve an iterative process of predicting on target domain and then taking the confident predictions as pseudo-labels for retraining. However, since pseudo-labels can be noisy, self-training can put overconfident label belief on wrong classes, leading to deviated solutions with propagated errors. To address the problem, we propose a confidence regularized self-training (CRST) framework, formulated as regularized self-training. Our method treats pseudo-labels as continuous latent variables jointly optimized via alternating optimization. We propose two types of confidence regularization: label regularization (LR) and model regularization (MR). CRST-LR generates soft pseudo-labels while CRST-MR encourages the smoothness on network output. Extensive experiments on image classification and semantic segmentation show that CRSTs outperform their non-regularized counterpart with state-of-the-art performance. The code and models of this work are available at https://github.com/yzou2/CRST.

研究の動機と目的

  • Motivate unsupervised domain adaptation and address noisy pseudo-labels in self-training.
  • Introduce a continuous pseudo-label framework to relax one-hot assumptions.
  • Propose two regularization paradigms (label regularization and model regularization) to stabilize training.
  • Theoretically relate CRST to regularized maximum likelihood and EM.
  • Empirically validate CRST on image classification and semantic segmentation benchmarks.

提案手法

  • Generalize CBST to continuous pseudo-labels on a target domain.
  • Introduce label regularization (LR) with soft pseudo-labels and a negative entropy constraint (LRENT).
  • Introduce model regularization (MR) with output-smoothing terms (MRL2, MRENT, MRKLD).
  • Optimize via alternating steps: pseudo-label generation and network retraining.
  • Provide closed-form solutions or gradients for regularizers and connect LRENT to softmax with temperature.
  • Demonstrate CRST variants outperform CBST on multiple datasets.

実験結果

リサーチクエスチョン

  • RQ1Can continuous pseudo-labels improve stability and accuracy in self-training for UDA?
  • RQ2Do label and model regularizers reduce error propagation from incorrect pseudo-labels?
  • RQ3How do CRST regularizers relate to known probabilistic/softmax formulations?
  • RQ4Which regularizer(s) provide the best performance across vision tasks?
  • RQ5What is the empirical impact of CRST on classification and segmentation benchmarks?

主な発見

  • CRST variants outperform their non-regularized CBST baseline across benchmarks.
  • MRKLD with LRENT often yields the best or competitive results among single regularizers.
  • Combining MRKLD and LRENT yields further improvements on VisDA17 and Office-31.
  • CRST achieves state-of-the-art performance in GTA5→Cityscapes and SYNTHIA→Cityscapes under DeepLabv2 and other backbones.
  • Soft pseudo-labels and output-smoothing regularizers reduce overconfident errors and improve domain adaptation performance.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。