[論文レビュー] Confidence Regularized Self-Training
CRSTはソフト疑似ラベルと出力平滑化正則化を用いることで、教師なしドメイン適応を改善し、複数のベンチマークで最先端の結果を達成します。
Recent advances in domain adaptation show that deep self-training presents a powerful means for unsupervised domain adaptation. These methods often involve an iterative process of predicting on target domain and then taking the confident predictions as pseudo-labels for retraining. However, since pseudo-labels can be noisy, self-training can put overconfident label belief on wrong classes, leading to deviated solutions with propagated errors. To address the problem, we propose a confidence regularized self-training (CRST) framework, formulated as regularized self-training. Our method treats pseudo-labels as continuous latent variables jointly optimized via alternating optimization. We propose two types of confidence regularization: label regularization (LR) and model regularization (MR). CRST-LR generates soft pseudo-labels while CRST-MR encourages the smoothness on network output. Extensive experiments on image classification and semantic segmentation show that CRSTs outperform their non-regularized counterpart with state-of-the-art performance. The code and models of this work are available at https://github.com/yzou2/CRST.
研究の動機と目的
- Motivate unsupervised domain adaptation and address noisy pseudo-labels in self-training.
- Introduce a continuous pseudo-label framework to relax one-hot assumptions.
- Propose two regularization paradigms (label regularization and model regularization) to stabilize training.
- Theoretically relate CRST to regularized maximum likelihood and EM.
- Empirically validate CRST on image classification and semantic segmentation benchmarks.
提案手法
- Generalize CBST to continuous pseudo-labels on a target domain.
- Introduce label regularization (LR) with soft pseudo-labels and a negative entropy constraint (LRENT).
- Introduce model regularization (MR) with output-smoothing terms (MRL2, MRENT, MRKLD).
- Optimize via alternating steps: pseudo-label generation and network retraining.
- Provide closed-form solutions or gradients for regularizers and connect LRENT to softmax with temperature.
- Demonstrate CRST variants outperform CBST on multiple datasets.
実験結果
リサーチクエスチョン
- RQ1Can continuous pseudo-labels improve stability and accuracy in self-training for UDA?
- RQ2Do label and model regularizers reduce error propagation from incorrect pseudo-labels?
- RQ3How do CRST regularizers relate to known probabilistic/softmax formulations?
- RQ4Which regularizer(s) provide the best performance across vision tasks?
- RQ5What is the empirical impact of CRST on classification and segmentation benchmarks?
主な発見
- CRST variants outperform their non-regularized CBST baseline across benchmarks.
- MRKLD with LRENT often yields the best or competitive results among single regularizers.
- Combining MRKLD and LRENT yields further improvements on VisDA17 and Office-31.
- CRST achieves state-of-the-art performance in GTA5→Cityscapes and SYNTHIA→Cityscapes under DeepLabv2 and other backbones.
- Soft pseudo-labels and output-smoothing regularizers reduce overconfident errors and improve domain adaptation performance.
より良い研究を、今すぐ始めましょう
論文設計から論文執筆まで、研究時間を劇的に削減しましょう。
クレジットカード登録不要
このレビューはAIが作成し、人間の編集者が確認しました。