[論文レビュー] Learning to Self-Train for Semi-Supervised Few-Shot Classification
The paper introduces Learning to Self-Train (LST), a semi-supervised meta-learning approach that self-labels unlabeled data in few-shot tasks and uses a soft weighting network to select and weight pseudo-labels for robust self-training.
Few-shot classification (FSC) is challenging due to the scarcity of labeled training data (e.g. only one labeled data point per class). Meta-learning has shown to achieve promising results by learning to initialize a classification model for FSC. In this paper we propose a novel semi-supervised meta-learning method called learning to self-train (LST) that leverages unlabeled data and specifically meta-learns how to cherry-pick and label such unsupervised data to further improve performance. To this end, we train the LST model through a large number of semi-supervised few-shot tasks. On each task, we train a few-shot model to predict pseudo labels for unlabeled data, and then iterate the self-training steps on labeled and pseudo-labeled data with each step followed by fine-tuning. We additionally learn a soft weighting network (SWN) to optimize the self-training weights of pseudo labels so that better ones can contribute more to gradient descent optimization. We evaluate our LST method on two ImageNet benchmarks for semi-supervised few-shot classification and achieve large improvements over the state-of-the-art method. Code is at https://github.com/xinzheli1217/learning-to-self-train.
研究の動機と目的
- Motivate semi-supervised learning for few-shot classification (SSFSC) to utilize unlabeled data when labeled data is scarce.
- Integrate self-training into meta-learning to rapidly adapt to new few-shot tasks.
- Mitigate label-noise drift by learning soft weights for pseudo-labels and by re-training with labeled data after each step.
- Demonstrate that LST yields state-of-the-art results on miniImageNet and tieredImageNet benchmarks.
提案手法
- Employ an inner-loop self-training process for a task starting from a meta-learned initialization.
- Generate pseudo labels for unlabeled data using a task-specific classifier trained on the support set.
- Hard-select a subset of pseudo-labeled samples and assign soft weights via a meta-learned Soft Weighting Network (SWN).
- Re-train the model on weighted pseudo-labeled data plus labeled support data, followed by a fine-tuning step on labeled data only.
- Meta-optimize SWN and base-learner initializations through outer-loop validation losses to improve future self-training steps.
実験結果
リサーチクエスチョン
- RQ1Can self-training with pseudo-labels improve performance in semi-supervised few-shot classification?
- RQ2How can we mitigate drift from noisy pseudo-labels in iterative self-training within a meta-learning framework?
- RQ3Does meta-learning a weighting mechanism for pseudo-labels (SWN) improve cherry-picking and overall adaptation to new tasks?
- RQ4What are the benefits and limitations of recursive/self-trained steps under distractor unlabeled data?
主な発見
| Method | miniImageNet 1-shot | miniImageNet 5-shot | tieredImageNet 1-shot | tieredImageNet 5-shot |
|---|---|---|---|---|
| MTL, SunCVPR2019 (pre) | 61.2 b1 1.8 | 75.5 b1 0.9 | 65.6 b1 1.8 | 78.6 b1 0.9 |
| LST (Ours) – recursive, hard, soft | 70.1 b1 1.9 | 78.7 b1 0.8 | 77.7 b1 1.6 | 85.2 b1 0.8 |
- LST achieves top results on miniImageNet (1-shot: 70.1%, 5-shot: 78.7%) and tieredImageNet (1-shot: 77.7%, 5-shot: 85.2%).
- Compared with the meta-learning baseline (MTL), LST improves miniImageNet by 8.9% (1-shot) and 3.2% (5-shot).
- On tieredImageNet, LST improves by 12.1% (1-shot) and 6.6% (5-shot) over the baseline.
- Hard selection of pseudo-labels generally boosts performance, and combining hard selection with soft weighting (SWN) yields further gains.
- Recursive self-training can improve performance, especially when combined with SWN and hard selection, though distractors can hurt gains in some settings.
より良い研究を、今すぐ始めましょう
論文設計から論文執筆まで、研究時間を劇的に削減しましょう。
クレジットカード登録不要
このレビューはAIが作成し、人間の編集者が確認しました。