QUICK REVIEW

[论文解读] Meta-Learning for Semi-Supervised Few-Shot Classification

Mengye Ren, Eleni Triantafillou|arXiv (Cornell University)|Mar 2, 2018

Domain Adaptation and Few-Shot Learning参考文献 21被引用 726

一句话总结

该论文将 Prototypical Networks 扩展到半监督小样本学习，通过在 episode 中引入未标记数据，提出若干改进（soft k-means、distractor 处理、Masking），并在 Omniglot、mini-ImageNet 与 tiered-ImageNet 上实现了一致的性能提升。

ABSTRACT

In few-shot classification, we are interested in learning algorithms that train a classifier from only a handful of labeled examples. Recent progress in few-shot classification has featured meta-learning, in which a parameterized model for a learning algorithm is defined and trained on episodes representing different classification problems, each with a small labeled training set and its corresponding test set. In this work, we advance this few-shot classification paradigm towards a scenario where unlabeled examples are also available within each episode. We consider two situations: one where all unlabeled examples are assumed to belong to the same set of classes as the labeled examples of the episode, as well as the more challenging situation where examples from other distractor classes are also provided. To address this paradigm, we propose novel extensions of Prototypical Networks (Snell et al., 2017) that are augmented with the ability to use unlabeled examples when producing prototypes. These models are trained in an end-to-end way on episodes, to learn to leverage the unlabeled examples successfully. We evaluate these methods on versions of the Omniglot and miniImageNet benchmarks, adapted to this new framework augmented with unlabeled examples. We also propose a new split of ImageNet, consisting of a large set of classes, with a hierarchical structure. Our experiments confirm that our Prototypical Networks can learn to improve their predictions due to unlabeled examples, much like a semi-supervised algorithm would.

研究动机与目标

Motivate and formalize semi-supervised few-shot learning within meta-learning episodes.
Extend Prototypical Networks to leverage unlabeled data during both training and inference.
Develop robust unlabeled-data refinements that handle distractors in the unlabeled pool.
Evaluate on adapted Omniglot and ImageNet-based benchmarks, and introduce tiered ImageNet for hierarchical class splits.

提出的方法

Represent episodes with labeled support set, unlabeled pool, and a query set within a meta-learning framework.
Refine class prototypes using unlabeled data via (i) Soft k-means extension of Prototypical Networks, (ii) Soft k-means with a distractor cluster, and (iii) Masked Soft k-means that uses a learned masking to downweight potential distractors.
Train end-to-end with the refined prototypes using the standard Prototypical Network loss, enabling the embedding to adapt to semi-supervised refinement.
Adapt Omniglot, mini-ImageNet, and tiered-ImageNet with labeled/unlabeled splits per class and optional distractors; evaluate 1-shot and 5-shot accuracy with and without distractors.
Provide publicly available code for reproducibility (GitHub link).

实验结果

研究问题

RQ1Can unlabeled data within an episode improve few-shot classification when classes in the unlabeled set cover the target classes or include distractors?
RQ2Do semi-supervised refinements of prototypes learned during meta-training outperform purely supervised Prototypical Networks and naive semi-supervised inference?
RQ3Which semi-supervised refinement strategy (soft k-means variants with/distractors/masking) offers the best robustness and accuracy across datasets and shot settings?
RQ4How does the tiered ImageNet hierarchy affect few-shot semi-supervised learning and generalization to distinct test classes?

主要发现

Model	Acc.	Acc. w/ D
Supervised	94.62 \u0000b1 0.09	94.62 \u0000b1 0.09
Semi-Supervised Inference	97.45 \u0000b1 0.05	95.08 \u0000b1 0.09
Soft k-Means	97.25 \u0000b1 0.10	95.01 \u0000b1 0.09
Soft k-Means+Cluster	97.68 \u0000b1 0.07	97.17 \u0000b1 0.04
Masked Soft k-Means	97.52 \u0000b1 0.07	97.30 \u0000b1 0.08

All proposed semi-supervised Prototypical Network variants outperform the purely supervised baseline on Omniglot, mini-ImageNet, and tiered-ImageNet in most settings.
In non-distractor settings, at least one semi-supervised variant beats baselines across datasets and shot numbers, with no single model universally best.
In distractor scenarios, Masked Soft k-Means shows the most robust performance, often achieving state-of-the-art among the evaluated methods.
Increasing the unlabeled set size M improves test accuracy, indicating the models learn to leverage unlabeled data through meta-training.
The study introduces tiered ImageNet as a large-scale, hierarchically structured benchmark for semi-supervised few-shot learning.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。