QUICK REVIEW

[论文解读] Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Yanbin Liu, Juho Lee|arXiv (Cornell University)|May 25, 2018

Domain Adaptation and Few-Shot Learning被引用 486

一句话总结

TPN 引入端到端的传导标签传播，用于小样本学习，通过对支持集和查询集的并集学习图，在 mini ImageNet 和 tiered ImageNet 上达到最先进的结果。

ABSTRACT

The goal of few-shot learning is to learn a classifier that generalizes well even when trained with a limited number of training instances per class. The recently introduced meta-learning approaches tackle this problem by learning a generic classifier across a large number of multiclass classification tasks and generalizing the model to a new task. Yet, even with such meta-learning, the low-data problem in the novel classification task still remains. In this paper, we propose Transductive Propagation Network (TPN), a novel meta-learning framework for transductive inference that classifies the entire test set at once to alleviate the low-data problem. Specifically, we propose to learn to propagate labels from labeled instances to unlabeled test instances, by learning a graph construction module that exploits the manifold structure in the data. TPN jointly learns both the parameters of feature embedding and the graph construction in an end-to-end manner. We validate TPN on multiple benchmark datasets, on which it largely outperforms existing few-shot learning approaches and achieves the state-of-the-art results.

研究动机与目标

通过利用对整个查询集的传导推理来缓解数据稀缺，从而解决小样本分类的动机。
提出一个端到端可学习的框架，联合学习特征嵌入和用于标签传播的任务自适应图。
利用 episodic 训练来模拟测试条件并优化标签传播机制。
证明通过传导图学习传播标签比在标准基准上的归纳/元学习基线更准确。

提出的方法

用 CNN 将输入嵌入，得到特征表示 f_phi(x)。
通过对 f_phi(x_i) 对所有 S ∪ Q 应用的图构造模块 g_phi，构造一个逐样本的长度尺度图 sigma_i。
使用每个样本缩放的嵌入来计算 kNN 相似度图 W，并用图拉普拉斯 S = D^(-1/2) W D^(-1/2) 进行归一化。
执行标签传播 F* = (I - αS)^(-1) Y，将标签从支持集传播到查询集，其中 Y 编码已知标签。
通过端到端训练，在 S ∪ Q 上对传播分数 F*（softmax 处理）与真标签之间的交叉熵最小化，并通过反向传播更新 φ 和 φ。
每个 episode 的图构造（N-way K-shot）学会适应任务流形；复杂度保持较小，因为 n ≈ N×K + T（例如 80–100）。

实验结果

研究问题

RQ1传导标签传播，在一个 episodic 元学习框架中端到端学习，能否超越归纳式的小样本方法？
RQ2学习逐样本图的尺度和距离是否提升在新类别空间中的标签传播？
RQ3在更高的-shot 或更高的-way 训练策略对传导小样本性能有何影响？
RQ4相较于状态-of-the-art 方法，传导传播在标准基准（mini ImageNet、tiered ImageNet）上的表现如何？

主要发现

5 类 1-shot	5 类 5-shot	10 类 1-shot	10 类 5-shot
53.75	69.43	36.62	52.32
55.51	69.86	38.44	52.77
57.53	72.85	40.93	59.17
59.91	73.30	44.80	59.44
52.31	68.18	35.23	51.24
53.75	69.43	36.62	52.32
55.51	69.86	38.44	52.77
57.53	72.85	40.93	59.17

TPN 在 mini ImageNet 和 tiered ImageNet 上达到最先进的结果，超越归纳和其他传导基线。
学习一个逐样本、任务自适应的用于标签传播的图在1-shot 情况下带来显著提升。
更高的 shot 训练（每个任务更多样本）进一步提升 TPN 的性能，在1-shot 情况下收益更大。
半监督适应（TPN-semi）在同类半监督小样本基线上进一步提升。
利用整个查询集的传导方法在标注数据稀缺时尤其有利，随着标注集规模扩大，收益递减。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。