QUICK REVIEW

[论文解读] Learning to Learn By Self-Critique

Antreas Antoniou, Amos Storkey|arXiv (Cornell University)|Jan 1, 2019

Domain Adaptation and Few-Shot Learning被引用 13

一句话总结

该论文提出了一种名为 Self-Critique and Adapt (SCA) 的新型少样本学习框架，通过自我批判机制学习无标签损失函数，利用推理阶段的无标签目标集数据来提升泛化能力。通过优化该学习到的损失函数以提升模型在目标任务上的性能，SCA 在 Mini-ImageNet 和 CUB-200 上取得了最先进结果，显著降低了与仅使用支持集的基线方法相比的错误率。

ABSTRACT

In few-shot learning, a machine learning system learns from a small set of labelled examples relating to a specific task, such that it can generalize to new examples of the same task. Given the limited availability of labelled examples in such tasks, we wish to make use of all the information we can. Usually a model learns task-specific information from a small training-set (support-set) to predict on an unlabelled validation set (target-set). The target-set contains additional task-specific information which is not utilized by existing few-shot learning methods. Making use of the target-set examples via transductive learning requires approaches beyond the current methods; at inference time, the target-set contains only unlabelled input data-points, and so discriminative learning cannot be used. In this paper, we propose a framework called Self-Critique and Adapt or SCA, which learns to learn a label-free loss function, parameterized as a neural network. A base-model learns on a support-set using existing methods (e.g. stochastic gradient descent combined with the cross-entropy loss), and then is updated for the incoming target-task using the learnt loss function. This label-free loss function is itself optimized such that the learnt model achieves higher generalization performance. Experiments demonstrate that SCA offers substantially reduced error-rates compared to baselines which only adapt on the support-set, and results in state of the art benchmark performance on Mini-ImageNet and Caltech-UCSD Birds 200.

研究动机与目标

为了解决现有少样本学习方法在推理过程中无法利用无标签目标集数据的局限性。
开发一种方法，仅使用支持集和无标签目标集实现归纳适应，而无需目标集上的标签。
学习一种可微分的、由神经网络参数化的损失函数，通过自我批判提升模型泛化能力。
通过在适应过程中引入目标集信息，在标准少样本基准上实现最先进性能。

提出的方法

SCA 训练一个神经网络以学习一种无标签损失函数，该函数在推理过程中指导模型在无标签目标集上的更新。
基础模型首先使用标准方法（如随机梯度下降配合交叉熵损失）在支持集上进行训练。
在推理阶段，模型使用自学习的损失函数在目标集上进行更新，该损失函数被优化以最大化泛化性能。
损失函数通过元学习目标进行训练，以最小化适应后在目标任务上的误差。
该框架通过可微分的、端到端可训练的过程，有效利用无标签目标数据，实现了归纳式少样本学习。
学习到的损失函数充当评判者，无需真实标签即可评估并改进模型在目标数据上的预测。

实验结果

研究问题

RQ1模型能否通过在无标签目标数据上自我批判其预测，来提升自身的泛化性能？
RQ2当标签不可用时，如何有效利用无标签目标集样例进行少样本学习？
RQ3能否训练一个由神经网络参数化的损失函数，以改善模型在未见任务上的适应能力？
RQ4通过学习到的损失函数引入目标集信息，是否能带来比标准归纳式少样本方法更好的泛化性能？

主要发现

与仅在支持集上适应的基线方法相比，SCA 显著降低了错误率。
该方法在 Mini-ImageNet 基准上设定了新的最先进性能。
SCA 在 Caltech-UCSD 鸟类 200 (CUB-200) 数据集上也取得了最先进结果。
学习到的无标签损失函数通过在推理阶段利用无标签目标数据，有效提升了模型泛化能力。
该框架证明了无需目标集标签即可实现归纳式少样本学习的可行性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。