QUICK REVIEW

[Paper Review] Finding Task-Relevant Features for Few-Shot Learning by Category Traversal

Hongyang Li, David Eigen|arXiv (Cornell University)|May 27, 2019

Domain Adaptation and Few-Shot Learning41 references96 citations

TL;DR

The paper introduces a Category Traversal Module (CTM) that looks across all support classes to identify task-relevant feature dimensions, improving metric-based few-shot learning performance by about 5–10% on miniImageNet and tieredImageNet. CTM is plug-and-play and boosts several baselines.

ABSTRACT

Few-shot learning is an important area of research. Conceptually, humans are readily able to understand new concepts given just a few examples, while in more pragmatic terms, limited-example training situations are common in practice. Recent effective approaches to few-shot learning employ a metric-learning framework to learn a feature similarity comparison between a query (test) example, and the few support (training) examples. However, these approaches treat each support class independently from one another, never looking at the entire task as a whole. Because of this, they are constrained to use a single set of features for all possible test-time tasks, which hinders the ability to distinguish the most relevant dimensions for the task at hand. In this work, we introduce a Category Traversal Module that can be inserted as a plug-and-play module into most metric-learning based few-shot learners. This component traverses across the entire support set at once, identifying task-relevant features based on both intra-class commonality and inter-class uniqueness in the feature space. Incorporating our module improves performance considerably (5%-10% relative) over baseline systems on both mini-ImageNet and tieredImageNet benchmarks, with overall performance competitive with recent state-of-the-art systems.

Motivation & Objective

Motivate the need for task-aware feature selection in few-shot learning where support classes are treated independently.
Develop a plug-and-play module that leverages intra-class commonality and inter-class uniqueness to identify task-relevant feature dimensions.
Integrate CTM with existing metric-based few-shot learners to improve discriminativeness of embeddings.
Demonstrate CTM’s effectiveness through extensive ablations and comparisons on standard benchmarks.

Proposed method

Introduce CTM consisting of a concentrator (intra-class commonality) and a projector (inter-class uniqueness).
Concentrator reduces dimensionality and aggregates within-class features to produce a class-wise embedding ‘o’.
Projector aggregates across classes to produce a mask ‘p’ that selects task-relevant feature dimensions.
Apply the mask to support and query embeddings to obtain improved features I(S) and I(Q).
Embed CTM into existing metric-based few-shot learners (Matching Net, Prototypical Net, Relation Net) by replacing their similarity with M(r(S) ⊙ p, r(Q) ⊙ p).
Train and evaluate in episodic few-shot settings on miniImageNet and tieredImageNet with standard 5-way/1–5-shot setups.

Experimental results

Research questions

RQ1Can a cross-category view of the support set improve the identification of task-relevant features in few-shot learning?
RQ2Does CTM improve existing metric-based few-shot learners, and by how much across benchmarks?
RQ3Which CTM components (concentrator, projector) are essential for performance gains?
RQ4How does CTM affect the learned feature space in terms of discriminability?

Key findings

CTM yields consistent relative gains of about 5–10% on miniImageNet and tieredImageNet when integrated with existing metric-based methods.
CTM improves Matching Net, Prototypical Net, and Relation Net by several percentage points across 1-shot and 5-shot settings.
A deeper backbone (ResNet-18) with CTM substantially increases performance (e.g., 1-shot miniImageNet improves to 62.05% with CTM in Table 4).
In ablations, removing the concentrator or the projector degrades performance, confirming both components are necessary for best results.
CTM’s data-augmented version further improves results (e.g., miniImageNet 1-shot 64.12%, tiered 1-shot 68.41% with augmentation).
t-SNE visualizations show tighter, more discriminative clusters after applying CTM-based masking.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.