Skip to main content
QUICK REVIEW

[論文レビュー] Interventional Few-Shot Learning

Zhongqi Yue, Hanwang Zhang|arXiv (Cornell University)|Sep 28, 2020
Domain Adaptation and Few-Shot Learning参考文献 77被引用数 94
ひとこと要約

IFSL は事前学習知識を少数ショット学習の混乱因子として扱い、バックドア調整を用いて介入を行い、mini ImageNet、tiered ImageNet、クロスドメイン CUB で 1-shot および 5-shot の最先端結果を達成する。

ABSTRACT

We uncover an ever-overlooked deficiency in the prevailing Few-Shot Learning (FSL) methods: the pre-trained knowledge is indeed a confounder that limits the performance. This finding is rooted from our causal assumption: a Structural Causal Model (SCM) for the causalities among the pre-trained knowledge, sample features, and labels. Thanks to it, we propose a novel FSL paradigm: Interventional Few-Shot Learning (IFSL). Specifically, we develop three effective IFSL algorithmic implementations based on the backdoor adjustment, which is essentially a causal intervention towards the SCM of many-shot learning: the upper-bound of FSL in a causal view. It is worth noting that the contribution of IFSL is orthogonal to existing fine-tuning and meta-learning based FSL methods, hence IFSL can improve all of them, achieving a new 1-/5-shot state-of-the-art on extit{mini}ImageNet, extit{tiered}ImageNet, and cross-domain CUB. Code is released at https://github.com/yue-zhongqi/ifsl.

研究の動機と目的

  • Identify and formalize the deficiency where pre-training acts as a confounder in FSL.
  • Propose a causal framework (SCM) for FSL to justify interventions.
  • Develop three practical IFSL implementations based on backdoor adjustment.
  • Demonstrate that IFSL is orthogonal to and improves existing fine-tuning and meta-learning FSL methods.
  • Provide empirical evidence of improved 1-/5-shot performance on standard benchmarks and analyze performance across similarity between support and query sets.

提案手法

  • Formulate a Structural Causal Model (SCM) to represent the causal relations among pre-trained knowledge, sample features, and labels.
  • Apply backdoor adjustment to estimate P(Y|do(X)) as a causal intervention.
  • Present three practical IFSL implementations: feature-wise adjustment, class-wise adjustment, and a combined adjustment.
  • Leverage NWGM (Normalized Weighted Geometric Mean) to approximate the class-wise component for efficiency.
  • Ensure IFSL can be plugged into existing FSL baselines without altering their core training objectives.
  • Provide algorithmic details and equations in the appendices for exact instantiations across backbones and classifiers.

実験結果

リサーチクエスチョン

  • RQ1Can pre-trained knowledge act as a confounder in few-shot learning, biasing P(Y|X) away from true causal effects?
  • RQ2How can backdoor adjustment be instantiated practically for FSL to approximate P(Y|do(X)) without many-shot data?
  • RQ3Do feature-wise, class-wise, or combined adjustments improve FSL performance, and are these improvements orthogonal to fine-tuning and meta-learning methods?
  • RQ4What are the empirical gains of IFSL on standard benchmarks (mini ImageNet, tiered ImageNet) and cross-domain tasks (CUB) across 1-shot and 5-shot settings?
  • RQ5How does IFSL influence model attention (CAM-Acc) and robustness across query hardness?

主な発見

  • IFSL yields consistent accuracy gains when plugged into both fine-tuning and meta-learning baselines across 1-/5-shot settings.
  • IFSL achieves new state-of-the-art results on mini ImageNet and tiered ImageNet in 1-/5-shot scenarios.
  • IFSL provides improvements in cross-domain generalization (mini ImageNet to CUB) over linear classifiers and remains beneficial for inductive as well as transductive baselines.
  • The gains are larger in 1-shot settings, indicating higher susceptibility to confounding bias with fewer examples.
  • IFSL improves attention to the true object as evidenced by CAM-Acc visualizations, suggesting reliance on correct visual semantics rather than confounded cues.
  • IFSL is orthogonal to existing FSL methods, improving baselines without requiring changes to their core training procedures.
  • Across varying similarity between S (support) and Q (query), IFSL improves performance in all regimes, including harder query samples.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。