[論文レビュー] Interventional Few-Shot Learning
IFSL は事前学習知識を少数ショット学習の混乱因子として扱い、バックドア調整を用いて介入を行い、mini ImageNet、tiered ImageNet、クロスドメイン CUB で 1-shot および 5-shot の最先端結果を達成する。
We uncover an ever-overlooked deficiency in the prevailing Few-Shot Learning (FSL) methods: the pre-trained knowledge is indeed a confounder that limits the performance. This finding is rooted from our causal assumption: a Structural Causal Model (SCM) for the causalities among the pre-trained knowledge, sample features, and labels. Thanks to it, we propose a novel FSL paradigm: Interventional Few-Shot Learning (IFSL). Specifically, we develop three effective IFSL algorithmic implementations based on the backdoor adjustment, which is essentially a causal intervention towards the SCM of many-shot learning: the upper-bound of FSL in a causal view. It is worth noting that the contribution of IFSL is orthogonal to existing fine-tuning and meta-learning based FSL methods, hence IFSL can improve all of them, achieving a new 1-/5-shot state-of-the-art on extit{mini}ImageNet, extit{tiered}ImageNet, and cross-domain CUB. Code is released at https://github.com/yue-zhongqi/ifsl.
研究の動機と目的
- Identify and formalize the deficiency where pre-training acts as a confounder in FSL.
- Propose a causal framework (SCM) for FSL to justify interventions.
- Develop three practical IFSL implementations based on backdoor adjustment.
- Demonstrate that IFSL is orthogonal to and improves existing fine-tuning and meta-learning FSL methods.
- Provide empirical evidence of improved 1-/5-shot performance on standard benchmarks and analyze performance across similarity between support and query sets.
提案手法
- Formulate a Structural Causal Model (SCM) to represent the causal relations among pre-trained knowledge, sample features, and labels.
- Apply backdoor adjustment to estimate P(Y|do(X)) as a causal intervention.
- Present three practical IFSL implementations: feature-wise adjustment, class-wise adjustment, and a combined adjustment.
- Leverage NWGM (Normalized Weighted Geometric Mean) to approximate the class-wise component for efficiency.
- Ensure IFSL can be plugged into existing FSL baselines without altering their core training objectives.
- Provide algorithmic details and equations in the appendices for exact instantiations across backbones and classifiers.
実験結果
リサーチクエスチョン
- RQ1Can pre-trained knowledge act as a confounder in few-shot learning, biasing P(Y|X) away from true causal effects?
- RQ2How can backdoor adjustment be instantiated practically for FSL to approximate P(Y|do(X)) without many-shot data?
- RQ3Do feature-wise, class-wise, or combined adjustments improve FSL performance, and are these improvements orthogonal to fine-tuning and meta-learning methods?
- RQ4What are the empirical gains of IFSL on standard benchmarks (mini ImageNet, tiered ImageNet) and cross-domain tasks (CUB) across 1-shot and 5-shot settings?
- RQ5How does IFSL influence model attention (CAM-Acc) and robustness across query hardness?
主な発見
- IFSL yields consistent accuracy gains when plugged into both fine-tuning and meta-learning baselines across 1-/5-shot settings.
- IFSL achieves new state-of-the-art results on mini ImageNet and tiered ImageNet in 1-/5-shot scenarios.
- IFSL provides improvements in cross-domain generalization (mini ImageNet to CUB) over linear classifiers and remains beneficial for inductive as well as transductive baselines.
- The gains are larger in 1-shot settings, indicating higher susceptibility to confounding bias with fewer examples.
- IFSL improves attention to the true object as evidenced by CAM-Acc visualizations, suggesting reliance on correct visual semantics rather than confounded cues.
- IFSL is orthogonal to existing FSL methods, improving baselines without requiring changes to their core training procedures.
- Across varying similarity between S (support) and Q (query), IFSL improves performance in all regimes, including harder query samples.
より良い研究を、今すぐ始めましょう
論文設計から論文執筆まで、研究時間を劇的に削減しましょう。
クレジットカード登録不要
このレビューはAIが作成し、人間の編集者が確認しました。