QUICK REVIEW

[論文レビュー] Mitigating Shortcut Learning via Feature Disentanglement in Medical Imaging: A Benchmark Study

Sarah Müller, Berens, Philipp|arXiv (Cornell University)|Feb 17, 2026

Adversarial Robustness in Machine Learning被引用数 0

ひとこと要約

要約: 本論文は医用画像におけるショートカット学習を緩和するための特徴分離方法をベンチマークし、データ中心の再平衡と潜在空間の分離（特に dCor ベース）がデータセット間およびテスト分布で堅牢な改善をもたらすことを示す。

ABSTRACT

Although deep learning models in medical imaging often achieve excellent classification performance, they can rely on shortcut learning, exploiting spurious correlations or confounding factors that are not causally related to the target task. This poses risks in clinical settings, where models must generalize across institutions, populations, and acquisition conditions. Feature disentanglement is a promising approach to mitigate shortcut learning by separating task-relevant information from confounder-related features in latent representations. In this study, we systematically evaluated feature disentanglement methods for mitigating shortcuts in medical imaging, including adversarial learning and latent space splitting based on dependence minimization. We assessed classification performance and disentanglement quality using latent space analyses across one artificial and two medical datasets with natural and synthetic confounders. We also examined robustness under varying levels of confounding and compared computational efficiency across methods. We found that shortcut mitigation methods improved classification performance under strong spurious correlations during training. Latent space analyses revealed differences in representation quality not captured by classification metrics, highlighting the strengths and limitations of each method. Model reliance on shortcuts depended on the degree of confounding in the training data. The best-performing models combine data-centric rebalancing with model-centric disentanglement, achieving stronger and more robust shortcut mitigation than rebalancing alone while maintaining similar computational efficiency.

研究の動機と目的

医用画像におけるショートカット学習の動機付けと定量化、機関間および取得条件のばらつきに対する堅牢な一般化の必要性。
特徴分離方法（dCor、MI、MMD）をベースラインおよび敵対的アプローチと体系的に比較。
再平衡を含むデータ中心および潜在空間分割を含むモデル中心の戦略を、管理下および実世界データセットで評価。
データ中心とモデル中心の介入を組み合わせることで、ショートカットの緩和がより優れた堅牢性を得られるかを調査。

提案手法

主タスク y1 と同じ画像 x から予測される混乱タスク y2 の二タスク設定を定式化。
潜在表現を y1 用の z1 と y2 用の z2 の二つのサブ空間に分割し、結合分類損失で学習。
依存度測度 D(z1,z2) を重み lambda で最小化することにより分離性を強制: L = Lcls + lambda Ldep。
ERM ベースライン、オーバーサンプリング再平衡、敵対学習（AdvCl）、および距離相関（dCor）、相互情報（MI/MINE）、最大平均差（MMD）を用いた明示的な特徴分離アプローチを比較。
データ中心とモデル中心の戦略の組み合わせを評価（例: Rebalancing + dCor、Rebalancing + MINE など）。
三つのデータセットを使用（Morpho-MNIST の人工的混乱; CheXpert 胸部X線; 合成混乱を伴う OCT）し、Original、Balanced、Inverted の分布でショートカット緩和をストレステスト。

Figure 1 : Overview of shortcut learning and mitigation via feature disentanglement. a Example of a spurious correlation between two binary tasks in the training data that reverses at test time, illustrating a distribution shift. b Causal graph in which a confounder affects both tasks $y_{1}$ and $y

実験結果

リサーチクエスチョン

RQ1特徴分離法は ERM およびデータ中心の再平衡と比較して、医用画像データセット全体でショートカット学習を緩和できるか。
RQ2明示的な潜在空間分離法（dCor、MI、MMD）は強い混乱と分布シフト下で堅牢な改善を提供するか。
RQ3データ中心の再平衡とモデル中心の分離を組み合わせると、いずれか単独よりも優れた堅牢なショートカット緩和が得られるか。
RQ4潜在空間の解析は、主要タスクの精度を超えた分離品質を示すか。
RQ5計算効率を維持しつつ、どの方法が最も高い精度を提供するか。

主な発見

Dataset	Method	Original AUROC	Balanced AUROC	Inverted AUROC
Morpho-MNIST	Baseline (ERM)	96±1	95±1	88±3
Morpho-MNIST	Rebalancing	98±1	97±0	97±0
Morpho-MNIST	AdvCl	98±0	97±1	96±1
Morpho-MNIST	AdvCl+Rebal	98±2	97±2	96±1
Morpho-MNIST	dCor	97±0	96±1	92±1
Morpho-MNIST	dCor+Rebal	99±0	98±0	98±0
Morpho-MNIST	MINE	97±2	96±2	98±1
Morpho-MNIST	MINE+Rebal	97±1	96±1	97±1
Morpho-MNIST	MMD	97±0	74?	91±1
Morpho-MNIST	MMD+Rebal	98±0	77?	93±4
CheXpert	Baseline (ERM)	79±1	79±1	46±4
CheXpert	Rebalancing	88±2	88±2	38% improvement over baseline equivalent
AdvCl	83±1	83±1	18% improvement over baseline equivalent
CheXpert	AdvCl+Rebal	85±1	85±1	26% improvement over baseline equivalent
CheXpert	dCor	79±3	79±3	?
CheXpert	dCor+Rebal	88±1	88±1	?
CheXpert	MINE	88±1	88±1	?
CheXpert	MINE+Rebal	86±1	86±1	?
CheXpert	MMD	74±4	74±4	?
CheXpert	MMD+Rebal	77±3	77±3	?
OCT	Baseline (ERM)	99±1	99±1	74±15?
OCT	Rebalancing	99±1	99±1	?
OCT	AdvCl	99±1	99±1	?
OCT	AdvCl+Rebal	99±1	99±1	?
OCT	dCor	99±2	99±2	?
OCT	dCor+Rebal	99±0	99±0	?
OCT	MINE	98±1	97±1	?
OCT	MINE+Rebal	99±1	99±0	?
OCT	MMD	99±0	94±1	?
OCT	MMD+Rebal	99±1	97±1	?

分離法は強い偽相関の下で主要タスクの AUROC を改善する。
データ中心の再平衡と特徴分離（特に dCor ベース）の組み合わせは、Original / Balanced / Inverted の全テスト分布で最も堅牢で一貫した改善をもたらす。
敵対的学習は役立つが、データセットとテスト条件によっては明示的な依存性最小化分離より劣る場合がある。
潜在空間の解析は、分類指標だけでは捉えられない分離品質の差を示し、各手法の長所と限界を浮き彫りにする。
ショートカットへのモデル依存は訓練混乱の度合いとともに拡大する。強い混乱は ERM の脆弱性を高めるが、再平衡と分離の組み合わせはこのリスクを緩和する。
全体ベストは Rebalancing と距離相関（dCor）分離を組み合わせることで得られ、計算効率をほぼ維持しつつ堅牢な改善を達成する。

Figure 2 : Overview of label distributions in Morpho-MNIST, CheXpert, and OCT. a shows example images sampled for each label combination, b shows contingency tables of the original training data, and c shows contingency tables of the sub-sampled training data actually used. In the final training dat

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。