QUICK REVIEW

[論文レビュー] When Unseen Domain Generalization is Unnecessary? Rethinking Data Augmentation

Ling Zhang, Xiaosong Wang|arXiv (Cornell University)|Jun 7, 2019

Domain Adaptation and Few-Shot Learning参考文献 13被引用数 39

ひとこと要約

この論文は、3D医用画像分割の未知ドメイン一般化を改善するためのDST（Deep Stacked Transformations）拡張を提案し、DSTが従来の拡張やCycleGANベースのドメイン適応を上回り、大規模なソースデータセットを用いた場合には監視付き最先端と同等の性能に匹敵できることを示します。

ABSTRACT

Recent advances in deep learning for medical image segmentation demonstrate expert-level accuracy. However, in clinically realistic environments, such methods have marginal performance due to differences in image domains, including different imaging protocols, device vendors and patient populations. Here we consider the problem of domain generalization, when a model is trained once, and its performance generalizes to unseen domains. Intuitively, within a specific medical imaging modality the domain differences are smaller relative to natural images domain variability. We rethink data augmentation for medical 3D images and propose a deep stacked transformations (DST) approach for domain generalization. Specifically, a series of n stacked transformations are applied to each image in each mini-batch during network training to account for the contribution of domain-specific shifts in medical images. We comprehensively evaluate our method on three tasks: segmentation of whole prostate from 3D MRI, left atrial from 3D MRI, and left ventricle from 3D ultrasound. We demonstrate that when trained on a small source dataset, (i) on average, DST models on unseen datasets degrade only by 11% (Dice score change), compared to the conventional augmentation (degrading 39%) and CycleGAN-based domain adaptation method (degrading 25%); (ii) when evaluation on the same domain, DST is also better albeit only marginally. (iii) When training on large-sized data, DST on unseen domains reaches performance of state-of-the-art fully supervised models. These findings establish a strong benchmark for the study of domain generalization in medical imaging, and can be generalized to the design of robust deep segmentation models for clinical deployment.

研究の動機と目的

医療画像におけるプロトコル/ベンダー/人口差によるドメイン一般化の課題を動機づける。
画像空間でスタックされた変換を用いてドメインシフトを模擬する体系的な拡張法（DST）を提案する。
未知ドメイン一般化を定量化するために3つの3D分割タスクでDSTを評価する。
より大きなソースデータセットが提供される場合、DSTが最先端の性能に近づくまたはそれを上回ることを示す。

提案手法

各訓練画像に対し、9つのスタック画像空間変換のシーケンスを適用し、変換ごとに2つのハイパーパラメータ（確率と振幅）を用いる。
等方性リサンプリングとDice損失最適化を用いた3D分割バックボーン（AH-Net）を使用する。
3つのタスク（前立腺MRI、左心房MRI、左心室超音波）にわたり、複数の公開データセットを用いて未知ドメインへの一般化を評価する。
DSTを単一の拡張、トップ4の拡張組み合わせ、CycleGANベースのドメイン適応と比較する。
大規模データ環境では、465ボリュームのMRIデータセットを用いてDSTを監視付き手法と比較する。

実験結果

リサーチクエスチョン

RQ1DSTの深層スタックドTransformations拡張は3D医用画像分割の教師なしドメイン一般化を改善するか。
RQ2DSTは未知ドメインにおいて従来の拡張とCycleGANドメイン適応とどう比較されるか。
RQ3より大きな訓練データを前提とした場合、DSTは最先端の監視付き性能に近づくか。
RQ4DST内のどの拡張がMRIと超音波タスク全体で一般化に最も寄与するか。

主な発見

DSTは未知ドメインの一般化を大幅に改善し、未知ドメインでのDice平均は80%、基準値49.8%、CycleGANは63.5%である。
3D MRIではシャープニング、コントラスト、明るさ、および強度の摂動が大きな改善を生み出し、空間変換はタスクによって異なる。
DSTは個別拡張やトップ4の組み合わせよりも優れており、総合的な拡張スイートの必要性を示している。
より大規模な訓練データ（465件）では、未知ドメインにおけるDSTは最先端の監視付き手法のDiceに0.8%未満の差で到達する。
ProstateXの未知データセットでは、DSTはDice 91.9%を達成し、本研究での放射線科医の一致基準と整合している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。