QUICK REVIEW

[论文解读] A Systematic Review of Generalization Research in Medical Image Classification

Sarah Matta, Mathieu Lamard|arXiv (Cornell University)|Mar 18, 2024

AI in cancer detection被引用 1

一句话总结

本篇系统性综述整合了深度学习在医学图像分类中领域泛化技术的最新进展，按目标分布偏移类型（协变量偏移或概念偏移）对方法进行分类。研究发现基于学习的方法在应对两类偏移方面均展现出新兴解决方案的潜力，强调了改进评估协议以及开发多模态、隐私保护框架的必要性，以实现在临床环境中稳健且可泛化的人工智能。

ABSTRACT

Numerous Deep Learning (DL) classification models have been developed for a large spectrum of medical image analysis applications, which promises to reshape various facets of medical practice. Despite early advances in DL model validation and implementation, which encourage healthcare institutions to adopt them, a fundamental questions remain: how can these models effectively handle domain shift? This question is crucial to limit DL models performance degradation. Medical data are dynamic and prone to domain shift, due to multiple factors. Two main shift types can occur over time: 1) covariate shift mainly arising due to updates to medical equipment and 2) concept shift caused by inter-grader variability. To mitigate the problem of domain shift, existing surveys mainly focus on domain adaptation techniques, with an emphasis on covariate shift. More generally, no work has reviewed the state-of-the-art solutions while focusing on the shift types. This paper aims to explore existing domain generalization methods for DL-based classification models through a systematic review of literature. It proposes a taxonomy based on the shift type they aim to solve. Papers were searched and gathered on Scopus till 10 April 2023, and after the eligibility screening and quality evaluation, 77 articles were identified. Exclusion criteria included: lack of methodological novelty (e.g., reviews, benchmarks), experiments conducted on a single mono-center dataset, or articles not written in English. The results of this paper show that learning based methods are emerging, for both shift types. Finally, we discuss future challenges, including the need for improved evaluation protocols and benchmarks, and envisioned future developments to achieve robust, generalized models for medical image classification.

研究动机与目标

为解决医学图像分类中因领域偏移导致的模型性能下降这一关键挑战。
系统性回顾现有领域泛化技术，重点区分协变量偏移与概念偏移。
识别方法论上的缺口，包括评估协议不足及缺乏标准化基准。
探索未来研究方向，如多模态学习、联邦领域泛化以及对对抗性样本和分布外样本的鲁棒性提升。
倡导统一术语与评估框架，以提升人工智能模型的可复现性及临床信任度。

提出的方法

使用Scopus作为主要数据库开展系统性文献回顾，搜索关键词聚焦于领域泛化、医学影像及偏移类型。
应用严格纳入标准：仅包含具有方法学新颖性的原始研究、使用多中心或多领域数据集且为英文文献的研究。
在初步搜索与质量评估后筛选出77篇论文，排除综述、基准研究及使用单中心数据的研究。
基于其旨在缓解的偏移类型，提出领域泛化方法的分类体系：协变量偏移（数据分布变化）与概念偏移（标签关系变化）。
从三类方法分析：数据增强、表示学习与模型优化，特别关注处理标签噪声与数据异质性的技术。
整合联邦学习与多模态学习的见解，以应对临床部署中隐私保护与数据多样性方面的挑战。

实验结果

研究问题

RQ1当前医学图像分类中领域泛化技术的最先进方法有哪些？它们如何按偏移类型进行分类？
RQ2现有方法在多大程度上缓解了因成像协议、设备或采集参数差异导致的协变量偏移？
RQ3当前方法在多大程度上缓解了因阅片者间差异与标签方案不一致导致的概念偏移？
RQ4该研究领域在评估协议、基准测试与术语标准化方面存在哪些关键局限？
RQ5未来研究方向（如多模态学习、联邦领域泛化与基础模型）在实现稳健、可泛化的医学人工智能方面具有哪些潜力？

主要发现

通过系统性筛选共识别出77篇高质量研究，证实了领域泛化在医学影像领域日益增长的研究兴趣。
基于学习的方法在应对协变量偏移与概念偏移方面均展现出作为有效解决方案的潜力，尤其在表示学习与数据增强方面。
评估方面仍存在显著挑战，包括缺乏用于分布外泛化的标准化基准与评估协议。
联邦领域泛化是一种有前景的方法，可在不共享原始数据的前提下保护数据隐私，同时促进多机构协作。
多模态整合（结合影像、电子健康记录与基因组数据）既带来机遇也带来风险，例如若管理不当可能引入生成数据偏移。
目前缺乏统一的偏移类型术语（如协变量偏移与概念偏移），这阻碍了研究间的可复现性与可比性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。