QUICK REVIEW

[論文レビュー] A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks

Dong Li, Guihong Wan|ArXiv.org|Jan 27, 2025

AI in cancer detection被引用数 3

ひとこと要約

この調査は計算病理ファウンデーションモデル（CPathFMs）をレビューし、データセット、適応戦略（単一モーダルおよび多モーダル）、評価タスクを詳述し、課題と将来の方向性を論じる。

ABSTRACT

Computational pathology foundation models (CPathFMs) have emerged as a powerful approach for analyzing histopathological data, leveraging self-supervised learning to extract robust feature representations from unlabeled whole-slide images. These models, categorized into uni-modal and multi-modal frameworks, have demonstrated promise in automating complex pathology tasks such as segmentation, classification, and biomarker discovery. However, the development of CPathFMs presents significant challenges, such as limited data accessibility, high variability across datasets, the necessity for domain-specific adaptation, and the lack of standardized evaluation benchmarks. This survey provides a comprehensive review of CPathFMs in computational pathology, focusing on datasets, adaptation strategies, and evaluation tasks. We analyze key techniques, such as contrastive learning and multi-modal integration, and highlight existing gaps in current research. Finally, we explore future directions from four perspectives for advancing CPathFMs. This survey serves as a valuable resource for researchers, clinicians, and AI practitioners, guiding the advancement of CPathFMs toward robust and clinically applicable AI-driven pathology solutions.

研究の動機と目的

CPathFMsの事前学習に使用される既存の病理データセットを分析し、データソース間の一般化の課題を特定する。
単一モーダルおよび多モーダルのCPathFMsの適応技術を系統的にレビューし、強みと限界を明らかにする。
事前学習済みCPathFMsの評価タスクの状況をまとめ、六つの視点という分類法に整理する。
堅牢で臨床適用可能なCPathFMsを進展させるためのギャップを特定し、将来の方向性を提案する。

提案手法

データセット、適応戦略、評価タスクに焦点を当てた最新のCPathFM文献の調査と統合。
前学習データタイプ（WSI、タイル、キャプション、レポート）とモダリティ（単一モーダル対多モーダル）の分類法の体系的整理。
病理に適用されたSSLフレームワーク（DINO、DINOv2、MAE、CLIP、CoCa）とそのドメイン固有の用途の説明。
CPathFMsの六つの視点による評価分類と、ファインチューニング有無の評価戦略の議論。

実験結果

リサーチクエスチョン

RQ1現在のCPathFMsを支えるデータセットとデータクレンジング実践は何か、そしてそれらが明らかにする一般化の課題は何か？
RQ2CPathFMsに用いられる適応戦略（単一モーダル vs 多モーダル）は何か、そしてそれぞれの利点と限界は何か？
RQ3トレーニング後のCPathFMsはどのように評価され、評価の全体像を六つの視点はどう構成されるか？
RQ4CPathFMsの標準ベンチマークと評価プロトコルにはどのようなギャップがあり、将来の方向性は何か？

主な発見

CPathFMsの事前学習データセットはますます大規模で多様化しており、公開データと私設WSIおよび複数ソースからのタイル-キャプションデータを組み合わせることが多い。
単一モーダルCPathFMsは主に病理タイルやWSI上で自己教師あり学習（DINO/DINOv2、MAE）を利用し、オンラインパッチングや KDE ベースの強化などドメイン固有の適応を行う。
多モーダルCPathFMsはCLIP/CoCaフレームワークを用いて画像-テキストデータを整列させ、横断モーダル検索・生成を可能にし、一部のモデルは病理タスクのためにLLMを統合している。
評価タスクは標準化されておらず、分類、検索、生成、セグメンテーション、生存予測、VQA などを含み、ファインチューニング有無の両方の設定が用いられている。
標準化されたベンチマーク、機関間の一般化、WSIレベルの多モーダルLLMおよびMxIF特化モデルの開発には顕著なギャップがある。
この調査はCPathFMsを評価する六つの視点を強調し、信頼性が高く、説明可能で臨床的に翻訳可能なモデルの必要性を強調している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。