QUICK REVIEW

[論文レビュー] PanNuke Dataset Extension, Insights and Baselines

Jevgenij Gamper, Navid Alemi Koohbanani|arXiv (Cornell University)|Mar 24, 2020

AI in cancer detection参考文献 45被引用数 156

ひとこと要約

この論文は PanNuke を ~190k 個の核に対して 19 種類の組織に拡張し、セグメンテーション/分類モデルをベンチマークし、実世界の一般化を分析し、NuClick を用いた半自動で QC 付きの Ground-Truth パイプラインを提供します。

ABSTRACT

The emerging area of computational pathology (CPath) is ripe ground for the application of deep learning (DL) methods to healthcare due to the sheer volume of raw pixel data in whole-slide images (WSIs) of cancerous tissue slides. However, it is imperative for the DL algorithms relying on nuclei-level details to be able to cope with data from `the clinical wild', which tends to be quite challenging. We study, and extend recently released PanNuke dataset consisting of ~200,000 nuclei categorized into 5 clinically important classes for the challenging tasks of segmenting and classifying nuclei in WSIs. Previous pan-cancer datasets consisted of only up to 9 different tissues and up to 21,000 unlabeled nuclei and just over 24,000 labeled nuclei with segmentation masks. PanNuke consists of 19 different tissue types that have been semi-automatically annotated and quality controlled by clinical pathologists, leading to a dataset with statistics similar to the clinical wild and with minimal selection bias. We study the performance of segmentation and classification models when applied to the proposed dataset and demonstrate the application of models trained on PanNuke to whole-slide images. We provide comprehensive statistics about the dataset and outline recommendations and research directions to address the limitations of existing DL tools when applied to real-world CPath applications.

研究の動機と目的

ピクセルレベルの境界と多クラスラベルを備えた、より大規模で臨床的に代表的な核データセットを提供する。
NuClick を活用した半自動の Ground-Truth 生成と品質管理パイプラインを説明する。
PanNuke 上で複数の最先端核セグメンテーションモデルをベンチマークし、ベースラインを確立する。
訓練済みモデルの unseen tissues への一般化を分析し（例：脳）、実世界での適用可能性を議論する。
実世界の計算病理学における DL ツールの限界に対処するための指針と研究方向を提案する。

提案手法

複数の公開データセットを統合して核を検出する FCNN デテクタを訓練し、その後反復的な病理検証を行うことで半自動の Ground-Truth 生成を行う。
NuClick を用いて検証済みの核ドットを条件として核セグメンテーションマスクを生成し、正確なピクセル単位の境界を作成する。
パッチ抽出と組織間で公正なクロスモデル比較を可能にするための三つ折りのランダムデータセット分割。
マルチクラス Panoptic Quality (mPQ) および Binary PQ (bPQ) 指標を用いた評価。真陽性の IoU 閾値は 0.5。
同時に核のセグメンテーションと分類を行うよう適合させた four models (DIST, Mask-RCNN, Micro-Net, HoVer-Net) のベンチマーク；検出ベースの U-Net バリアントも含む。
PanNuke で訓練されたモデルを unseen brain tissue に適用する一般化テスト。

実験結果

リサーチクエスチョン

RQ1Can PanNuke provide robust, nuclei-level segmentation and classification across 19 diverse tissues with clinically relevant artifacts?
RQ2How do state-of-the-art segmentation models perform on PanNuke using standardized, multi-class PQ metrics?
RQ3Does training on PanNuke generalize to unseen tissues, and what are the limitations in terms of nucleus category prediction?
RQ4What are the practical implications and limitations of current DL tools when deployed on real-world computational pathology data?

主な発見

PanNuke contains 189,744 exhaustively annotated nuclei from 20,000+ WSIs across 19 tissues, with 5 nucleus classes.
HoVer-Net achieves the best average performance (mPQ and bPQ) across tissues for nucleus segmentation and classification.
Dead nuclei are challenging (small size yields low PQ), and class imbalance affects performance in some categories.
Segmentation models generally outperform a detection-based approach when evaluated on nucleus centroids for detection metrics.
Models trained on PanNuke show promising segmentation on an unseen brain tissue, indicating generalization for segmentation but challenging category prediction in a new tissue.
Providing an evaluation framework (mPQ/bPQ) and releasing evaluation code promotes reproducibility and further benchmarking.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。