QUICK REVIEW

[论文解读] PanNuke Dataset Extension, Insights and Baselines

Jevgenij Gamper, Navid Alemi Koohbanani|arXiv (Cornell University)|Mar 24, 2020

AI in cancer detection参考文献 45被引用 156

一句话总结

本文将 PanNuke 扩展到跨越 19 种组织的约 190k 个细胞核，基准化分割/分类模型，分析真实世界的泛化，并提供一个半自动、经过质量控制的地面真实数据管线，使用 NuClick。

ABSTRACT

The emerging area of computational pathology (CPath) is ripe ground for the application of deep learning (DL) methods to healthcare due to the sheer volume of raw pixel data in whole-slide images (WSIs) of cancerous tissue slides. However, it is imperative for the DL algorithms relying on nuclei-level details to be able to cope with data from `the clinical wild', which tends to be quite challenging. We study, and extend recently released PanNuke dataset consisting of ~200,000 nuclei categorized into 5 clinically important classes for the challenging tasks of segmenting and classifying nuclei in WSIs. Previous pan-cancer datasets consisted of only up to 9 different tissues and up to 21,000 unlabeled nuclei and just over 24,000 labeled nuclei with segmentation masks. PanNuke consists of 19 different tissue types that have been semi-automatically annotated and quality controlled by clinical pathologists, leading to a dataset with statistics similar to the clinical wild and with minimal selection bias. We study the performance of segmentation and classification models when applied to the proposed dataset and demonstrate the application of models trained on PanNuke to whole-slide images. We provide comprehensive statistics about the dataset and outline recommendations and research directions to address the limitations of existing DL tools when applied to real-world CPath applications.

研究动机与目标

提供一个更大、具有临床代表性的细胞核数据集，具备像素级边界和多类标签。
描述一个利用 NuClick 的半自动地面真相生成与质量控制管线。
在 PanNuke 上对多种最先进的细胞核分割模型进行基准测试并建立基线。
分析训练模型对未见组织（如大脑）的泛化能力，并讨论在真实世界中的适用性。
提供在真实世界计算病理学中解决深度学习工具局限性的指南和研究方向。

提出的方法

通过聚合多个公开数据集来训练用于细胞核的 FCNN 检测器，随后进行迭代的病理学验证，实现半自动地面真相生成。
利用 NuClick，在经验证的细胞核点条件下生成细胞核分割掩码，以获得准确的像素级边界。
进行 patch 提取，并进行三折随机数据集划分，以实现跨组织的公平跨模型比较。
使用多类 Panoptic Quality (mPQ) 与二元 PQ (bPQ) 指标进行评估，真阳性 IoU 阈值为 0.5。
对四种模型（DIST、Mask-RCNN、Micro-Net、HoVer-Net）进行基准测试，改造为实现同时的细胞核分割和分类；还包括一个基于检测的 U-Net 变体。
通过将 PanNuke 训练的模型应用于未见的大脑组织来进行泛化测试。

实验结果

研究问题

RQ1PanNuke 是否能够在包含临床相关伪影的 19 种多样组织中提供稳健的细胞核级分割与分类？
RQ2使用标准化的多类 PQ 指标，最先进的分割模型在 PanNuke 上的表现如何？
RQ3在 PanNuke 上的训练是否会泛化到未见组织，且在细胞核类别预测方面的局限性是什么？
RQ4在真实世界计算病理数据上部署当前深度学习工具时，其实际影响和局限性是什么？

主要发现

PanNuke 包含来自 19 种组织的 20,000 以上 WSIs 的 189,744 个被穷举注释的细胞核，且有 5 个细胞核类别。
HoVer-Net 在跨组织的细胞核分割与分类上实现了最佳平均性能（mPQ 和 bPQ）。
死亡细胞核具有挑战性（尺寸较小导致 PQ 低），而类别不平衡会影响某些类别的性能。
在以细胞核质心进行检测度量评估时，分割模型通常优于基于检测的方法。
在未见的大脑组织上，PanNuke 训练的模型展现出有前景的分割能力，表明对分割有泛化性，但在新组织中的类别预测较具挑战。
提供评估框架（mPQ/bPQ）并发布评估代码，促进可重复性和进一步的基准测试。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。