QUICK REVIEW

[论文解读] Classification and Disease Localization in Histopathology Using Only Global Labels: A Weakly-Supervised Approach

Pierre Courtiol, Eric W. Tramel|arXiv (Cornell University)|Feb 1, 2018

AI in cancer detection参考文献 32被引用 99

一句话总结

本文提出 CHOWDER，一种弱监督方法，利用仅图像级标签对全 slides 病理切片图像中的疾病进行分类和定位，在没有像素级标注的情况下实现具有竞争力的结果。

ABSTRACT

Analysis of histopathology slides is a critical step for many diagnoses, and in particular in oncology where it defines the gold standard. In the case of digital histopathological analysis, highly trained pathologists must review vast whole-slide-images of extreme digital resolution ($100,000^2$ pixels) across multiple zoom levels in order to locate abnormal regions of cells, or in some cases single cells, out of millions. The application of deep learning to this problem is hampered not only by small sample sizes, as typical datasets contain only a few hundred samples, but also by the generation of ground-truth localized annotations for training interpretable classification and segmentation models. We propose a method for disease localization in the context of weakly supervised learning, where only image-level labels are available during training. Even without pixel-level annotations, we are able to demonstrate performance comparable with models trained with strong annotations on the Camelyon-16 lymph node metastases detection challenge. We accomplish this through the use of pre-trained deep convolutional networks, feature embedding, as well as learning via top instances and negative evidence, a multiple instance learning technique from the field of semantic segmentation and object detection.

研究动机与目标

解决病理组织学图像分析中像素级标注成本高、稀缺的问题。
开发一种弱监督学习方法，仅使用全-slide 标签即可本地化疾病区域。
利用带有多实例学习的预训练 CNN 特征来识别最具指示性的块和负面证据。
生成可解释的定位图，与真实区域对齐且不需要分割掩码。

提出的方法

使用从 224x224 瓷砖中提取的 ResNet-50 特征（2048 维）作为瓦片嵌入。
跨瓦片索引应用一维嵌入层以产生一致的瓦片级嵌入。
保留前 R 个和后 R 个瓦片嵌入（前置实例和负面证据）用于 MIL 基于的分类。
在所选的前/后嵌入上用多层感知机进行分类，以获得切片级预测。
基线聚合方法（MaxPool、MeanPool）与提出的 CHOWDER 方法进行比较。
通过 Adam 训练，使用二进制交叉熵损失，采用集成（E 网络）以提高泛化能力。

实验结果

研究问题

RQ1是否可以仅使用图像级标签（弱监督）在全-slide病理图像中实现疾病定位，而无需像素级掩码？
RQ2在异构 WSI 数据集上，基于 MIL 的前部实例与负面证据的方法相较于标准池化的表现如何？
RQ3在 Camelyon-16 和 TCGA 数据集上，CHOWDER 能在何种程度上实现疾病区域定位，同时达到或超过最先进的弱监督方法？

主要发现

CHOWDER 在不同数据集上相比强基线池化和 WELDON 取得了具竞争力或更高的 AUC。
在 Camelyon-16 上，CHOWDER 结合合适的 R 能产生比基线更高的 AUC，并在不使用分割掩码的情况下的表现与人类水平相当。
在 TCGA-Lung 上，CHOWDER 展现出在弥散性疾病情形中的鲁棒性，R 在 {1,10,100} 时的 AUC 约为 0.90–0.92。
CHOWDER 生成的定位图在没有像素级训练数据的情况下突出显示肿瘤区域，在定性可视化中与地面实况标注高度一致。
与 Camelyon-16 上已发表的最强方法相比，CHOWDER 在没有地面 truth 分割掩码的情况下实现显著的性能提升（AUC 接近 Bejnordi 等 2017）。
对多个 CHOWDER 模型进行集成有助于在数据集规模较小、超参数调优有限的情况下稳定性能。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。