QUICK REVIEW

[论文解读] Tversky loss function for image segmentation using 3D fully convolutional deep networks

Seyed Sadegh Mohseni Salehi, Deniz Erdoğmuş|arXiv (Cornell University)|Jun 18, 2017

AI in cancer detection被引用 27

一句话总结

本文提出一种用于3D全卷积网络的Tversky损失函数，以改善在高度不平衡医学数据（如多发性硬化病变检测）中的图像分割性能。通过调整Tversky指数的超参数以优先考虑召回率而非精确率，该方法在F₂分数、Dice系数和精确率-召回率曲线下方面积方面显著优于标准Dice损失，其中β=0.7时性能最优。

ABSTRACT

Fully convolutional deep neural networks carry out excellent potential for fast and accurate image segmentation. One of the main challenges in training these networks is data imbalance, which is particularly problematic in medical imaging applications such as lesion segmentation where the number of lesion voxels is often much lower than the number of non-lesion voxels. Training with unbalanced data can lead to predictions that are severely biased towards high precision but low recall (sensitivity), which is undesired especially in medical applications where false negatives are much less tolerable than false positives. Several methods have been proposed to deal with this problem including balanced sampling, two step training, sample re-weighting, and similarity loss functions. In this paper, we propose a generalized loss function based on the Tversky index to address the issue of data imbalance and achieve much better trade-off between precision and recall in training 3D fully convolutional deep neural networks. Experimental results in multiple sclerosis lesion segmentation on magnetic resonance images show improved F2 score, Dice coefficient, and the area under the precision-recall curve in test data. Based on these results we suggest Tversky loss function as a generalized framework to effectively train deep neural networks.

研究动机与目标

解决医学图像分割中的数据不平衡问题，特别是在病变体素远多于非病变体素的情况下。
改善基于深度学习的分割中精确率与召回率之间的权衡，尤其侧重于提高召回率以减少假阴性。
开发一种广义损失函数，在高度不平衡场景（如多发性硬化病变分割）中优于标准Dice损失。
在真实MRI数据上，通过3D全卷积U-Net架构验证Tversky损失的有效性。

提出的方法

基于Tversky指数（Dice系数和Fβ分数的推广）提出一种Tversky损失函数，以平衡精确率与召回率。
将Tversky损失层集成到3D U-Net架构中，实现具有类别不平衡感知能力的端到端训练。
使用超参数α和β控制对假阳性（FPs）和假阴性（FNs）的惩罚，其中较高的β值可增强对FN的敏感性。
采用具有跳跃连接的3D全卷积网络，以结合高分辨率局部特征与低分辨率全局特征。
使用不同α和β值训练网络，以评估其对分割性能的影响。
采用标准指标评估性能，包括Dice相似系数（DSC）、F₂分数、精确率-召回率曲线下方面积（APR）以及敏感性/特异性。

实验结果

研究问题

RQ1与标准Dice损失相比，Tversky损失函数是否能在高度不平衡的医学图像数据中提升分割性能？
RQ2Tversky指数中的超参数α和β如何影响病变分割中的精确率-召回率权衡？
RQ3使用β=0.7的Tversky损失是否能实现更好的泛化能力并提高对小尺寸或稀疏病变的检测敏感性？
RQ4在不平衡数据集中，Tversky损失是否比Dice损失更有效地提升精确率-召回率曲线下方面积（APR）？
RQ5Tversky损失能否作为训练具有高度类别不平衡的医学图像分割任务深度网络的通用框架？

主要发现

当β=0.7时，Tversky损失函数的F₂分数达到最高值57.32，显著优于使用β=0.5的Dice损失（F₂分数为51.77）。
β=0.7时，Tversky损失的Dice系数为56.42，高于Dice损失的53.42，表明与真实标签的重叠程度更高。
在β=0.7时，精确率-召回率曲线下方面积（APR）达到56.04，为所有测试配置中的最高值，证实其在不平衡数据上的优越性能。
在β=0.7时，敏感性（召回率）达到56.85%，显著高于Dice损失的49.85%，表明对病变的检测能力更强。
在β=0.7下使用Tversky损失训练的网络成功检测到Dice损失模型完全失败的低密度病变病例。
结果表明，调节β值可对模型进行微调以优先考虑召回率，这在临床应用中至关重要，因为假阴性更具危害性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。