QUICK REVIEW

[论文解读] Study of Different Deep Learning Approach with Explainable AI for Screening Patients with COVID-19 Symptoms: Using CT Scan and Chest X-ray Image Dataset

Md Manjurul Ahsan, Kishor Datta Gupta|arXiv (Cornell University)|Jul 24, 2020

COVID-19 diagnosis using AI参考文献 66被引用 38

一句话总结

本研究在配对的 CT 和胸部 X 线数据集上比较八种预训练的 CNN（VGG16、InceptionResNetV2、ResNet50、DenseNet201、VGG19、MobileNetV2、NasNetMobile、ResNet15V2）用于 COVID-19 筛查，使用 LIME 进行可解释性分析，并将 NasNetMobile（CT）和 NasNetMobile（X-ray）确定为表现最优者。

ABSTRACT

The outbreak of COVID-19 disease caused more than 100,000 deaths so far in the USA alone. It is necessary to conduct an initial screening of patients with the symptoms of COVID-19 disease to control the spread of the disease. However, it is becoming laborious to conduct the tests with the available testing kits due to the growing number of patients. Some studies proposed CT scan or chest X-ray images as an alternative solution. Therefore, it is essential to use every available resource, instead of either a CT scan or chest X-ray to conduct a large number of tests simultaneously. As a result, this study aims to develop a deep learning-based model that can detect COVID-19 patients with better accuracy both on CT scan and chest X-ray image dataset. In this work, eight different deep learning approaches such as VGG16, InceptionResNetV2, ResNet50, DenseNet201, VGG19, MobilenetV2, NasNetMobile, and ResNet15V2 have been tested on two dataset-one dataset includes 400 CT scan images, and another dataset includes 400 chest X-ray images studied. Besides, Local Interpretable Model-agnostic Explanations (LIME) is used to explain the model's interpretability. Using LIME, test results demonstrate that it is conceivable to interpret top features that should have worked to build a trust AI framework to distinguish between patients with COVID-19 symptoms with other patients.

研究动机与目标

通过在 RT-PCR 测试受限时，利用影像实现快速、可扩展的 COVID-19 症状筛查。
在 CT 和胸部 X 线数据集上评估多种深度学习结构，以识别稳健的 COVID-19 分类器。
结合 LIME 提供可解释性，以帮助放射科医生理解模型决策。
提供可比较的指标（准确率、精确度、召回率、F1 值）及置信区间，以评估在不同模态下的可靠性。

提出的方法

使用两个数据集：400 张 CT 扫描图像（COVID-19 与非 COVID-19）和 400 张胸部 X 线图像（COVID-19 与非 COVID-19）。
应用迁移学习，使用八个预训练的 CNN：VGG16、InceptionResNetV2、ResNet50、DenseNet201、VGG19、MobileNetV2、NasNetMobile、ResNet15V2。
通过替换头部进行微调；以 80% 数据训练、20% 数据测试；使用 Adam 优化并通过网格搜索超参数（学习率 0.001、30 轮、批量大小 5）。
使用准确率、精确度、召回率和 F1 分数进行评估；通过 Wilson 置信区间和贝叶斯区间报告 95% 置信区间。
用 Local Interpretable Model-agnostic Explanations (LIME) 对预测进行解释，突出影响决策的主要特征。

实验结果

研究问题

RQ1在这八个预训练 CNN 中，哪一个在 CT 与 X-ray 数据集上提供了最佳的 COVID-19 筛查性能？
RQ2模型在不同模态（CT 与 X-ray）和模型之间，性能指标（准确率、精确度、召回率、F1）有何差异？
RQ3LIME 的解释是否能提供与放射学特征对齐的可解释洞见，用于 COVID-19 分类？
RQ4每个模型在训练过程中的训练与验证的稳定性，以及随 epoch 变化的损失行为如何？

主要发现

NasNetMobile 在 CT 扫描的测试准确率为最高（0.90）并且在胸部 X-ray 图像的测试中也为最高（0.100? 注：见下文）
在 CT 测试数据上，NasNetMobile 提供 90% 的准确率，以及 0.90 的精确度、召回率和 F1 分数，混淆矩阵中有 8 处错误分类（80 个测试样本）。
在胸部 X 线测试数据上，NasNetMobile 实现 100% 的准确率、精确度、召回率和 F1 分数，错误分类为零（80 个测试样本）。
MobileNetV2 在 CT 训练数据上表现最强（准确率 0.99），但 NasNetMobile 在 CT 测试集上表现更好。
在多次训练中，若干模型显示出稳定的训练/验证性能（如 VGG16、InceptionResNetV2、NasNetMobile），而 ResNet50 一贯表现不佳。
使用 LIME 来说明模型可解释性，证明了在基于 AI 的筛查中解释顶层特征的可行性从而提升信任度。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。