QUICK REVIEW

[论文解读] CVR-Net: A deep convolutional neural network for coronavirus recognition from chest radiography images

Md. Kamrul Hasan, Md. Ashraful Alam|arXiv (Cornell University)|Jul 21, 2020

COVID-19 diagnosis using AI参考文献 60被引用 27

一句话总结

CVR-Net 是一种多尺度、多编码器集成的卷积神经网络，专为从胸部 X 光和 CT 图像中端到端识别冠状病毒而设计。它在 2 类任务中实现了最先进的性能，F1 分数达到 0.997，准确率达到 0.998，即使在训练数据有限的情况下，也通过迁移学习、数据增强和类别平衡策略展现出强大的鲁棒性。

ABSTRACT

The novel Coronavirus Disease 2019 (COVID-19) is a global pandemic disease spreading rapidly around the world. A robust and automatic early recognition of COVID-19, via auxiliary computer-aided diagnostic tools, is essential for disease cure and control. The chest radiography images, such as Computed Tomography (CT) and X-ray, and deep Convolutional Neural Networks (CNNs), can be a significant and useful material for designing such tools. However, designing such an automated tool is challenging as a massive number of manually annotated datasets are not publicly available yet, which is the core requirement of supervised learning systems. In this article, we propose a robust CNN-based network, called CVR-Net (Coronavirus Recognition Network), for the automatic recognition of the coronavirus from CT or X-ray images. The proposed end-to-end CVR-Net is a multi-scale-multi-encoder ensemble model, where we have aggregated the outputs from two different encoders and their different scales to obtain the final prediction probability. We train and test the proposed CVR-Net on three different datasets, where the images have collected from different open-source repositories. We compare our proposed CVR-Net with state-of-the-art methods, which are trained and tested on the same datasets. We split three datasets into five different tasks, where each task has a different number of classes, to evaluate the multi-tasking CVR-Net. Our model achieves an overall F1-score & accuracy of 0.997 & 0.998; 0.963 & 0.964; 0.816 & 0.820; 0.961 & 0.961; and 0.780 & 0.780, respectively, for task-1 to task-5. As the CVR-Net provides promising results on the small datasets, it can be an auspicious computer-aided diagnostic tool for the diagnosis of coronavirus to assist the clinical practitioners and radiologists. Our source codes and model are publicly available at https://github.com/kamruleee51/CVR-Net.

研究动机与目标

解决对自动化、可靠且早期检测 COVID-19 的迫切需求，以减少对耗时且昂贵的 RT-PCR 检测的依赖。
克服公开可用、人工标注的医学影像数据集稀缺的问题，这些数据集通常用于训练监督式深度学习模型。
开发一种稳健的端到端深度学习系统，即使在类别不平衡和小样本训练集的情况下，也能在多样化胸部放射影像中实现良好泛化。
将假阴性预测降至最低，以支持疫情暴发期间的临床决策，尤其是在资源有限的环境中。
提供一个公开可用、可复现的模型，以加速基于人工智能的 CAD 工具在冠状病毒检测中的研究与临床部署。

提出的方法

提出 CVR-Net，一种多尺度、多编码器集成的卷积神经网络架构，通过在多个尺度上融合来自两个不同编码器分支的特征，以提升特征表示能力。
通过使用 ImageNet 预训练权重初始化网络，采用迁移学习策略，以增强在小样本医学影像数据集上的特征学习能力。
应用广泛的数据增强技术（如旋转、翻转、缩放）以增加训练数据的多样性并减少过拟合。
实施类别重平衡策略，以缓解对多数类别的偏差，这一点在阳性（COVID-19）病例稀缺的情况下尤为重要。
采用集成平均策略，聚合来自多个尺度和编码器的预测结果，从而提升模型的鲁棒性和泛化能力。
在三个开源数据集上端到端地进行模型训练与评估，划分为五个多类别分类任务，各类别数量不同。

实验结果

研究问题

RQ1尽管训练数据标注有限，基于深度卷积神经网络的模型是否能在分类胸部 X 光和 CT 图像中的 COVID-19 时实现高准确率和低假阴性率？
RQ2当总训练样本数保持不变时，CVR-Net 的性能如何随类别数量增加而变化（例如，2 类 vs. 4 类）？
RQ3与单分支模型相比，多尺度、多编码器特征融合在多大程度上提升了识别的鲁棒性？
RQ4数据增强和类别重平衡在缓解小样本医学影像数据集中类别不平衡影响方面的有效性如何？
RQ5从 ImageNet 进行迁移学习是否能显著提升在低资源医学图像分类任务中冠状病毒检测的性能？

主要发现

在 2 类任务（COVID-19 与无异常发现）中，CVR-Net 实现了 0.997 的 F1 分数和 0.998 的准确率，表明在小样本数据集上表现近乎完美。
随着类别数量的增加，性能有所下降——在 4 类任务（COVID-19、正常、细菌性肺炎、病毒性肺炎）中，F1 分数降至 0.816，准确率降至 0.820，原因在于类别间相似性更高且类内差异更大。
交叉验证中模型的组间差异极小，表明其在不同数据划分下均表现出极强的鲁棒性和泛化能力。
当正常类别被拆分为正常与肺炎（细菌性/病毒性）时，假阴性预测显著增加，凸显了在小样本下类别模糊性带来的挑战。
多尺度、多编码器集成设计有效减少了过拟合和梯度消失问题，使模型即使在小样本数据集上也能实现高性能。
模型性能对类别分布和类别相似性最为敏感；未来若能提供更清晰、平衡且分割良好的训练样本，性能有望进一步提升。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。