QUICK REVIEW

[论文解读] Malaria detection using Deep Convolution Neural Network

Sumit Kumar, Vardhan, Harsh|arXiv (Cornell University)|Mar 4, 2023

Digital Imaging for Blood Diseases被引用 14

一句话总结

本文构建了一个两层卷积神经网络，用于将疟疾感染的红血细胞与未感染的红血球进行分类，使用 NIH 的疟疾数据集，在资源受限设备上实现轻量模型，测试准确率约为 95.4%。

ABSTRACT

The latest WHO report showed that the number of malaria cases climbed to 219 million last year, two million higher than last year. The global efforts to fight malaria have hit a plateau and the most significant underlying reason is international funding has declined. Malaria, which is spread to people through the bites of infected female mosquitoes, occurs in 91 countries but about 90% of the cases and deaths are in sub-Saharan Africa. The disease killed 4,35,000 people last year, the majority of them children under five in Africa. AI-backed technology has revolutionized malaria detection in some regions of Africa and the future impact of such work can be revolutionary. The malaria Cell Image Data-set is taken from the official NIH Website NIH data. The aim of the collection of the dataset was to reduce the burden for microscopists in resource-constrained regions and improve diagnostic accuracy using an AI-based algorithm to detect and segment the red blood cells. The goal of this work is to show that the state of the art accuracy can be obtained even by using 2 layer convolution network and show a new baseline in Malaria detection efforts using AI.

研究动机与目标

在资金与人力资源受限的背景下，推动疟疾检测的改进。
证明一个紧凑型 CNN 能在疟疾细胞图像上达到接近最先进水平的准确率。
展示在资源受限设备（如手机）上的部署可行性。

提出的方法

以 LeNet5 为起点并适配到 3 通道 RGB 图像。
构建一个两层卷积网络，每层包含 32 个 3x3 过滤器，输入尺寸为 64x64。
应用最大池化、批量归一化和 dropout 进行正则化。
将 14x14x32 的特征图展平，以输入到一个两层全连接网络（512 和 256 个神经元）。
使用 Adam 优化器以 categorical_crossentropy 损失进行训练；在验证准确率达到 95% 时使用提前停止。

实验结果

研究问题

RQ1一个紧凑型 CNN（两层）是否能够在疟疾细胞图像分类上达到与更深层模型相近的高准确率？
RQ2在 NIH 疟疾数据集上，轻量级模型对寄生化（感染）与未感染细胞的表现如何？
RQ3该模型是否适合在资源受限设备上部署且不需要大量预处理？

主要发现

训练在达到验证准确率 95% 时提前停止，共完成 50 轮。
测试准确率约为 95.4% 。
模型大约包含 3.36 百万个可学习参数，在给定硬件条件下大约训练了 20 分钟。
最终架构输出 2 类 softmax，隐藏层使用 ReLU 激活。
预处理将图像降至 64x64 再输入到 CNN。
所用数据集包含 27,558 张图像，感染与未感染细胞数量相等。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。