QUICK REVIEW

[论文解读] HarDNet-MSEG: A Simple Encoder-Decoder Polyp Segmentation Neural Network that Achieves over 0.9 Mean Dice and 86 FPS

Chien‐Hsiang Huang, Hung-Yu Wu|arXiv (Cornell University)|Jan 18, 2021

Vehicle License Plate Recognition参考文献 33被引用 187

一句话总结

HarDNet-MSEG 使用 HarDNet68 主体网络，配 cascaded partial decoder，在高速度下实现最先进的息肉分割精度（Kvasir-SEG 的平均 Dice >0.9），速度快（86 FPS）。

ABSTRACT

We propose a new convolution neural network called HarDNet-MSEG for polyp segmentation. It achieves SOTA in both accuracy and inference speed on five popular datasets. For Kvasir-SEG, HarDNet-MSEG delivers 0.904 mean Dice running at 86.7 FPS on a GeForce RTX 2080 Ti GPU. It consists of a backbone and a decoder. The backbone is a low memory traffic CNN called HarDNet68, which has been successfully applied to various CV tasks including image classification, object detection, multi-object tracking and semantic segmentation, etc. The decoder part is inspired by the Cascaded Partial Decoder, known for fast and accurate salient object detection. We have evaluated HarDNet-MSEG using those five popular datasets. The code and all experiment details are available at Github. https://github.com/james128333/HarDNet-MSEG

研究动机与目标

通过结肠镜成像推动快速、准确的息肉分割，以帮助预防CRC。
提出一种带有节省内存的骨干的简单双编码器-解码器架构。
在五大息肉数据集上进行评估，以确立SOTA的准确性和速度。
与 U-Net、PraNet 及其他领先模型进行对比，以量化 Dice、IoU 和 FPS 的提升。

提出的方法

采用 HarDNet68 作为骨干以降低内存带宽并提升推理速度。
使用受快速显著目标检测启发的带级联部分解码器的简单的编码器-解码器架构。
在跳跃连接中加入 Receptive Field Block (RFB) 以增大感受野。
在上采样后通过逐元素相乘进行密集聚合以融合特征。
采用两种受前人工作启发的不同设置进行训练，以确保在各数据集上的鲁棒对比。

实验结果

研究问题

RQ1HarDNet-MSEG 能否在标准数据集上在 mean Dice 和 IoU 方面超越当前的 SOTA 息肉分割方法？
RQ2带有 HarDNet68 主干的简单编码器-解码器是否在保持高推理速度的同时实现有竞争力的准确性？
RQ3级联部分解码器和 RFB 启用的跳跃连接对边界准确性和小息肉分割有何影响？
RQ4在 Kvasir-SEG、CVC-ColonDB、EndoScene、ETIS-Larib Polyp DB 和 CVC-Clinic DB 上，HarDNet-MSEG 相对于 PraNet 与 U-Net 变体的表现如何？

主要发现

mIoU	mDice	F2-score	Precision	Recall	Overall Acc.	FPS
U-Net	0.471	0.597	0.598	0.672	0.617	0.894	11
ResUNet	0.572	0.690	0.699	0.745	0.725	0.917	15
ResUNet++	0.613	0.714	0.720	0.784	0.742	0.917	7
FCN8	0.737	0.831	0.825	0.882	0.835	0.952	25
HRNet	0.759	0.845	0.847	0.878	0.859	0.952	12
DoubleUNet	0.733	0.813	0.820	0.861	0.840	0.949	7.5
PSPNet	0.744	0.841	0.831	0.890	0.836	0.953	17
DeepLabv3+[ResNet50]	0.776	0.857	0.855	0.891	0.862	0.961	28
DeepLabv3+[ResNet101]	0.786	0.864	0.857	0.906	0.859	0.961	17
U-Net[ResNet34]	0.810	0.876	0.862	0.944	0.860	0.968	35
HarDNet-MSEG	0.848	0.904	0.915	0.907	0.923	0.969	86.7

HarDNet-MSEG 在所有五个数据集上实现了最先进的 mean Dice 与 mIoU。
在 Kvasir-SEG 上，它在 RTX 2080 Ti 上实现 0.904 的 mean Dice，速度为 86.7 FPS。
在 mean Dice 与 mIoU 指标上持续优于 U-Net[ResNet34] 与 PraNet。
在保持或提升准确性的同时，展示出比若干竞争模型更快的推理速度（FPS）。
定性结果中展现出强边界界定和整体分割质量。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。