QUICK REVIEW

[论文解读] RetinaMask: A Face Mask detector

Mingjie Jiang, Xinqi Fan|arXiv (Cornell University)|May 8, 2020

Face recognition and analysis参考文献 35被引用 156

一句话总结

本文提出RetinaFaceMask，一种基于特征金字塔网络和新型上下文注意力模块的一阶段人脸口罩检测器，以提升检测精度。在公开数据集上，其精度和召回率分别较基线模型提升2.3%和1.5%，召回率分别提升11.0%和5.9%，同时通过使用MobileNet实现轻量化部署。

ABSTRACT

Coronavirus disease 2019 has affected the world seriously. One major protection method for people is to wear masks in public areas. Furthermore, many public service providers require customers to use the service only if they wear masks correctly. However, there are only a few research studies about face mask detection based on image analysis. In this paper, we propose RetinaFaceMask, which is a high-accuracy and efficient face mask detector. The proposed RetinaFaceMask is a one-stage detector, which consists of a feature pyramid network to fuse high-level semantic information with multiple feature maps, and a novel context attention module to focus on detecting face masks. In addition, we also propose a novel cross-class object removal algorithm to reject predictions with low confidences and the high intersection of union. Experiment results show that RetinaFaceMask achieves state-of-the-art results on a public face mask dataset with $2.3\%$ and $1.5\%$ higher than the baseline result in the face and mask detection precision, respectively, and $11.0\%$ and $5.9\%$ higher than baseline for recall. Besides, we also explore the possibility of implementing RetinaFaceMask with a light-weighted neural network MobileNet for embedded or mobile devices.

研究动机与目标

应对公共卫生与安全领域对自动化人脸口罩检测日益增长的需求。
提升真实场景中人脸口罩识别的检测精度与效率。
开发一种在保持高性能的同时可部署于移动或嵌入式设备的模型。
通过引入上下文感知的特征学习与基于置信度的预测过滤，克服现有口罩检测系统存在的局限性。

提出的方法

采用基于RetinaNet的一阶段目标检测框架，集成特征金字塔网络以融合多尺度特征图，提升定位能力。
提出一种新型上下文注意力模块，通过聚焦与口罩检测相关的区域，增强特征表示。
应用跨类别目标移除算法，通过IoU阈值筛选，过滤低置信度预测，减少误报。
设计基于MobileNet作为主干网络的轻量化变体，以实现在移动与嵌入式平台上的高效推理。
在公开人脸口罩检测数据集上端到端训练模型，并通过数据增强提升鲁棒性。
使用焦点损失（focal loss）优化损失函数，以缓解有口罩与无口罩人脸之间的类别不平衡问题。

实验结果

研究问题

RQ1与现有方法相比，结合特征金字塔融合与注意力机制的一阶段检测器是否能在人脸与口罩检测性能上实现更优表现？
RQ2所提出的上下文注意力模块在提升口罩区域检测精度方面效果如何？
RQ3跨类别目标移除算法在减少误报与提升检测可靠性方面的作用有多大？
RQ4该模型是否能在轻量化移动或嵌入式硬件上保持高精度并实现高效部署？

主要发现

与基线模型相比，RetinaFaceMask在人脸检测中精度提升2.3%，在口罩检测中精度提升1.5%。
模型在人脸检测中召回率提升11.0%，在口罩检测中召回率提升5.9%，表明对戴口罩人脸的检测能力显著增强。
所提出的上下文注意力模块显著提升了特征表示能力，尤其在复杂或遮挡场景中表现突出。
跨类别目标移除算法有效减少了低置信度预测，提升了整体检测可靠性。
基于MobileNet的轻量化变体可在移动与嵌入式设备上实现高效部署，且精度损失较小。
总体而言，RetinaFaceMask在公开人脸口罩检测基准上树立了新的SOTA性能记录。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。