[论文解读] Orientation Aware Object Detection with Application to Firearms
本文提出了一种面向枪械检测的定向感知目标检测器(OAOD),通过两阶段流程改进检测性能:首先预测目标方向以旋转候选区域,然后对旋转后的区域进行分类与定位。该方法采用定向边界框实现更优的定位效果,在新收集的11,000张图像标注的ITU枪械(ITUF)数据集上,性能优于当前最先进检测器。
Automatic detection of firearms is important for enhancing security and safety of people, however, it is a challenging task owing to the wide variations in shape, size and appearance of firearms. To handle these challenges we propose an Orientation Aware Object Detector (OAOD) which has achieved improved firearm detection and localization performance. The proposed detector has two phases. In the Phase-1 it predicts orientation of the object which is used to rotate the object proposal. Maximum area rectangles are cropped from the rotated object proposals which are again classified and localized in the Phase-2 of the algorithm. The oriented object proposals are mapped back to the original coordinates resulting in oriented bounding boxes which localize the weapons much better than the axis aligned bounding boxes. Being orientation aware, our non-maximum suppression is able to avoid multiple detection of the same object and it can better resolve objects which lie in close proximity to each other. This two phase system leverages OAOD to predict object oriented bounding boxes while being trained only on the axis aligned boxes in the ground-truth. In order to train object detectors for firearm detection, a dataset consisting of around eleven thousand firearm images is collected from the internet and manually annotated. The proposed ITU Firearm (ITUF) dataset contains wide range of guns and rifles. The OAOD algorithm is evaluated on the ITUF dataset and compared with current state of the art object detectors. Our experiments demonstrate the excellent performance of the proposed detector for the task of firearm detection.
研究动机与目标
- 为解决枪械在形状、尺寸和外观上的高度可变性所带来的检测挑战。
- 通过引入目标方向感知能力,将目标定位精度超越轴对齐边界框。
- 开发一种可使用轴对齐真实标注进行训练,但能预测定向边界框的检测器。
- 通过定向感知的非极大值抑制,减少近距离枪械之间的误检与重叠检测。
- 构建并发布一个大规模、多样化的枪械数据集(ITUF),用于检测模型的基准测试。
提出的方法
- OAOD框架采用两阶段方法:阶段一通过区域建议网络预测目标候选区域的方向。
- 根据预测的方向旋转候选区域,使其与目标实际方向对齐。
- 从旋转后的候选区域中裁剪出最大面积的矩形区域,以保留最相关特征。
- 阶段二使用分类器头对旋转后的区域进行分类并优化定位。
- 将预测的定向边界框映射回原始图像坐标,输出最终检测结果。
- 对非极大值抑制进行改进,考虑方向信息,有效减少对同一把枪的重复检测。
实验结果
研究问题
- RQ1与轴对齐基线方法相比,融合目标方向感知的两阶段目标检测框架是否能显著提升枪械检测的准确率?
- RQ2定向感知的非极大值抑制在解决重叠或近距离枪械检测问题上有多有效?
- RQ3在仅使用轴对齐真实标注进行训练的情况下,检测器在预测准确定向边界框方面具有多强的泛化能力?
- RQ4所提出方法在具有显著视觉差异的多样化真实世界枪械数据集上的表现如何?
- RQ5使用旋转区域建议对枪械的定位精确率与召回率有何影响?
主要发现
- OAOD模型在ITUF数据集上达到最先进性能,显著优于现有检测器在枪械定位方面的表现。
- 采用定向感知候选区域与定向边界框,相比轴对齐边界框,能实现更精确的定位。
- 两阶段设计使模型能够从轴对齐标注中学习,同时输出定向预测,提升泛化能力。
- 定向感知的非极大值抑制能有效减少对同一把枪的多次检测,尤其在密集场景中表现突出。
- ITUF数据集包含约11,000张人工标注的枪械图像,为枪械检测研究提供了强有力的基准。
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。