QUICK REVIEW

[论文解读] Region Proposal by Guided Anchoring

Jiaqi Wang, Kai Chen|arXiv (Cornell University)|Jan 10, 2019

Advanced Neural Network Applications参考文献 36被引用 59

一句话总结

本文提出 Guided Anchoring (GA)，在联合预测锚点的位置与形状的同时，加入锚点引导的特征自适应，在显著减少锚点数量的情况下获得更高的召回率，并提升 Fast R-CNN、Faster R-CNN 与 RetinaNet 的检测性能。

ABSTRACT

Region anchors are the cornerstone of modern object detection techniques. State-of-the-art detectors mostly rely on a dense anchoring scheme, where anchors are sampled uniformly over the spatial domain with a predefined set of scales and aspect ratios. In this paper, we revisit this foundational stage. Our study shows that it can be done much more effectively and efficiently. Specifically, we present an alternative scheme, named Guided Anchoring, which leverages semantic features to guide the anchoring. The proposed method jointly predicts the locations where the center of objects of interest are likely to exist as well as the scales and aspect ratios at different locations. On top of predicted anchor shapes, we mitigate the feature inconsistency with a feature adaption module. We also study the use of high-quality proposals to improve detection performance. The anchoring scheme can be seamlessly integrated into proposal methods and detectors. With Guided Anchoring, we achieve 9.1% higher recall on MS COCO with 90% fewer anchors than the RPN baseline. We also adopt Guided Anchoring in Fast R-CNN, Faster R-CNN and RetinaNet, respectively improving the detection mAP by 2.2%, 2.7% and 1.2%. Code will be available at https://github.com/open-mmlab/mmdetection.

研究动机与目标

动机在于减少对密集、统一锚点的依赖，通过预测物体可能出现的位置及在这些位置的形状。
开发一个两分支的锚点预测模块（位置与形状）以生成稀疏、自适应的锚点。
引入锚点引导的特征自适应，以使特征与位置特定的锚点形状对齐。
研究高质量的 GA 提议如何提升两阶段和单阶段检测器，并提供一个实用的微调工作流。

提出的方法

提出一个两分支锚点生成模块，用于预测物体中心位置 (p(x,y|FI)) 和与位置相关的锚点形状 (w,h)。
将形状预测为 w=σ·s·exp(dw) 和 h=σ·s·exp(dh)，通过变换来稳定学习。
采用多层次（FPN）实现，在各层之间共享锚点生成参数。
通过可变形卷积应用锚点引导的特征自适应，使特征与预测的锚点形状对齐。
训练时使用多任务损失，包括锚点定位损失 (L_loc) 和锚点形状损失 (L_shape)，同标准的分类和回归损失 (L_cls, L_reg)。
展示一个应用场景：GA-RPN 在比 RPN 少 90% 锚点的情况下提供更高的召回率，并展示将 GA 嵌入 Fast R-CNN、Faster R-CNN 和 RetinaNet 时的改进。

实验结果

研究问题

RQ1是否可以通过联合预测物体中心和形状，以稀疏、非均匀的方式有效生成锚点？
RQ2通过专用自适应模块将特征与预测的锚点形状对齐，是否能提高检测精度？
RQ3与传统的密集锚点相比，基于 GA 的提议对两阶段和单阶段检测器有何影响？
RQ4高质量的 GA 提议能否提升现有检测器的训练和微调效果？

主要发现

GA-RPN 在 COCO 上相比密集 RPN 基线，召回率提高 9.1%，锚点数量减少 90%。
用 GA 取代锚点使 mAP 提升 2.2%（GA-Fast-RCNN）、2.7%（GA-Faster-RCNN）和 1.2%（GA-RetinaNet）。
锚点位置与形状的预测，以及特征自适应，带来显著的召回增益和效率提升。
使用 GA-RPN 提议的微调方案，在 3 个 epoch 内可使已训练的 Faster R-CNN 的 AP 从 37.4% 提升到 39.6%。
使用高质量的 GA 提议并具有更高 IoU 阈值时，减少大量提议的需求并在较少样本训练时提升检测器性能。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。