QUICK REVIEW

[論文レビュー] Region Proposal by Guided Anchoring

Jiaqi Wang, Kai Chen|arXiv (Cornell University)|Jan 10, 2019

Advanced Neural Network Applications参考文献 36被引用数 59

ひとこと要約

本論文は、Guided Anchoring (GA) を用いてアンカーの位置と形状を共同予測し、アンカー数を大幅に減らしてリコールを高め、Fast R-CNN、Faster R-CNN、RetinaNet の検出性能を向上させる。

ABSTRACT

Region anchors are the cornerstone of modern object detection techniques. State-of-the-art detectors mostly rely on a dense anchoring scheme, where anchors are sampled uniformly over the spatial domain with a predefined set of scales and aspect ratios. In this paper, we revisit this foundational stage. Our study shows that it can be done much more effectively and efficiently. Specifically, we present an alternative scheme, named Guided Anchoring, which leverages semantic features to guide the anchoring. The proposed method jointly predicts the locations where the center of objects of interest are likely to exist as well as the scales and aspect ratios at different locations. On top of predicted anchor shapes, we mitigate the feature inconsistency with a feature adaption module. We also study the use of high-quality proposals to improve detection performance. The anchoring scheme can be seamlessly integrated into proposal methods and detectors. With Guided Anchoring, we achieve 9.1% higher recall on MS COCO with 90% fewer anchors than the RPN baseline. We also adopt Guided Anchoring in Fast R-CNN, Faster R-CNN and RetinaNet, respectively improving the detection mAP by 2.2%, 2.7% and 1.2%. Code will be available at https://github.com/open-mmlab/mmdetection.

研究の動機と目的

denseで均一なアンカーへの依存を減らす動機付けとして、物体が存在し得る場所とそれらの場所での形状を予測する。
2-branch のアンカー予測モジュール（位置と形状）を開発して、スパースで適応的なアンカーを生成する。
位置特異的なアンカー形状と特徴を整列させるアンカーガイド型特徴適応を導入する。
高品質な GA 提案が二段階および単段検出器を改善できるかを調査し、実用的な微調整ワークフローを提供する。

提案手法

オブジェクト中心位置を予測する2枝アンカー生成モジュール（p(x,y|FI)）と、アンカーの位置依存的な形状（w,h）を予測する。
形状を w=σ·s·exp(dw) および h=σ·s·exp(dh) として予測し、変換で学習を安定化させる。
レベル間で共有されたアンカー生成パラメータを用いたマルチレベル（FPN）実装を採用する。
予測されたアンカー形状に特徴を合わせるため、変形可能畳み込みを用いたアンカーガイド型特徴適応を適用する。
標準の分類損失と回帰損失（L_cls, L_reg）に加え、アンカー局在損失（L_loc）とアンカー形状損失（L_shape）を含む多タスク損失で学習する。
GA-RPN が RPN より90%少ないアンカーで高いリコールを提供するユースケースを示し、GA を Fast R-CNN、Faster R-CNN、RetinaNet に組み込んだ場合の改善を実証する。

実験結果

リサーチクエスチョン

RQ1オブジェクトの中心と形状を同時に予測することで、スパースで非均一な方法でアンカーを効果的に生成できるか？
RQ2専用の適応モジュールを介して予測アンカー形状に特徴を合わせることは検出精度を改善するか？
RQ3GAベースの提案が従来の密なアンカーと比べて二段階・単段検出器にどのような影響を与えるか？
RQ4高品質な GA 提案は既存の検出器の学習と微調整を改善できるか？

主な発見

Method	Backbone	AR100	AR300	AR1000	AR_S	AR_M	AR_L	runtime(s)/img
RPN	ResNet-50-FPN	47.5	54.7	59.4	31.7	55.1	64.6	0.09
GA-RPN	ResNet-50-FPN	59.2	65.2	68.5	40.9	67.8	79.0	0.13

GA-RPN は COCO 上で密な RPN ベースラインよりアンカーを90%削減してもリコールを9.1%高める。
アンカーを GA に置換することで mAP が GA-Fast-RCNN で2.2%、GA-Faster-RCNN で2.7%、GA-RetinaNet で1.2%向上。
アンカーの位置と形状の予測と特徴適応の組み合わせが、リコールの大幅な向上と効率化をもたらす。
GA-RPN 提案を用いたファインチューニングスキームにより、訓練済み Faster R-CNN を3エポックで AP 37.4% から 39.6% に向上させることができる。
高品質の GA 提案を IoU 閾値の高い設定で使用することで、提案数を抑えて検出器の性能を改善することができ、少ないサンプルでの学習時にも効果的である。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。