QUICK REVIEW

[論文レビュー] Revisiting Feature Alignment for One-stage Object Detection

Yuntao Chen, Chenxia Han|arXiv (Cornell University)|Aug 5, 2019

Advanced Neural Network Applications参考文献 22被引用数 63

ひとこと要約

この論文は、1段検出器におけるバウンディングボックスと特徴を整列させる RoIConv を提案し、学習済みアンカーを使用する Fully Convolutional AlignDet を実現。これは速度を損なうことなく最先端の COCO 結果を達成する。

ABSTRACT

Recently, one-stage object detectors gain much attention due to their simplicity in practice. Its fully convolutional nature greatly reduces the difficulty of training and deployment compared with two-stage detectors which require NMS and sorting for the proposal stage. However, a fundamental issue lies in all one-stage detectors is the misalignment between anchor boxes and convolutional features, which significantly hinders the performance of one-stage detectors. In this work, we first reveal the deep connection between the widely used im2col operator and the RoIAlign operator. Guided by this illuminating observation, we propose a RoIConv operator which aligns the features and its corresponding anchors in one-stage detection in a principled way. We then design a fully convolutional AlignDet architecture which combines the flexibility of learned anchors and the preciseness of aligned features. Specifically, our AlignDet achieves a state-of-the-art mAP of 44.1 on the COCO test-dev with ResNeXt-101 backbone.

研究の動機と目的

1. 1段検出器におけるアンカー box と畳み込み特徴の不整合を特定する。
2. このギャップを埋める原理的な特徴整列手法を提案する。
3. 学習済みアンカーを正確な整列とともに活用する完全畳み込み検出器を開発する。
4. アンカー設計を最小限に抑えつつ COCO で最先端の性能を示す。

提案手法

im2col と RoIAlign の関係を明らかにして新しい RoIConv 演算子を動機づける。
アンカー誘起のオフセットを用いて全ての位置で特徴をアンカーと整列させる RoIConv を定義する。
Dense Proposal Module (DPM) と Aligned Detection Module (ADM) を備えた Fully Convolutional AlignDet を構築する。
RoIConv を ADM に組み込んで、1段検出器における正確でプラグアンドプレー可能な整列を実現する。
RoIConv のアブレーションを行い、素の conv、変形 conv、アンカー誘導変形 conv との比較を行う。

実験結果

リサーチクエスチョン

RQ1アンカーとバックボーン特徴の明示的な特徴整列は、速度を損なうことなく1段検出器を改善できるのか。
RQ2RoIConv はマルチスケール特徴を超える原理的で位置認識的な整列を提供するのか。
RQ3学習済みアンカーは整列とどのように相互作用し、COCO の検出精度にどのような影響を与えるのか。

主な発見

Model	Backbone	#Anchor	AP	AP50	APs	APm	APl
RetinaNet	ResNet-50 FPN	9	35.7	55.0	18.9	38.9	46.3
RetinaNet	ResNet-101 FPN	9	37.8	57.5	20.2	41.1	49.2
AlignDet w/o ADM	ResNet-50 FPN	1	32.4	52.9	17.5	35.9	43.0
AlignDet w/o ADM	ResNet-101 FPN	1	34.5	55.7	18.1	38.4	45.6
AlignDet	ResNet-50 FPN	1	37.9	57.7	21.5	41.1	50.8
AlignDet	ResNet-101 FPN	1	39.8	60.0	22.6	43.4	52.8

RoIConv は効果的で効率的な特徴整列を提供し、単一スケールの RetinaNet を 5.0 AP 向上させる。
ADM を備えた AlignDet は 1 アンカーで ResNet-50 FPN が 37.9 AP、ResNet-101 FPN が 39.8 AP を達成し、複数アンカーを用いた RetinaNet を上回る。
ResNet-101 FPN と 1 アンカーを用いた AlignDet は COCO minival で 39.8 AP および 52.8 APl に到達し、RetinaNet のベースラインを上回る。
ResNet バックボーンで RetinaNet に比べ minival の AP が 5.5–5.3 ポイント改善される。
より大きなカーネルと高い出力チャネルを持つ RoIConv 設計は AP を高め、7x7 および 1024 チャネルで顕著な向上を達成する。
COCO test-dev で ResNeXt-101-FPN バックボーンを用いた AlignDet は 44.1 AP を達成し、速度を維持しつつ RetinaNet を 3.3 AP 上回る。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。