QUICK REVIEW

[論文レビュー] Going Deeper Into Face Detection: A Survey

Shervin Minaee, Ping Luo|arXiv (Cornell University)|Mar 27, 2021

Face recognition and analysis参考文献 86被引用数 51

ひとこと要約

ディープラーニングベースの顔検出手法、アーキテクチャ、データセット、ベンチマーク、課題についての包括的な調査であり、モデルカテゴリ別に整理されている。

ABSTRACT

Face detection is a crucial first step in many facial recognition and face analysis systems. Early approaches for face detection were mainly based on classifiers built on top of hand-crafted features extracted from local image regions, such as Haar Cascades and Histogram of Oriented Gradients. However, these approaches were not powerful enough to achieve a high accuracy on images of from uncontrolled environments. With the breakthrough work in image classification using deep neural networks in 2012, there has been a huge paradigm shift in face detection. Inspired by the rapid progress of deep learning in computer vision, many deep learning based frameworks have been proposed for face detection over the past few years, achieving significant improvements in accuracy. In this work, we provide a detailed overview of some of the most representative deep learning based face detection methods by grouping them into a few major categories, and present their core architectural designs and accuracies on popular benchmarks. We also describe some of the most popular face detection datasets. Finally, we discuss some current challenges in the field, and suggest potential future research directions.

研究の動機と目的

深層学習時代の始まりから現在までのディープラーニングベースの顔検出の進展をレビューする。
主要なモデルファミリ（Cascade-CNN、RCNN/Faster-RCNN、SSD、FPN、Transformers、その他）を分類し、比較する。
広く用いられている顔検出データセットと評価ベンチマークを要約する。
現在の課題を議論し、将来の研究の潜在的方向性を提案する。

提案手法

既存の深層顔検出手法を、アーキテクチュラルな貢献に基づいて主要カテゴリーに分類する。
バックボーンとコアとなるアーキテクチャのアイデア（CNNs、R-CNN、SSD、FPN、GANs など）の概要を提供する。
各カテゴリ内の主要モデルとその主な技術的貢献を要約する。
ディープフェイス検出の人気のあるベンチマークと評価指標を概説する。
制約のない設定での課題を議論し、将来の研究方向を提案する。

実験結果

リサーチクエスチョン

RQ1顔検出に用いられる主なディープラーニングカテゴリとそのコアとなるアーキテクチャのアイデアは何か？
RQ2バックボーンアーキテクチャ、損失関数、およびトレーニング戦略は、標準的な顔検出ベンチマークでの性能にどのような影響を与えるか？
RQ3現在の顔検出研究を支配するデータセットと評価指標は何か、そして残るギャップは何か？
RQ4野外での顔検出における主要な課題と、それに対処する潜在的な方向性は何か？

主な発見

本調査は、複数のアーキテクチャカテゴリにまたがる50を超えるディープフェイス検出モデルを網羅している。
トレーニングデータ、ネットワークバックボーン、損失関数、トレーニング戦略が性能向上にどう寄与しているかの洞察を提供する。
FDDBやWider Faceなどのデータセットの傾向や、人気のあるベンチマークと指標について論じる。
姿勢、スケール、照明、遮蔽などの変動といった課題を強調し、トランスフォーマーベースの手法やコンテキストモデリングといった方向性を示唆する。
Cascade-CNN および R-CNN ベースの手法からシングルショット検出器と特徴ピラミッドネットワークへの進化を強調し、最近のトランスフォーマーベースのアイデアにも注目する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。