QUICK REVIEW

[논문 리뷰] Improved Selective Refinement Network for Face Detection

Shifeng Zhang, Rui Zhu|arXiv (Cornell University)|2019. 01. 20.

Face recognition and analysis참고 문헌 55인용 수 32

한 줄 요약

이 논문은 데이터 증강, 더 강한 백본, MS COCO 사전학습, 분류 모듈의 분리, 세분화 분기, SE 블록을 통합하여 얼굴 탐지를 위한 Selective Refinement Network (SRN)을 다듬고 WIDER FACE에서 최첨단 성능을 달성합니다.

ABSTRACT

As a long-standing problem in computer vision, face detection has attracted much attention in recent decades for its practical applications. With the availability of face detection benchmark WIDER FACE dataset, much of the progresses have been made by various algorithms in recent years. Among them, the Selective Refinement Network (SRN) face detector introduces the two-step classification and regression operations selectively into an anchor-based face detector to reduce false positives and improve location accuracy simultaneously. Moreover, it designs a receptive field enhancement block to provide more diverse receptive field. In this report, to further improve the performance of SRN, we exploit some existing techniques via extensive experiments, including new data augmentation strategy, improved backbone network, MS COCO pretraining, decoupled classification module, segmentation branch and Squeeze-and-Excitation block. Some of these techniques bring performance improvements, while few of them do not well adapt to our baseline. As a consequence, we present an improved SRN face detector by combining these useful techniques together and obtain the best performance on widely used face detection benchmark WIDER FACE dataset.

연구 동기 및 목표

도전적인 WIDER FACE 벤치마크에서 작은 얼굴과 가려진 얼굴을 가진 상황에서 SRN 성능을 향상시키려는 동기를 제시한다.
SRN 성능에 대한 구조적 및 학습 개선의 영향을 조사한다.
SRN 기반에 결합될 때 어떤 기법들이 효과적이거나 비효과적인지 식별한다.

제안 방법

포토메트릭 왜곡, 임의 패치 자르기, 선택적 데이터 앵커 샘플링을 포함한 데이터 증강 전략을 채택한다.
ResNet-50을 Root-ResNet 기반 구조로 수정하고 DRN에서 영감을 받은 조정을 적용하여 백본을 개선한다.
수정된 백본을 MS COCO에서 프리트레인한 후 WIDER FACE에서 미세조정하며, 학습을 처음부터 가능하게 하기 위해 Group Normalization을 사용한다.
분리된 분류 모듈을 적용하고 성능 향상을 탐색하기 위해 세분화 분기와 SE 블록을 고려한다.
SRN 내에서 STC와 STR를 사용하고 다양한 수용 필드를 위한 RFE를 도입하며 최상위 탐지 결과와 NMS로 추론을 수행한다.
SGD로 학습하고, 특정 학습률 스케줄, 1024x1024의 대형 입력을 사용하며 5-에포크 워밍업과 총 260 에포크를 사용한다.

실험 결과

연구 질문

RQ1데이터 증강과 백본 개선이 WIDER FACE에서 SRN 성능을 크게 향상시킬 수 있는가?
RQ2MS COCO 사전학습, 분리된 분류, 세분화 감독 및 SE 블록이 Easy/Medium/Hard 하위 집합 전반에 걸쳐 SRN을 일관되게 개선하는가?
RQ3탐색된 기법들 중 어떤 것이 얼굴 탐지를 위한 SRN에 유익하고, 중립적이며, 해로운지 판단한다.

주요 결과

제안된 ISRN은 validation 및 testing 세트에서 Easy, Medium, Hard에 걸쳐 WIDER FACE에서 최첨단 평균 정밀도(AP)를 달성한다.
Validation APs: Easy 96.7%, Medium 95.8%, Hard 90.9%; Testing APs: Easy 96.3%, Medium 95.4%, Hard 90.3%.
STC/STR가 향상된 백본과 사전학습으로 이득에 기여하며 특히 작은 얼굴(Hard 하위집합)에 대해 효과적이다.
일부 기법들(예: 세분화 분기, SE 블록)은 구성에 따라 이 기준선에서 항상 성능을 향상시키지 못할 수 있다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.