QUICK REVIEW

[論文レビュー] Facial Emotion Recognition: State of the Art Performance on FER2013

Yousif Khaireddin, Zhuofa Chen|arXiv (Cornell University)|May 8, 2021

Emotion and Mood Recognition参考文献 37被引用数 158

ひとこと要約

本論文は、FER2013でファインチューニングされた VGGNet を用い、追加の学習データなしで単一ネットワークによる最高の精度(73.28%)を達成した。

ABSTRACT

Facial emotion recognition (FER) is significant for human-computer interaction such as clinical practice and behavioral description. Accurate and robust FER by computer models remains challenging due to the heterogeneity of human faces and variations in images such as different facial pose and lighting. Among all techniques for FER, deep learning models, especially Convolutional Neural Networks (CNNs) have shown great potential due to their powerful automatic feature extraction and computational efficiency. In this work, we achieve the highest single-network classification accuracy on the FER2013 dataset. We adopt the VGGNet architecture, rigorously fine-tune its hyperparameters, and experiment with various optimization methods. To our best knowledge, our model achieves state-of-the-art single-network accuracy of 73.28 % on FER2013 without using extra training data.

研究の動機と目的

顔の多様性と画像条件の異質性に起因する FER 性能の向上を動機づける。
慎重に調整された VGGNet が外部データなしで FER2013 で最先端の精度を達成できることを示す。
FER性能に対するハイパーパラメータの微調整と最適化戦略の影響を調査する。

提案手法

VGGNet アーキテクチャをバックボーンとして採用した。
ネットワークのハイパーパラメータの厳密な微調整。
様々な最適化手法を試みた。
単一ネットワークで高い性能を達成することに焦点を当てた。
FER2013 以外の追加の学習データの使用を避けた。

実験結果

リサーチクエスチョン

RQ1単一のCNNネットワークが外部データなしでFER2013における最先端の FER 性能を達成できるか？
RQ2FER2013 の精度に対するハイパーパラメータの微調整と最適化手法の影響は何か？
RQ3注意深い学習によってVGGNetはトップFER性能の実現可能なバックボーンか？

主な発見

単一ネットワークで FER2013 の精度 73.28% を達成。
追加データなしで最も高い単一ネットワーク FER2013 結果として報告。
ハイパーパラメータの微調整と最適化の選択が最先端の性能を生むことを検証。
適切に調整された場合、FER における VGGNet バックボーンの有効性を示した。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。