QUICK REVIEW

[論文レビュー] VISER: Visually-Informed System for Enhanced Robustness in Open-Set Iris Presentation Attack Detection

Byron Dowling, Eleanor Frederick|arXiv (Cornell University)|Mar 18, 2026

Biometric Identification and Security被引用数 0

ひとこと要約

VISERは、どの人間の顕著性タイプ（手作業アノテーション、目の追跡、セグメンテーション）と基底モデルの埋め込みがオープンセット虹彩PADを改善するかを調査し、ノイズ除去されたい初期の目追跡顕著性が交差エントロピーのベースラインより最も大きな改善をもたらすことを示す。

ABSTRACT

Human perceptual priors have shown promise in saliency-guided deep learning training, particularly in the domain of iris presentation attack detection (PAD). Common saliency approaches include hand annotations obtained via mouse clicks and eye gaze heatmaps derived from eye tracking data. However, the most effective form of human saliency for open-set iris PAD remains underexplored. In this paper, we conduct a series of experiments comparing hand annotations, eye tracking heatmaps, segmentation masks, and DINOv2 embeddings to a state-of-the-art deep learning-based baseline on the task of open-set iris PAD. Results for open-set PAD in a leave-one-attack-type out paradigm indicate that denoised eye tracking heatmaps show the best generalization improvement over cross entropy in terms of Area Under the ROC curve (AUROC) and Attack Presentation Classification Error Rate (APCER) at Bona Fide Presentation Classification Error Rate (BPCER) of 1%. Along with this paper, we offer trained models, code, and saliency maps for reproducibility and to facilitate follow-up research efforts.

研究の動機と目的

人間の顕著性の形態（手作業アノテーション、目追跡、セグメンテーション）がオープンセット虹彩PADで最も良い一般化を提供するかを評価する。
人間の顕著性ガイド付きモデルをオープンセット虹彩PADで基盤モデルの埋め込みと比較する。
目追跡ヒートマップのデノイジング手法を導入し、顕著性ガイド付きトレーニングを改善する。
再現性と追跡研究を支援するデータ、コード、顕著性マップを提供する。

提案手法

DenseNet-121 ベースの D-NetPAD をオープンセット虹彩PADのバックボーンとして使用する。
モデルを、ターゲット顕著性と一致するCAMを持つ顕著性損失で補強したクロスエントロピー損失で訓練する（Boydらの手法に基づく）。
セグメンテーションマスク、エントロピーが異なる手作業アノテーション、目追跡ヒートマップ（全フェーズと初期フェーズ）、およびHDBSCANを用いたデノイズ変種の評価を行う。
基盤モデルの埋め込み（DINOv2-Base）をロジスティック回帰、SVM（線形）、SVM（RBF）と分類器と共にテストする。
AUROCとBPCER=1%でのAPCERを用いて性能を評価し、XENTベースラインに対する改善を報告する。

Figure 2 : Example of applying HDBSCAN to de-noise the eye tracking data. Clusters made up of valid fixations are distinguished by color and larger marks denote longer fixations, not fixation area. Black crosses indicate fixations marked as noise excluded from the final saliency map.

実験結果

リサーチクエスチョン

RQ1目追跡顕著性は、オープンセット虹彩PADにおいて未見の攻撃タイプへ手作業アノテーションやセグメンテーションより一般化が良いか。
RQ2顕著性ガイド付きトレーニングは、基盤モデルの埋め込みを用いる現代的なPADソリューションより優れているか。
RQ3目追跡の視線マップのデノイジングは顕著性ガイド付きトレーニングの性能を改善するか。

主な発見

Method	Printout	Diseased	Post Mortem	Synthetic	Contacts + Print	Textured Contact	Artificial	Average Δ
Baseline (DenseNet XENT)	0.000	0.000	0.000	0.000	0.000	0.000	0.000	0.000
Segmentation Masks \| Baseline Saliency-based	+0.0087	-0.0052	+0.0385	-0.0511	-0.0212	+0.0345	+0.0232	+0.0039
Hand Annotations \| High Entropy	-0.0184	+0.0520	+0.0073	-0.0712	-0.0123	-0.0050	+0.0175	-0.0043
Hand Annotations \| Equal Entropy	-0.0010	+0.0847	-0.0098	-0.0122	-0.0021	+0.0274	+0.0277	+0.0164
Hand Annotations \| Low Entropy	-0.0001	-0.0077	+0.0294	-0.0160	-0.0384	+0.0140	-0.0157	-0.0049
Eye Tracking \| Full
+0.0473	+0.0282	-0.0468	+0.0941	-0.0155	+0.0474	+0.2387	+0.0562
Eye Tracking \| Initial
+0.0513	+0.0192	-0.0155	+0.0236	-0.0481	+0.0650	+0.2672	+0.0518
De-noised Full ET	+0.0604	+0.0345	-0.0820	+0.0621	-0.0782	+0.0597	+0.2438	+0.0429
De-noised Initial ET	+0.0627	+0.0574	-0.0645	+0.1090	-0.0453	+0.0372	+0.2692	+0.0608
Foundation Model (No Saliency) \| DINOv2 + LogReg	-0.0326	-0.1079	+0.0378	-0.0340	+0.0086	-0.0327	+0.1520	-0.0013
Foundation Model (No Saliency) \| DINOv2 + SVM-Linear	-0.0523	-0.1247	-0.0033	-0.0613	+0.0017	-0.0351	+0.1737	-0.0145
Foundation Model (No Saliency) \| DINOv2 + SVM-RBF	-0.0104	+0.0023	+0.1796	+0.1121	-0.0159	-0.0448	+0.1167	+0.0485

目追跡顕著性、特にデノイズされた初期目追跡は、XENTベースラインに対して最大のAUROC改善をもたらす（約 +0.061）、およびBPCER 1%での最大のAPCER改善をもたらす（約 +0.1063）。
デノイズ付き初期の目追跡マップは、他の顕著性タイプやベースラインを複数の攻撃カテゴリで上回る；セグメンテーションマスクと手作業アノテーションは一般にAPCER@BPCER1%でベースラインを上回らない。
DINOv2を用いた基盤モデルは混在した結果を示し、AUROCをわずかに改善するのはDINOv2+SVM-RBFのみで、他のバリアントは一貫してベースラインを超えない。
デノイジングの利点は初期の目追跡で顕著であり（AUROC +0.0608、APCER Δ +0.1063）、全体の目追跡では一貫性が低い（AUROC +0.0429、APCER Δ +0.0857）。
総じて、本研究設定内では、目追跡ベースの顕著性ガイド付きトレーニングが基盤モデルベースのアプローチよりもオープンセット虹彩PADで優れている。

(a) Segmentation Mask of the Iris Region

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。