TU Darmstadt / ULB / TUbiblio

Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

Fang, Meiling ; Damer, Naser ; Kirchbuchner, Florian ; Kuijper, Arjan (2022)
Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection.
IEEE/CVF Winter Conference on Applications of Computer Vision. virtual Conference (04.01.2022-08.01.2022)
doi: 10.1109/WACV51458.2022.00120
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

With the increased deployment of face recognition systems in our daily lives, face presentation attack detection (PAD) is attracting much attention and playing a key role in securing face recognition systems. Despite the great performance achieved by the hand-crafted and deep-learning based methods in intra-dataset evaluations, the performance drops when dealing with unseen scenarios. In this work, we propose a dual-stream convolution neural networks (CNNs) framework. One stream adapts four learnable frequency filters to learn features in the frequency domain, which are less influenced by variations in sensors/illuminations. The other stream leverages the RGB images to complement the features of the frequency domain. Moreover, we propose a hierarchical attention module integration to join the information from the two streams at different stages by considering the nature of deep features in different layers of the CNN. The proposed method is evaluated in the intra-dataset and cross-dataset setups, and the results demonstrate that our proposed approach enhances the generalizability in most experimental setups in comparison to state-of-the-art, including the methods designed explicitly for domain adaption/shift problems. We successfully prove the design of our proposed PAD solution in a stepwise ablation study that involves our proposed learnable frequency decomposition, our hierarchical attention module design, and the used loss function. Training codes and pre-trained models are publicly released.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2022
Autor(en): Fang, Meiling ; Damer, Naser ; Kirchbuchner, Florian ; Kuijper, Arjan
Art des Eintrags: Bibliographie
Titel: Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection
Sprache: Englisch
Publikationsjahr: 15 Februar 2022
Verlag: IEEE
Buchtitel: Proceedings: 2022 IEEE Winter Conference on Applications of Computer Vision: WACV 2022
Veranstaltungstitel: IEEE/CVF Winter Conference on Applications of Computer Vision
Veranstaltungsort: virtual Conference
Veranstaltungsdatum: 04.01.2022-08.01.2022
DOI: 10.1109/WACV51458.2022.00120
Kurzbeschreibung (Abstract):

With the increased deployment of face recognition systems in our daily lives, face presentation attack detection (PAD) is attracting much attention and playing a key role in securing face recognition systems. Despite the great performance achieved by the hand-crafted and deep-learning based methods in intra-dataset evaluations, the performance drops when dealing with unseen scenarios. In this work, we propose a dual-stream convolution neural networks (CNNs) framework. One stream adapts four learnable frequency filters to learn features in the frequency domain, which are less influenced by variations in sensors/illuminations. The other stream leverages the RGB images to complement the features of the frequency domain. Moreover, we propose a hierarchical attention module integration to join the information from the two streams at different stages by considering the nature of deep features in different layers of the CNN. The proposed method is evaluated in the intra-dataset and cross-dataset setups, and the results demonstrate that our proposed approach enhances the generalizability in most experimental setups in comparison to state-of-the-art, including the methods designed explicitly for domain adaption/shift problems. We successfully prove the design of our proposed PAD solution in a stepwise ablation study that involves our proposed learnable frequency decomposition, our hierarchical attention module design, and the used loss function. Training codes and pre-trained models are publicly released.

Freie Schlagworte: Biometrics, Face recognition, Deep learning, Spoofing attacks
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Graphisch-Interaktive Systeme
20 Fachbereich Informatik > Mathematisches und angewandtes Visual Computing
Hinterlegungsdatum: 03 Mär 2022 09:14
Letzte Änderung: 03 Mär 2022 09:14
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen