Learn Convolutional Neural Network for Face Anti-Spoofing (1408.5601v2)

Published 24 Aug 2014 in cs.CV

Abstract: Though having achieved some progresses, the hand-crafted texture features, e.g., LBP [23], LBP-TOP [11] are still unable to capture the most discriminative cues between genuine and fake faces. In this paper, instead of designing feature by ourselves, we rely on the deep convolutional neural network (CNN) to learn features of high discriminative ability in a supervised manner. Combined with some data pre-processing, the face anti-spoofing performance improves drastically. In the experiments, over 70% relative decrease of Half Total Error Rate (HTER) is achieved on two challenging datasets, CASIA [36] and REPLAY-ATTACK [7] compared with the state-of-the-art. Meanwhile, the experimental results from inter-tests between two datasets indicates CNN can obtain features with better generalization ability. Moreover, the nets trained using combined data from two datasets have less biases between two datasets.

Citations (413)

View on Semantic Scholar

Summary

The paper presents a CNN-based approach that significantly reduces HTER by over 70% compared to traditional handcrafted feature methods.
The methodology employs extensive spatial and temporal data augmentation to capture robust discriminative cues in face images.
Experimental results on CASIA and REPLAY-ATTACK datasets validate improved intra-dataset performance while uncovering inter-dataset generalization challenges.

Convolutional Neural Networks for Robust Face Anti-Spoofing

The paper "Learn Convolutional Neural Network for Face Anti-Spoofing" by Jianwei Yang, Zhen Lei, and Stan Z. Li discusses a novel approach utilizing Convolutional Neural Networks (CNNs) for enhancing face anti-spoofing techniques. Traditionally, the field has relied on hand-crafted features such as Local Binary Patterns (LBP) and Histogram of Oriented Gradients (HOG) to detect spoofing in facial recognition systems. These methods, while effective to an extent, often struggle to generalize across diverse spoofing types and environmental conditions.

Methodology

The authors propose leveraging the feature extraction capabilities of CNNs to automatically derive discriminative features from face images, thus enhancing the robustness and efficacy of face anti-spoofing systems. The CNN architecture implemented in this paper follows a configuration similar to the one outlined in Krizhevsky et al. (2012), comprising five convolutional layers followed by three fully connected layers, with ReLU non-linearity and dropout layers for regularization.

Key to this paper is the use of data augmentation, both spatially and temporally. The former is achieved by enlarging the face images to include background regions, while the latter involves using multi-frame inputs to incorporate temporal cues. This strategy aims to harness additional contextual and dynamic information, improving the model's ability to distinguish between genuine and spoofed faces.

Experimental Results

The experiments were conducted on two key datasets: CASIA and REPLAY-ATTACK. These datasets include a variety of spoofing attacks realized through different methods such as print attacks, mask attacks, and digital replay attacks.

The CNN-based approach demonstrates substantial improvements, yielding over a 70% relative decrease in Half Total Error Rate (HTER) compared to state-of-the-art hand-crafted feature methods. Notably, the model achieves excellent performance in intra-dataset tests, with HTERs significantly reduced compared to existing benchmarks. For inter-dataset tests, the CNN approach still exhibits challenges due to dataset biases, but outperforms traditional methods effectively.

Implications and Future Work

The implications of this work are noteworthy, as it highlights CNN's potential to generalize across diverse datasets in face anti-spoofing tasks, which is critical in real-world applications where systems must reliably detect spoof attempts under varying conditions. The demonstration of data-driven feature learning providing significant advantages over manual feature design suggests a direction for future antifraud systems.

This research opens several avenues for future exploration. Primarily, addressing the dataset bias in inter-dataset evaluations is crucial for deployment in uncontrolled environments. Exploring advanced transfer learning methods could facilitate the adaptation of these models to new datasets with minimal retraining. Moreover, integrating additional biometric cues such as motion and 3D shape information into the CNN framework may provide further robustness against sophisticated spoofing attempts.

In summary, the paper underscores the promising capabilities of CNNs in face anti-spoofing, establishing a benchmark for future research in developing more adaptive and generalizable biometric security systems.

PDF Markdown