
Data Uncertainty Learning in Face Recognition (2003.11339v1)

Published 25 Mar 2020 in cs.CV

Abstract: Modeling data uncertainty is important for noisy images, but seldom explored for face recognition. The pioneer work, PFE, considers uncertainty by modeling each face image embedding as a Gaussian distribution. It is quite effective. However, it uses fixed feature (mean of the Gaussian) from an existing model. It only estimates the variance and relies on an ad-hoc and costly metric. Thus, it is not easy to use. It is unclear how uncertainty affects feature learning. This work applies data uncertainty learning to face recognition, such that the feature (mean) and uncertainty (variance) are learnt simultaneously, for the first time. Two learning methods are proposed. They are easy to use and outperform existing deterministic methods as well as PFE on challenging unconstrained scenarios. We also provide insightful analysis on how incorporating uncertainty estimation helps reducing the adverse effects of noisy samples and affects the feature learning.

Citations (250)

Summary

  • The paper introduces data uncertainty learning in face recognition by modeling face embeddings as Gaussian distributions that capture both mean and variance.
  • It proposes both classification-based (DUL_cls) and regression-based (DUL_rgs) methods that integrate KL-divergence and variance-weighted loss functions for enhanced feature learning.
  • Experimental results on benchmarks like IJB-C show superior robustness, with DUL_cls achieving 88.25% TPR at 0.001% FPR compared to deterministic models.

Data Uncertainty Learning in Face Recognition: A Comprehensive Analysis

The paper "Data Uncertainty Learning in Face Recognition" presents a novel approach to integrating data uncertainty learning (DUL) into face recognition systems. Traditional face recognition methods often represent each face image as a deterministic point embedding in the latent space, which can be suboptimal when dealing with noisy images. This research seeks to address this limitation by modeling each face image embedding as a Gaussian distribution, thereby capturing both feature (mean) and uncertainty (variance) simultaneously.
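To make the idea concrete, the network can predict a mean and a (log-)variance per image and draw a stochastic embedding via the reparameterization trick, so gradients flow through both heads. The NumPy sketch below is a minimal illustration of this sampling step, not the paper's architecture; the embedding dimension and all numeric values are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_embedding(mu, log_var, rng):
    """Draw a stochastic embedding z = mu + sigma * eps (reparameterization trick),
    so that during training gradients can flow through both mu and log_var."""
    sigma = np.exp(0.5 * log_var)
    eps = rng.standard_normal(mu.shape)
    return mu + sigma * eps

# Hypothetical 4-D embedding predicted by two network heads for one face image.
mu = np.array([0.5, -1.2, 0.3, 2.0])
log_var = np.array([-2.0, -2.0, -1.0, -3.0])  # low variance => confident sample

z = sample_embedding(mu, log_var, rng)
print(z.shape)  # (4,)
```

As the predicted variance shrinks toward zero, the sampled embedding collapses onto the mean, recovering a deterministic point embedding as a special case.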

Key Contributions

The authors propose two learning methodologies: a classification-based DUL (DUL_cls) and a regression-based DUL (DUL_rgs).

  1. DUL_cls: This method enhances a conventional face classification model by predicting a Gaussian distribution for each image's embedding. The model optimizes a joint loss that combines softmax loss with a KL-divergence regularization term, promoting feature compactness while incorporating data uncertainty.
  2. DUL_rgs: Aimed at refining existing models, DUL_rgs constructs a continuous target space using pre-trained model parameters. Leveraging regression with input-dependent noise, this method learns both the identity embeddings and their associated uncertainties. It introduces a novel loss function that weights errors against the predicted variance, thus dynamically adjusting the influence of noisy samples.

Experimental Evaluation

The efficacy of DUL is demonstrated through extensive experiments on multiple benchmarks, including LFW, MegaFace, CFP-FP, YTF, and IJB-C. The results indicate that both DUL_cls and DUL_rgs consistently outperform traditional deterministic models and surpass the probabilistic face embedding method (PFE) on challenging unconstrained scenarios.

  • Notably, DUL_cls models, trained on MS-Celeb-1M, achieve remarkable improvements in scenarios with significant noise, such as the IJB-C benchmark, where robustness is crucial.
  • Numerical results: DUL_cls achieves a TPR of 88.25% at a 0.001% FPR on IJB-C, surpassing the baseline and PFE models.

Implications and Future Work

The introduction of DUL in face recognition has several implications:

  • Enhanced Robustness: By modeling uncertainty, DUL provides more reliable face representations, particularly in unconstrained environments with variable image quality.
  • Metric Compatibility: Unlike PFE, which requires the mutual likelihood score for matching, DUL's learned features can be evaluated directly with conventional similarity metrics such as cosine similarity, offering computational efficiency.
  • Quality Assessment: The predicted variance serves as an intrinsic measure of the embedding quality, which could obviate the need for separate image quality assessment models.
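The last two points can be illustrated with a small sketch: the learned means feed a conventional cosine similarity directly, and the predicted variance doubles as a rough quality proxy. The function names and all numeric values below are hypothetical, not from the paper:

```python
import numpy as np

def cosine_similarity(a, b):
    """Conventional matching metric applied directly to the learned means,
    with no need for PFE's mutual likelihood score."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def quality_score(log_var):
    """Illustrative quality proxy: lower average predicted variance means a
    more confident, likely higher-quality embedding."""
    return float(-np.mean(log_var))

# Two hypothetical identity embeddings (means of the predicted Gaussians).
mu_a = np.array([1.0, 0.0, 1.0])
mu_b = np.array([1.0, 0.1, 0.9])
print(cosine_similarity(mu_a, mu_b))

# A sharp, frontal face would typically receive lower predicted variance
# than a blurred one, so its quality score comes out higher.
sharp_log_var = np.array([-3.0, -3.0, -3.0])
blurry_log_var = np.array([-0.5, -0.5, -0.5])
print(quality_score(sharp_log_var) > quality_score(blurry_log_var))  # True
```

This is why the variance head could stand in for a separate image quality assessment model: the score falls out of the recognition network at no extra matching cost.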

Moving forward, the application of DUL might be extended to other computer vision tasks requiring resilience to data noise, such as object detection and semantic segmentation. Additionally, further research could refine the trade-off mechanisms within DUL models to balance model complexity with computational demands.

Conclusion

The paper offers significant advancements in face recognition by effectively incorporating data uncertainty into the learning process. This approach not only enhances the performance of face recognition systems on challenging datasets but also provides a framework for dealing with uncertainty in neural network predictions, paving the way for more robust AI systems.