Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis (2403.06529v2)
Abstract: 2D face recognition encounters challenges in unconstrained environments due to varying illumination, occlusion, and pose. Recent studies focus on RGB-D face recognition to improve robustness by incorporating depth information. However, collecting sufficient paired RGB-D training data is expensive and time-consuming, hindering wide deployment. In this work, we first construct a diverse depth dataset generated by 3D Morphable Models for depth model pre-training. Then, we propose a domain-independent pre-training framework that utilizes readily available pre-trained RGB and depth models to separately perform face recognition without needing additional paired data for retraining. To seamlessly integrate the two distinct networks and harness the complementary benefits of RGB and depth information for improved accuracy, we propose an innovative Adaptive Confidence Weighting (ACW). This mechanism is designed to learn confidence estimates for each modality to achieve modality fusion at the score level. Our method is simple and lightweight, only requiring ACW training beyond the backbone models. Experiments on multiple public RGB-D face recognition benchmarks demonstrate state-of-the-art performance surpassing previous methods based on depth estimation and feature fusion, validating the efficacy of our approach.
- Challenges in determining the depth in 2-d images. In 2022 International Conference on INnovations in Intelligent SysTems and Applications (INISTA), pages 1–6. IEEE, 2022.
- A morphable model for the synthesis of 3d faces. In Seminal Graphics Papers: Pushing the Boundaries, Volume 2, pages 157–164. 1999.
- Mobilefacenets: Efficient cnns for accurate real-time face verification on mobile devices. In Biometric Recognition: 13th Chinese Conference, CCBR 2018, Urumqi, China, August 11-12, 2018, Proceedings 13, pages 428–438. Springer, 2018.
- High-accuracy rgb-d face recognition via segmentation-aware face depth estimation and mask-guided attention network. In 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021), pages 1–8, Dec. 2021.
- Rgb-d face recognition with identity-style disentanglement and depth augmentation. IEEE Transactions on Biometrics, Behavior, and Identity Science, pages 1–1, 2023.
- Rgb-d face recognition via learning-based reconstruction. In 2016 IEEE 8th International Conference on Biometrics Theory, Applications and Systems (BTAS), pages 1–7. IEEE, 2016.
- Improving 2d face recognition via discriminative face depth estimation. In 2018 International Conference on Biometrics (ICB), pages 140–147, Feb. 2018.
- Arcface: Additive angular margin loss for deep face recognition. IEEE TPAMI, 44:5962–5979, 2018.
- Learning confidence for out-of-distribution detection in neural networks. arXiv preprint arXiv:1802.04865, 2018.
- Morphable face models-an open framework. In 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pages 75–82. IEEE, 2018.
- Syed Zulqarnain Gilani. Learning from millions of 3d scans for large-scale 3d face recognition. In CVPR, pages 1896–1905, June 2018.
- On rgb-d face recognition using kinect. In 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS), pages 1–6. IEEE, 2013.
- Towards 3d face recognition in the real: a registration-free approach using fine-grained matching of 3d keypoint descriptors. IJCV, 113:128–142, 2015.
- Learning a model of facial shape and expression from 4d scans. ACM TOG, 36(6):194–1, 2017.
- High quality facial data synthesis and fusion for 3d low-quality face recognition. In 2021 IEEE International Joint Conference on Biometrics (IJCB), pages 1–8. IEEE, 2021.
- An efficient multimodal 2d-3d hybrid approach to automatic face recognition. IEEE TPAMI, 29(11):1927–1943, Nov. 2007.
- Led3d: A lightweight and efficient deep approach to recognizing low-quality 3d faces. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5773–5782, 2019.
- Face reconstruction from skull shapes and physical attributes. In Pattern Recognition: 31st DAGM Symposium, Jena, Germany, September 9-11, 2009. Proceedings 31, pages 232–241. Springer, 2009.
- Bosphorus database for 3d face analysis. In Biometrics and Identity Management: First European Workshop, BIOID 2008, Roskilde, Denmark, May 7-9, 2008. Revised Selected Papers 1, pages 47–56. Springer, 2008.
- Face recognition: a novel multi-level taxonomy based survey. IET Biometrics, 9(2):58–67, 2020.
- Depth as attention for face representation learning. IEEE Transactions on Information Forensics and Security, 16:2461–2476, 2021.
- Teacher-student adversarial depth hallucination to improve face recognition. In ICCV, pages 3671–3680, 2021.
- Two-level attention-based fusion learning for rgb-d face recognition. In ICPR, pages 10120–10127. IEEE, 2021.
- Improving rgb-d face recognition via transfer learning from a pretrained 2d network. In International Symposium on Benchmarking, Measuring and Optimization, pages 141–148. Springer, 2019.
- Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.
- Rgb-d face recognition via deep complementary and common feature learning. In 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pages 8–15. IEEE, 2018.
- Lock3dface: A large-scale database of low-cost kinect 3d faces. In 2016 International Conference on Biometrics (ICB), pages 1–8. IEEE, 2016.
- Lmfnet: A lightweight multiscale fusion network with hierarchical structure for low-quality 3-d face recognition. IEEE Transactions on Human-Machine Systems, 2022.
- Exploiting enhanced and robust rgb-d face representation via progressive multi-modal learning. Pattern Recognition Letters, 166:38–45, 2023.
- Zijian Chen (27 papers)
- Mei Wang (41 papers)
- Weihong Deng (71 papers)
- Hongzhi Shi (7 papers)
- Dongchao Wen (14 papers)
- Yingjie Zhang (30 papers)
- Xingchen Cui (4 papers)
- Jian Zhao (218 papers)