Rethinking the Domain Gap in Near-infrared Face Recognition (2312.00627v1)
Abstract: Heterogeneous face recognition (HFR) involves the intricate task of matching face images across the visual domains of visible (VIS) and near-infrared (NIR). While much of the existing literature on HFR identifies the domain gap as a primary challenge and directs efforts towards bridging it at either the input or feature level, our work deviates from this trend. We observe that large neural networks, unlike their smaller counterparts, when pre-trained on large scale homogeneous VIS data, demonstrate exceptional zero-shot performance in HFR, suggesting that the domain gap might be less pronounced than previously believed. By approaching the HFR problem as one of low-data fine-tuning, we introduce a straightforward framework: comprehensive pre-training, succeeded by a regularized fine-tuning strategy, that matches or surpasses the current state-of-the-art on four publicly available benchmarks. Corresponding codes can be found at https://github.com/michaeltrs/RethinkNIRVIS.
- Human and machine recognition of faces: A survey. Proceedings of the IEEE, 83(5):705–741, 1995.
- Knowledge distillation with the reused teacher classifier. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11933–11942, June 2022.
- Learning mappings for face synthesis from near infrared to visual light images. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 156–163, 2009.
- MobileFacenets: Efficient CNNs for accurate real-time face verification on mobile devices. In Chinese Conference on Biometric Recognition, 2018.
- Learning a similarity metric discriminatively, with application to face verification. In IEEE Conference on Computer Vision and Pattern Recognition, 2005.
- The buaa-visnir face database instructions. volume School Comput Sci Eng, Beihang Univ, Beijing, China, Tech Rep IRIP-TR-12-FR-001, 2012.
- Retinaface: Single-shot multi-level face localisation in the wild. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2020.
- ArcFace: Additive angular margin loss for deep face recognition. In IEEE Conference on Computer Vision and Pattern Recognition, 2019.
- Lightweight face recognition challenge. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, Oct 2019.
- Dual variational generation for low shot heterogeneous face recognition. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
- Dvg-face: Dual variational generation for heterogeneous face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1–1, 2021.
- Heterogeneous face recognition: A common encoding feature discriminant approach. IEEE Transactions on Image Processing, 26(5):2079–2089, 2017.
- MS-Celeb-1M: A dataset and benchmark for large-scale face recognition. In European Conference on Computer Vision, 2016.
- Deep pyramidal residual networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 6307–6315, 2017.
- Adversarial cross-spectral face completion for nir-vis face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(5):1025–1037, 2020.
- Heterogeneous face recognition: Recent advances in infrared-to-visible matching. pages 883–890, 05 2017.
- Orthogonal modality disentanglement and representation alignment network for nir-vis face recognition. IEEE Transactions on Circuits and Systems for Video Technology, 32(6):3630–3643, 2022.
- Dual face alignment learning network for nir-vis face recognition. IEEE Transactions on Circuits and Systems for Video Technology, 32(4):2411–2424, 2022.
- Fine-tuning can distort pretrained features and underperform out-of-distribution. In International Conference on Learning Representations, 2022.
- The casia nir-vis 2.0 face database. In 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 348–353, 2013.
- Sphereface: Deep hypersphere embedding for face recognition. In IEEE Conference on Computer Vision and Pattern Recognition, 2017.
- Physically-based face rendering for nir-vis face recognition. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors, Advances in Neural Information Processing Systems, volume 35, pages 22752–22764. Curran Associates, Inc., 2022.
- A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22(10):1345–1359, 2010.
- Dlface: Deep local descriptor for cross-modality face recognition. Pattern Recognition, 90:161–171, 2019.
- Estimation of visible spectrum faces from polarimetric thermal faces. In 2016 IEEE 8th International Conference on Biometrics Theory, Applications and Systems (BTAS), pages 1–7, 2016.
- Deep perceptual mapping for cross-modal face recognition. International Journal of Computer Vision 122, 426–438 (2017).
- Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 815–823, 2015.
- Facenet: A unified embedding for face recognition and clustering. In IEEE Conference on Computer Vision and Pattern Recognition, 2015.
- Infrared face recognition: A comprehensive review of methodologies and databases. Pattern Recognition, 47(9):2807–2824, 2014.
- Hybrid deep learning for face verification. In IEEE International Conference on Computer Vision, 2013.
- Deepface: Closing the gap to human-level performance in face verification. In IEEE Conference on Computer Vision and Pattern Recognition, 2014.
- Additive margin softmax for face verification. IEEE Signal Processing Letters, 2018.
- Cosface: Large margin cosine loss for deep face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018.
- A discriminative feature learning approach for deep face recognition. In European Conference on Computer Vision, 2016.
- A light cnn for deep face representation with noisy labels. IEEE Transactions on Information Forensics and Security, 13(11):2884–2896, 2018.
- Lamp-hq: A large-scale multi-pose high-quality database and benchmark for nir-vis face recognition. International Journal of Computer Vision, 2021.
- Synthesis of high-quality visible faces from polarimetric thermal faces using generative adversarial networks. International Journal of Computer Vision, 127(6-7):845–862, 2019.
- Face recognition: A literature survey. ACM computing surveys (CSUR), 35(4):399–458, 2003.
- Local-adaptive face recognition via graph-based meta-clustering and regularized adaptation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 20301–20310, June 2022.