Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Identity Overlap Between Face Recognition Train/Test Data: Causing Optimistic Bias in Accuracy Measurement (2405.09403v1)

Published 15 May 2024 in cs.CV

Abstract: A fundamental tenet of pattern recognition is that overlap between training and testing sets causes an optimistic accuracy estimate. Deep CNNs for face recognition are trained for N-way classification of the identities in the training set. Accuracy is commonly estimated as average 10-fold classification accuracy on image pairs from test sets such as LFW, CALFW, CPLFW, CFP-FP and AgeDB-30. Because train and test sets have been independently assembled, images and identities in any given test set may also be present in any given training set. In particular, our experiments reveal a surprising degree of identity and image overlap between the LFW family of test sets and the MS1MV2 training set. Our experiments also reveal identity label noise in MS1MV2. We compare accuracy achieved with same-size MS1MV2 subsets that are identity-disjoint and not identity-disjoint with LFW, to reveal the size of the optimistic bias. Using more challenging test sets from the LFW family, we find that the size of the optimistic bias is larger for more challenging test sets. Our results highlight the lack of and the need for identity-disjoint train and test methodology in face recognition research.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (35)
  1. Partial FC: training 10 million identities on a single machine. In ICCVW, pages 1445–1449. IEEE, 2021.
  2. L. Bottou. Large-scale machine learning with stochastic gradient descent. In Proceedings of COMPSTAT’2010: 19th International Conference on Computational StatisticsParis France, August 22-27, 2010 Keynote, Invited and Contributed Papers, pages 177–186. Springer, 2010.
  3. Vggface2: A dataset for recognising faces across pose and age. In International Conference on Automatic Face & Gesture Recognition, pages 67–74. IEEE Computer Society, 2018.
  4. Transface: Calibrating transformer training for face recognition from a data-centric perspective. In ICCV, pages 20642–20653, 2023.
  5. Arcface: Additive angular margin loss for deep face recognition. In CVPR, pages 4690–4699. Computer Vision Foundation / IEEE, 2019.
  6. Lightweight face recognition challenge. In ICCV Workshops, pages 2638–2646. IEEE, 2019.
  7. Ms-celeb-1m: A dataset and benchmark for large-scale face recognition. In B. Leibe, J. Matas, N. Sebe, and M. Welling, editors, ECCV, volume 9907 of Lecture Notes in Computer Science, pages 87–102. Springer, 2016.
  8. Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In Workshop on faces in’Real-Life’Images: detection, alignment, and recognition, 2008.
  9. Curricularface: Adaptive curriculum learning loss for deep face recognition. In CVPR, pages 5900–5909. Computer Vision Foundation / IEEE, 2020.
  10. I. Hupont and C. F. Tena. Demogpairs: Quantifying the impact of demographic imbalance in deep face recognition. In International Conference on Automatic Face & Gesture Recognition, pages 1–7. IEEE, 2019.
  11. Adaface: Quality adaptive margin for face recognition. In CVPR, pages 18729–18738. IEEE, 2022.
  12. Groupface: Learning latent groups and constructing group-based representations for face recognition. In CVPR, pages 5621–5630, 2020.
  13. Pushing the frontiers of unconstrained face detection and recognition: IARPA janus benchmark A. In CVPR, pages 1931–1939. IEEE Computer Society, 2015.
  14. Sphereface: Deep hypersphere embedding for face recognition. In CVPR, pages 6738–6746. IEEE Computer Society, 2017.
  15. IARPA janus benchmark - C: face dataset and protocol. In International Conference on Biometrics, pages 158–165. IEEE, 2018.
  16. Magface: A universal representation for face recognition and quality assessment. In CVPR, pages 14225–14234. Computer Vision Foundation / IEEE, 2021.
  17. Agedb: The first manually collected, in-the-wild age database. In CVPR Workshops, pages 1997–2005. IEEE Computer Society, 2017.
  18. Deep face recognition. In BMVC. British Machine Vision Association, 2015.
  19. Face recognition: Too bias, or not too bias? In CVPR Workshops, pages 1–10. Computer Vision Foundation / IEEE, 2020.
  20. Double trouble? impact and detection of duplicates in face image datasets. arXiv preprint arXiv:2401.14088, 2024.
  21. Frontal to profile face verification in the wild. In WACV, pages 1–9. IEEE Computer Society, 2016.
  22. Circle loss: A unified perspective of pair similarity optimization. In CVPR, pages 6397–6406. Computer Vision Foundation / IEEE, 2020.
  23. Mlfw: A database for face recognition on masked faces. arXiv preprint arXiv:2109.05804, 2021.
  24. Normface: L22{}_{\mbox{2}}start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT hypersphere embedding for face verification. In Q. Liu, R. Lienhart, H. Wang, S. K. Chen, S. Boll, Y. P. Chen, G. Friedland, J. Li, and S. Yan, editors, Multi-Media, pages 1041–1049. ACM, 2017.
  25. Cosface: Large margin cosine loss for deep face recognition. In CVPR, pages 5265–5274. Computer Vision Foundation / IEEE Computer Society, 2018.
  26. IARPA janus benchmark-b face dataset. In CVPR Workshops, pages 592–600. IEEE Computer Society, 2017.
  27. Face recognition in unconstrained videos with matched background similarity. In CVPR 2011, pages 529–534. IEEE, 2011.
  28. Consistency and accuracy of celeba attribute values. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2023.
  29. H. Wu and K. W. Bowyer. What should be balanced in a” balanced” face recognition dataset. In BMVC, volume 1, page 2, 2023.
  30. Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.
  31. T. Zheng and W. Deng. Cross-pose lfw: A database for studying cross-pose face recognition in unconstrained environments. Beijing University of Posts and Telecommunications, Tech. Rep, 5(7), 2018.
  32. Cross-age lfw: A database for studying cross-age face recognition in unconstrained environments. arXiv preprint arXiv:1708.08197, 2017.
  33. Y. Zhong and W. Deng. Towards transferable adversarial attack against deep face recognition. IEEE Transactions on Information Forensics and Security, 2020.
  34. Uniface: Unified cross-entropy loss for deep face recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 20730–20739, 2023.
  35. Webface260m: A benchmark for million-scale deep face recognition. PAMI, 45(2):2627–2644, 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Haiyu Wu (22 papers)
  2. Sicong Tian (5 papers)
  3. Jacob Gutierrez (2 papers)
  4. Aman Bhatta (9 papers)
  5. Kevin W. Bowyer (50 papers)
  6. Kağan Öztürk (1 paper)
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com