FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPU
Abstract: Computing power has evolved into a foundational and indispensable resource in the area of deep learning, particularly in tasks such as Face Recognition (FR) model training on large-scale datasets, where multiple GPUs are often a necessity. Recognizing this challenge, some FR methods have started exploring ways to compress the fully-connected layer in FR models. Unlike other approaches, our observations reveal that without prompt scheduling of the learning rate (LR) during FR model training, the loss curve tends to exhibit numerous stationary subsequences. To address this issue, we introduce a novel LR scheduler leveraging Exponential Moving Average (EMA) and Haar Convolutional Kernel (HCK) to eliminate stationary subsequences, resulting in a significant reduction in converging time. However, the proposed scheduler incurs a considerable computational overhead due to its time complexity. To overcome this limitation, we propose FastFace, a fast-converging scheduler with negligible time complexity, i.e. O(1) per iteration, during training. In practice, FastFace is able to accelerate FR model training to a quarter of its original time without sacrificing more than 1% accuracy, making large-scale FR training feasible even with just one single GPU in terms of both time and space complexity. Extensive experiments validate the efficiency and effectiveness of FastFace. The code is publicly available at: https://github.com/amoonfana/FastFace
- Killing two birds with one stone: Efficient and robust training of face recognition cnns by partial FC. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 4032–4041, 2022.
- Elasticface: Elastic margin loss for deep face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW, pages 1577–1586, 2022.
- Arcface: Additive angular margin loss for deep face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 4690–4699, 2019.
- Lightweight face recognition challenge. In IEEE/CVF International Conference on Computer Vision Workshops, ICCVW, pages 2638–2646, 2019.
- Sub-center arcface: Boosting face recognition by large-scale noisy web faces. In Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm, editors, European Conference on Computer Vision, ECCV, volume 12356, pages 741–757, 2020.
- Masked face recognition challenge: The insightface track report. In IEEE/CVF International Conference on Computer Vision Workshops, ICCVW, pages 1437–1444, 2021.
- Variational prototype learning for deep face recognition. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR, pages 11906–11915, 2021.
- Uniformface: Learning deep equidistributed representation for face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 3415–3424, 2019.
- Ms-celeb-1m: A dataset and benchmark for large-scale face recognition. In European Conference on Computer Vision, ECCV, volume 9907 of Lecture Notes in Computer Science, pages 87–102, 2016.
- Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR, pages 770–778, 2016.
- Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In University of Massachusetts, Technical Report, 2007.
- Curricularface: Adaptive curriculum learning loss for deep face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 5900–5909, 2020.
- Adaface: Quality adaptive margin for face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 18729–18738, 2022.
- Dynamic class queue for large scale face recognition in the wild. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 3763–3772, 2021.
- Virtual fully-connected layer: Training a large-scale face recognition dataset with limited computational resources. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 13315–13324, 2021.
- Large-margin softmax loss for convolutional neural networks. In Maria-Florina Balcan and Kilian Q. Weinberger, editors, International Conference on Machine Learning, ICML, volume 48, pages 507–516, 2016.
- Sphereface: Deep hypersphere embedding for face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 6738–6746, 2017.
- IARPA janus benchmark - C: face dataset and protocol. In International Conference on Biometrics, ICB, pages 158–165, 2018.
- Magface: A universal representation for face recognition and quality assessment. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 14225–14234, 2021.
- Agedb: The first manually collected, in-the-wild age database. In IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW, pages 1997–2005, 2017.
- Frontal to profile face verification in the wild. In IEEE Winter Conference on Applications of Computer Vision (WACV), pages 1–9, 2016.
- The megaface benchmark: 1 million faces for recognition at scale. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 4873–4882, 2016.
- Circle loss: A unified perspective of pair similarity optimization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 6397–6406, 2020.
- Additive margin softmax for face verification. In International Conference on Learning Representations Workshops, ICLRW, 2018.
- Cosface: Large margin cosine loss for deep face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 5265–5274, 2018.
- An efficient training approach for very large scale face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 4073–4082, 2022.
- IARPA janus benchmark-b face dataset. In IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW, pages 592–600, 2017.
- Learning face representation from scratch. CoRR, abs/1411.7923, 2014.
- Webface260m: A benchmark unveiling the power of million-scale deep face recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, pages 10492–10502, 2021.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.