- The paper introduces a novel range loss function that minimizes maximum intra-class distances while maximizing minimum inter-class distances.
- It leverages under-represented long-tail data in full, rather than truncating the tail, to learn more discriminative features from unbalanced datasets.
- Empirical results on LFW and YTF benchmarks show that models using range loss outperform those with traditional softmax and contrastive losses.
Overview of "Range Loss for Deep Face Recognition with Long-tail"
The paper proposes a new approach to the long-tail data distributions that pervade deep face recognition: a "range loss" function that improves the model's ability to extract discriminative features from datasets in which the vast majority of classes (identities) are under-represented. This is particularly relevant for face recognition, where the number of available samples per identity is severely imbalanced.
Key Contributions
The main contribution is the range loss function, an alternative to the conventional practice of balancing datasets by truncating the "tail" of the distribution: rather than cutting off rare classes, range loss uses all available data, tail included. The authors show that this enhances feature learning, minimizing intra-class variation while maximizing inter-class distances even on extremely unbalanced datasets.
- Range Loss Formula: Within each mini-batch, the range loss reduces the largest intra-class distances and increases the shortest inter-class distance (a minimal sketch follows this list). This is distinct from contrastive and triplet losses, which optimize distances over sampled pairs or triplets rather than class-wide statistics of the batch.
- Utilization of Long-tail Data: The authors study in depth how range loss exploits long-tail data. They find that truncating the tail improves performance up to a point, beyond which further pruning degrades it (a sketch of such a cutoff also follows this list). Range loss instead learns effectively from tail data that would otherwise be underutilized or discarded.
- Empirical Validation: Experiments on the challenging LFW and YTF face recognition benchmarks show that models employing range loss outperform those using softmax alone or softmax combined with contrastive loss, both in controlled settings and under the imbalance of real-world datasets.
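To make the formula concrete, here is a minimal PyTorch sketch of a range-style loss. All names, the margin value, and the use of `torch.cdist` are illustrative choices, not the authors' code; the paper's exact intra-class term harmonically averages the k largest ranges per class, while this sketch keeps only the single largest for brevity.

```python
import torch

def range_loss(features, labels, margin=2.0, alpha=1.0, beta=1.0):
    """Illustrative range-style loss over one mini-batch.

    Intra term: the largest pairwise distance within each class
    (the paper harmonically averages the k largest; omitted here).
    Inter term: a hinge pushing the shortest distance between
    class centers above `margin`.
    """
    zero = features.sum() * 0.0  # graph-connected zero for degenerate batches
    intra_terms, centers = [], []
    for c in labels.unique():
        feats_c = features[labels == c]
        centers.append(feats_c.mean(dim=0))
        if feats_c.size(0) > 1:
            d = torch.cdist(feats_c, feats_c)  # pairwise L2 distances
            intra_terms.append(d.max())        # largest intra-class range
    intra = torch.stack(intra_terms).mean() if intra_terms else zero
    centers = torch.stack(centers)
    if centers.size(0) > 1:
        dc = torch.cdist(centers, centers)
        dc = dc + torch.eye(dc.size(0), device=dc.device) * 1e9  # mask diagonal
        inter = torch.clamp(margin - dc.min(), min=0.0)  # hinge on closest centers
    else:
        inter = zero
    return alpha * intra + beta * inter
```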
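The truncation baseline that range loss is compared against can be pictured as a simple per-class cutoff. A hypothetical sketch (the function name and the `min_samples` threshold are ours, not the paper's):

```python
from collections import Counter

def truncate_tail(samples, labels, min_samples=20):
    """Drop all classes with fewer than `min_samples` examples,
    mimicking the tail-cutting strategy the paper argues against."""
    counts = Counter(labels)
    kept = [(s, l) for s, l in zip(samples, labels) if counts[l] >= min_samples]
    if not kept:
        return [], []
    kept_samples, kept_labels = zip(*kept)
    return list(kept_samples), list(kept_labels)
```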
Results
The reported numbers favor the proposed approach over traditional methods. Models trained with the combined range and softmax losses improve verification accuracy over a softmax-only baseline on both LFW and YTF, indicating stronger discriminative power and better generalization when the training data has a long tail.
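In training, the range term is used alongside the standard softmax (cross-entropy) objective rather than replacing it. A hedged sketch of the combined objective, reusing the `range_loss` sketch above (the weight `lambda_range` is illustrative; the paper tunes its own balance):

```python
import torch.nn.functional as F

def total_loss(logits, features, labels, lambda_range=0.01):
    # cross-entropy supervises identity classification;
    # the range term regularizes the feature embedding
    return F.cross_entropy(logits, labels) + lambda_range * range_loss(features, labels)
```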
Implications and Future Developments
The implications are significant for practitioners dealing with imbalanced datasets, in domains well beyond face recognition. The ability of range loss to exploit long-tail data makes it a candidate for wide adoption, from object recognition to rare-event detection. Future work may integrate range loss with other architectures and loss functions, or refine it to adapt dynamically to varying degrees of class imbalance.
Conclusion
In conclusion, the range loss function marks a valuable step forward for face recognition under challenging data conditions. By using the complete long-tail distribution, the method offers a promising way around the limitations of imbalanced datasets. Since long-tail distributions are ubiquitous in real-world data, this advance holds promise for broad applicability and motivates further work on loss formulations that exploit data distribution characteristics.