- The paper introduces a Class Rectification Loss (CRL) method that leverages incremental hard sample mining to address imbalanced training data.
- It outperforms the previous state-of-the-art LMLE approach by 2% in average accuracy on the CelebA dataset and by 4% on the X-Domain dataset.
- The CRL framework is also more computationally efficient, training roughly three times faster than LMLE, and it avoids the pitfalls of traditional over- and under-sampling techniques.
Class Rectification Hard Mining for Imbalanced Deep Learning
The paper, authored by Qi Dong, Shaogang Gong, and Xiatian Zhu, presents a deep learning framework designed to address the challenges of imbalanced training data, which are particularly prevalent in large-scale facial and clothing attribute recognition. The central contribution is a Class Rectification Loss (CRL), integrated into a deep learning model to counter the bias that majority classes introduce in imbalanced datasets.
Methodology
The authors introduce incremental hard sample mining at the batch level. Within each training batch, learning concentrates on the hard positives and hard negatives of the minority classes, counteracting the dominant influence of well-represented majority classes. The CRL term is the cornerstone of this approach: by enforcing margins on these mined hard samples, it ensures that minority classes are adequately learned at their sparsely sampled class boundaries.
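The per-batch mining step can be illustrated with a small sketch. This is an assumed, simplified form of the idea rather than the authors' exact implementation (the paper defines several CRL variants): it identifies the minority class within a batch, mines the hardest positives (minority samples farthest from their class mean) and hardest negatives (other samples closest to it), and applies a triplet-style margin between them.

```python
import math

def class_rectification_loss(features, labels, margin=0.5, k=2):
    """Sketch of a CRL-style batch loss (illustrative, not the paper's code).

    Mines the k hardest positives and k hardest negatives for the batch's
    minority class, then penalizes negatives that are not at least `margin`
    farther from the minority-class center than the positives.
    """
    # Identify the minority class within this batch.
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    minority = min(counts, key=counts.get)

    pos = [x for x, y in zip(features, labels) if y == minority]
    neg = [x for x, y in zip(features, labels) if y != minority]
    dim = len(pos[0])
    center = [sum(x[d] for x in pos) / len(pos) for d in range(dim)]

    def dist(x):
        return math.sqrt(sum((x[d] - center[d]) ** 2 for d in range(dim)))

    # Hard positives: minority samples farthest from their class center.
    hard_pos = sorted(dist(x) for x in pos)[-k:]
    # Hard negatives: majority samples closest to the minority center.
    hard_neg = sorted(dist(x) for x in neg)[:k]

    # Triplet-style rectification: hard negatives should lie at least
    # `margin` farther from the minority center than hard positives.
    total = sum(max(0.0, margin + dp - dn)
                for dp in hard_pos for dn in hard_neg)
    return total / (len(hard_pos) * len(hard_neg))
```

Because the mining is repeated per batch, the set of hard samples is refreshed incrementally as the representation evolves, which is what allows the class boundaries to be rectified continually during training.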
Experimental Design and Results
Two large-scale imbalanced datasets were employed to demonstrate the efficacy of the proposed framework: CelebA and X-Domain. The CRL model achieved higher accuracy than existing models on both facial and clothing attribute recognition tasks. In particular, CRL outperformed the Large Margin Local Embedding (LMLE) method, the previous state of the art, by 2% in average accuracy on CelebA and by 4% on X-Domain. The authors attribute this gain to CRL's continual refinement of class boundaries and margins, which mitigates the negative impact of imbalanced data.
Moreover, the CRL model demonstrated computational efficiency, training roughly three times faster than LMLE, making it practical for large-scale applications. The paper notes that classical re-sampling approaches carry inherent costs: over-sampling can introduce noise and overfitting through duplicated minority examples, while under-sampling discards valuable majority-class data. CRL circumvents both pitfalls by focusing on minority classes directly within the learning process, without altering the training distribution.
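The contrast with re-sampling baselines is easy to make concrete. The toy function below (an illustration of the classical techniques the paper argues against, not anything from the paper itself) rebalances a dataset either by duplicating minority samples or by discarding majority ones, making both pitfalls visible:

```python
import random

def rebalance(samples, labels, mode="over", seed=0):
    """Toy illustration of classical re-sampling baselines.

    mode="over":  duplicate minority samples up to the majority count
                  (repeated points invite overfitting / amplified noise).
    mode="under": discard majority samples down to the minority count
                  (valuable data is simply thrown away).
    """
    rng = random.Random(seed)
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)

    sizes = [len(v) for v in by_class.values()]
    target = max(sizes) if mode == "over" else min(sizes)

    out = []
    for y, xs in by_class.items():
        if len(xs) < target:
            # Over-sampling: pad the class with duplicated samples.
            xs = xs + [rng.choice(xs) for _ in range(target - len(xs))]
        elif len(xs) > target:
            # Under-sampling: keep only a random subset.
            xs = rng.sample(xs, target)
        out.extend((x, y) for x in xs)
    return out
```

CRL sidesteps this trade-off entirely: the training set is left untouched, and rebalancing happens inside the loss via hard samples mined per batch.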
Implications and Future Directions
The implications of this research are far-reaching in the field of computer vision, particularly in applications involving multi-label recognition from imbalanced datasets. The authors have set a precedent for more refined approaches to deep learning in scenarios where class imbalance is a significant hurdle. This has potential applications in advancing artificial intelligence systems that require detailed attribute recognition from visual data, ranging from security and surveillance to customized online retail experiences.
Future work may explore the extension of CRL and its hard mining strategy to other learning paradigms and more diverse datasets, examining its scalability beyond clothing and facial attributes. Additionally, further research is warranted to assess the integration of CRL with other forms of discriminative feature learning or generative adversarial networks for enriched model robustness.
In conclusion, the paper makes a valuable contribution to the domain of imbalanced deep learning, offering a scalable, efficient, and robust methodology for attribute recognition in highly imbalanced datasets. This work is a significant step forward in addressing the persistent challenge of data imbalance in machine learning, paving the way for more equitable learning models.