BaCon: Boosting Imbalanced Semi-supervised Learning via Balanced Feature-Level Contrastive Learning (2403.12986v2)
Abstract: Semi-supervised Learning (SSL) reduces the need for extensive annotations in deep learning, but the more realistic challenge of imbalanced data distribution in SSL remains largely unexplored. In Class Imbalanced Semi-supervised Learning (CISSL), the bias introduced by unreliable pseudo-labels can be exacerbated by imbalanced data distributions. Most existing methods address this issue at instance-level through reweighting or resampling, but the performance is heavily limited by their reliance on biased backbone representation. Some other methods do perform feature-level adjustments like feature blending but might introduce unfavorable noise. In this paper, we discuss the bonus of a more balanced feature distribution for the CISSL problem, and further propose a Balanced Feature-Level Contrastive Learning method (BaCon). Our method directly regularizes the distribution of instances' representations in a well-designed contrastive manner. Specifically, class-wise feature centers are computed as the positive anchors, while negative anchors are selected by a straightforward yet effective mechanism. A distribution-related temperature adjustment is leveraged to control the class-wise contrastive degrees dynamically. Our method demonstrates its effectiveness through comprehensive experiments on the CIFAR10-LT, CIFAR100-LT, STL10-LT, and SVHN-LT datasets across various settings. For example, BaCon surpasses instance-level method FixMatch-based ABC on CIFAR10-LT with a 1.21% accuracy improvement, and outperforms state-of-the-art feature-level method CoSSL on CIFAR100-LT with a 0.63% accuracy improvement. When encountering more extreme imbalance degree, BaCon also shows better robustness than other methods.
- ReMixMatch: Semi-Supervised Learning with Distribution Matching and Augmentation Anchoring. In International Conference on Learning Representations.
- Mixmatch: A holistic approach to semi-supervised learning. In Advances in Neural Information Processing Systems 32, 5050–5060.
- Unsupervised learning of visual features by contrasting cluster assignments. In Advances in Neural Information Processing Systems 33, virtual.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, 1597–1607.
- Exploring simple siamese representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 15750–15758.
- An Analysis of Single-Layer Networks in Unsupervised Feature Learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, volume 15 of JMLR Proceedings, 215–223.
- Cossl: Co-learning of representation and classifier for imbalanced semi-supervised learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 14574–14584.
- Semi-supervised learning objectives as log-likelihoods in a generative model of data curation. arXiv:2008.05913.
- Bootstrap your own latent-a new approach to self-supervised learning. In Advances in Neural Information Processing Systems 33, virtual.
- Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding. In International Conference on Machine Learning, 8082–8094.
- Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. In Proceedings of the thirteenth international conference on artificial intelligence and statistics, 297–304. JMLR Workshop and Conference Proceedings.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 9729–9738.
- Distribution Aligning Refinery of Pseudo-label for Imbalanced Semi-supervised Learning. In Advances in Neural Information Processing Systems 33, virtual.
- Learning Multiple Layers of Features from Tiny Images. Technical Report.
- Smoothed Adaptive Weighting for Imbalanced Semi-Supervised Learning: Improve Reliability Against Unknown Distribution Data. In International Conference on Machine Learning, volume 162 of Proceedings of Machine Learning Research, 11828–11843.
- Temporal Ensembling for Semi-Supervised Learning. In 5th International Conference on Learning Representations.
- Lee, D.-H.; et al. 2013. Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks. In Workshop on challenges in representation learning, ICML, 896. Atlanta.
- ABC: Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning. In Advances in Neural Information Processing Systems 34, virtual, 7082–7094.
- Comatch: Semi-supervised learning with contrastive graph regularization. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 9475–9484.
- Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(8): 1979–1993.
- Reading Digits in Natural Images with Unsupervised Feature Learning. In NIPS Workshop on Deep Learning and Unsupervised Feature Learning.
- DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 9776–9786.
- An Overview of Deep Semi-Supervised Learning. arXiv:2006.05278.
- FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. In Advances in Neural Information Processing Systems, Virtual.
- Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In Advances in Neural Information Processing Systems 30, 1195–1204.
- Contrastive multiview coding. In European Conference on Computer Vision, 776–794.
- Representation Learning with Contrastive Predictive Coding. arXiv:1807.03748.
- Visualizing data using t-SNE. Journal of machine learning research, 9(11).
- Debiased Learning from Naturally Imbalanced Pseudo-Labels. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 14627–14637.
- USB: A Unified Semi-supervised Learning Benchmark for Classification. In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track.
- FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net.
- Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4238–4247.
- CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021, 10857–10866.
- Unsupervised feature learning via non-parametric instance discrimination. In Proceedings of the IEEE conference on computer vision and pattern recognition, 3733–3742.
- Unsupervised Data Augmentation for Consistency Training. In Advances in Neural Information Processing Systems, Virtual.
- Rethinking the value of labels for improving class-imbalanced learning. In Advances in Neural Information Processing Systems 33, virtual.
- Wide Residual Networks. In Proceedings of the British Machine Vision Conference 2016.
- FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo Labeling. In Advances in Neural Information Processing, Virtual, 18408–18419.
- Simmatch: Semi-supervised learning with similarity matching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 14471–14481.
- Zhu, X. J. 2005. Semi-supervised learning literature survey. Technical Report.
- Qianhan Feng (4 papers)
- Lujing Xie (2 papers)
- Shijie Fang (11 papers)
- Tong Lin (24 papers)