Gradient-Aware Logit Adjustment Loss for Long-tailed Classifier (2403.09036v1)
Abstract: In the real-world setting, data often follows a long-tailed distribution, where head classes contain significantly more training samples than tail classes. Consequently, models trained on such data tend to be biased toward head classes. The medium of this bias is imbalanced gradients, which include not only the ratio of scale between positive and negative gradients but also imbalanced gradients from different negative classes. Therefore, we propose the Gradient-Aware Logit Adjustment (GALA) loss, which adjusts the logits based on accumulated gradients to balance the optimization process. Additionally, We find that most of the solutions to long-tailed problems are still biased towards head classes in the end, and we propose a simple and post hoc prediction re-balancing strategy to further mitigate the basis toward head class. Extensive experiments are conducted on multiple popular long-tailed recognition benchmark datasets to evaluate the effectiveness of these two designs. Our approach achieves top-1 accuracy of 48.5\%, 41.4\%, and 73.3\% on CIFAR100-LT, Places-LT, and iNaturalist, outperforming the state-of-the-art method GCL by a significant margin of 3.62\%, 0.76\% and 1.2\%, respectively. Code is available at https://github.com/lt-project-repository/lt-project.
- “Causal interventional training for image recognition,” IEEE Transactions on Multimedia, pp. 1–1, 2021.
- “Long-tail learning via logit adjustment,” in International Conference on Learning Representations, 2021.
- “Class-balanced distillation for long-tailed visual recognition,” in BMVC, 2021.
- “Towards calibrated hyper-sphere representation via distribution overlap coefficient for long-tailed learning,” ECCV, 2022.
- “Decoupling representation and classifier for long-tailed recognition,” in International Conference on Learning Representations, 2020.
- “Balanced meta-softmax for long-tailed visual recognition,” Advances in neural information processing systems, vol. 33, pp. 4175–4186, 2020.
- “Focal loss for dense object detection,” 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2999–3007, 2017.
- “Distribution alignment: A unified framework for long-tail visual recognition,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 2361–2370.
- “Equalization loss for long-tailed object recognition,” 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11659–11668, 2020.
- “Long-tailed visual recognition via gaussian clouded logit adjustment,” in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 6919–6928.
- “Distilling the knowledge in a neural network,” ArXiv, vol. abs/1503.02531, 2015.
- “Large-scale long-tailed recognition in an open world,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 2537–2546.
- “The inaturalist species classification and detection dataset,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 8769–8778.
- “Improving calibration for long-tailed recognition,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 16489–16498.
- Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss, Curran Associates Inc., Red Hook, NY, USA, 2019.
- “Long-tailed classification by keeping the good and removing the bad momentum causal effect,” in Proceedings of the 34th International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 2020, NIPS’20, Curran Associates Inc.
- “Bbn: Bilateral-branch network with cumulative learning for long-tailed visual recognition,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 9719–9728.
- “Distilling virtual examples for long-tailed recognition,” 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 235–244, 2021.
- “Long-tailed recognition via weight balancing,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 6897–6907.
- “Distributional robustness loss for long-tail learning,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 9495–9504.
- “Constructing balance from imbalance for long-tailed image recognition,” in ECCV, 2022.
- Fan Zhang (686 papers)
- Wei Qin (68 papers)
- Weijieying Ren (11 papers)
- Lei Wang (975 papers)
- Zetong Chen (6 papers)
- Richang Hong (117 papers)