Parametric Classification for Generalized Category Discovery: A Baseline Study (2211.11727v4)
Abstract: Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples. Previous studies argued that parametric classifiers are prone to overfitting to seen categories, and endorsed using a non-parametric classifier formed with semi-supervised k-means. However, in this study, we investigate the failure of parametric classifiers, verify the effectiveness of previous design choices when high-quality supervision is available, and identify unreliable pseudo-labels as a key problem. We demonstrate that two prediction biases exist: the classifier tends to predict seen classes more often, and produces an imbalanced distribution across seen and novel categories. Based on these findings, we propose a simple yet effective parametric classification method that benefits from entropy regularisation, achieves state-of-the-art performance on multiple GCD benchmarks and shows strong robustness to unknown class numbers. We hope the investigation and proposed simple framework can serve as a strong baseline to facilitate future studies in this field. Our code is available at: https://github.com/CVMI-Lab/SimGCD.
- k-means++: the advantages of careful seeding. In ACM-SIAM Symposium on Discrete Algorithms, 2007.
- Self-labelling via simultaneous clustering and representation learning. In ICLR, 2020.
- Masked siamese networks for label-efficient learning. In ECCV, 2022.
- Semi-supervised learning of visual features by non-parametrically predicting view assignments with support samples. In ICCV, 2021.
- Mixmatch: A holistic approach to semi-supervised learning. In NeurIPS, 2019.
- Open-world semi-supervised learning. In ICLR, 2022.
- Deep Clustering for Unsupervised Learning of Visual Features. In ECCV, 2018.
- Unsupervised learning of visual features by contrasting cluster assignments. In nips, 2020.
- Emerging properties in self-supervised vision transformers. In ICCV, 2021.
- A simple framework for contrastive learning of visual representations. In ICML, 2020.
- Semi-supervised learning under class distribution mismatch. In AAAI, 2020.
- Picie: Unsupervised semantic segmentation using invariance and equivariance in clustering. In CVPR, 2021.
- Discriminability-transferability trade-off: An information-theoretic perspective. In ECCV, 2022.
- Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
- An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR, 2021.
- Xcon: Learning with experts for fine-grained category discovery. In BMVC, 2022.
- A unified objective for novel class discovery. In ICCV, 2021.
- Unsupervised representation learning by predicting image rotations. In ICLR, 2018.
- Bootstrap your own latent-a new approach to self-supervised learning. In NeurIPS, 2020.
- Safe deep semi-supervised learning for unseen-class unlabeled data. In ICML, 2020.
- Automatically discovering and learning new visual categories with ranking statistics. In ICLR, 2020.
- Autonovel: Automatically discovering and learning novel visual categories. IEEE TPAMI, 2021.
- Learning to discover novel visual categories via deep transfer clustering. In ICCV, 2019.
- Momentum contrast for unsupervised visual representation learning. In CVPR, 2020.
- Deep residual learning for image recognition. In CVPR, 2016.
- Billion-scale similarity search with GPUs. IEEE Transactions on Big Data, 2019.
- Supervised contrastive learning. In NeurIPS, 2020.
- 3d object representations for fine-grained categorization. In ICCV Workshops, 2013.
- Learning multiple layers of features from tiny images. Technical Report, 2009.
- James MacQueen. Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1967.
- Fine-grained visual classification of aircraft. arXiv preprint arXiv:1306.5151, 2013.
- Long-tail learning via logit adjustment. In ICLR, 2021.
- Realistic evaluation of deep semi-supervised learning algorithms. In NeurIPS, 2018.
- Semi-supervised learning with scarce annotations. In CVPR Workshops, 2020.
- Balanced meta-softmax for long-tailed visual recognition. In NeurIPS, 2020.
- Class-incremental novel class discovery. In ECCV, 2022.
- Openmatch: Open-set semi-supervised learning with open-set consistency regularization. In NeurIPS, 2021.
- Fixmatch: Simplifying semi-supervised learning with consistency and confidence. In NeurIPS, 2020.
- Opencon: Open-world contrastive learning. TMLR, 2023.
- The herbarium challenge 2019 dataset. arXiv preprint arXiv:1906.05372, 2019.
- Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In NeurIPS, 2017.
- Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. JMLR, 2008.
- Generalized category discovery. In CVPR, 2022.
- Open-set recognition: A good closed-set classifier is all you need? In ICLR, 2022.
- Normface: L2 hypersphere embedding for face verification. In ACM MM, 2017.
- Long-tailed recognition by routing diverse distribution-aware experts. In ICLR, 2021.
- Debiased learning from naturally imbalanced pseudo-labels. In CVPR, 2022.
- Caltech-ucsd birds 200. Technical Report CNS-TR-201, Caltech, 2010.
- Self-supervised visual representation learning with semantic grouping. In NeurIPS, 2022.
- Multi-task curriculum framework for open-set semi-supervised learning. In ECCV, 2020.
- S4l: Self-supervised semi-supervised learning. In ICCV, 2019.
- Novel visual category discovery with dual ranking statistics and mutual knowledge distillation. In NeurIPS, 2021.
- Incremental generalized category discovery. In ICCV, 2023.
- Distilling visual priors from self-supervised learning. In ECCV Workshops, 2020.
- Learning semi-supervised gaussian mixture models for generalized category discovery. In ICCV, 2023.
- Neighborhood contrastive learning for novel class discovery. In CVPR, 2021.
- Openmix: Reviving known knowledge for discovering novel visual categories in an open world. In CVPR, 2021.