Bayesian Learning-driven Prototypical Contrastive Loss for Class-Incremental Learning (2405.11067v2)
Abstract: The primary objective of methods in continual learning is to learn tasks in a sequential manner over time from a stream of data, while mitigating the detrimental phenomenon of catastrophic forgetting. In this paper, we focus on learning an optimal representation between previous class prototypes and newly encountered ones. We propose a prototypical network with a Bayesian learning-driven contrastive loss (BLCL) tailored specifically for class-incremental learning scenarios. Therefore, we introduce a contrastive loss that incorporates new classes into the latent representation by reducing the intra-class distance and increasing the inter-class distance. Our approach dynamically adapts the balance between the cross-entropy and contrastive loss functions with a Bayesian learning technique. Empirical evaluations conducted on both the CIFAR-10 and CIFAR-100 dataset for image classification and images of a GNSS-based dataset for interference classification validate the efficacy of our method, showcasing its superiority over existing state-of-the-art approaches.
- Rainbow Memory: Continual Learning with a Memory of Diverse Samples. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, June 2021.
- Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, June 2022.
- Continual Lifelong Learning in Natural Language Processing: A Survey. In Proc. of the Intl. Conf. on Computational Linguistics, pages 6523–6541, Barcelona, Spain, Dec. 2020.
- Multimodal Learning for Reliable Interference Classification in GNSS Signals. In Proc. of the Intl. Technical Meeting of the Satellite Division of the Institute of Navigation (ION GNSS+), pages 3210–3234, Denver, CO, Sept. 2022.
- A Dendrite Method for Cluster Analysis. In Communications in Statistics, volume 3(1), pages 1–28, Sept. 1972.
- Co2LsuperscriptCo2L\text{Co}^{2}\text{L}Co start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT L: Contrastive Continual Learning. In Intl. Conf. on Computer Vision (ICCV), Montreal, QC, Oct. 2021.
- Sy-CON: Symmetric Contrastive Loss for Continual Self-Supervised Representation Learning. In arXiv preprint arXiv:2306.05101, June 2023.
- A Simple Framework for Contrastive Learning of Visual Representations. In Intl. Conf. on Machine Learning (ICML), volume 149, pages 1597–1607, July 2020.
- Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-identification. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, July 2017.
- Learning with Augmented Class by Exploiting Unlabeled Data. In Proc. of the Intl. Conf. on Artificial Intelligence (AAAI), volume 28, 2014.
- A Cluster Separation Measure. In IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), volume PAMI-1(2), pages 224–227, Apr. 1979.
- ImageNet: A Large-scale Hierarchical Image Database. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), Miami, FL, June 2009.
- A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, June 2019.
- DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. In Journal for Machine Learning Research (JMLR), volume 32(1), pages 647–655, 2014.
- PODNet: Pooled Outputs Distillation for Small-Tasks Incremental Learning. In IEEE/CVF Europ. Conf. on Computer Vision (ECCV), volume 12365, pages 86–102, Nov. 2020.
- DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 9285–9295, 2022.
- Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 580–587, 2014.
- On Clustering Validation Techniques. In Journal of Intelligent Information Systems, volume 17, pages 107–145, Dec. 2001.
- Bayesian Graph Contrastive Learning. In arXiv preprint arXiv:2112.07823, Aug. 2022.
- Deep Residual Learning for Image Recognition. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, June 2016.
- MHTN: Modal-Adversarial Hybrid Transfer Network for Cross-Modal Retrieval. In Trans. on Cybernetics, volume 50(3), pages 1047–1059, 2020.
- CLASSIC: Continual and Contrastive Learning of Aspect Sentiment Classification Tasks. In Intl. Conf. on Empirical Methods in Natural Language Processing (EMNLP), pages 6871–6883, Nov. 2021.
- What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? In Advances in Neural Information Processing Systems (NIPS), Long Beach, CA, 2017.
- Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), 2018.
- Overcoming Catastrophic Forgetting in Neural Networks. In Applied Mathematics, volume 114(13), pages 3521–3526, Mar. 2017.
- Uncertainty-aware Evaluation of Time-Series Classification for Online Handwriting Recognition with Domain Shift. In IJCAI-ECAI Intl. Workshop on Spatio-Temporal Reasoning and Learning (STRL), volume 3190, Vienna, Austria, July 2022.
- Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference. In Intl. Conf. on Learning Representations (ICLR), 2022.
- Alex Krizhevsky. Learning Multiple Layers of Features from Tiny Images. 2009.
- From N to N+1: Multiclass Transfer Incremental Learning. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), Portland, OR, June 2013.
- A Continual Learning Survey: Defying Forgetting in Classification Tasks. In IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), volume 44(7), July 2022.
- Catastrophic Interference in Neural Networks: Causes, Solutions, and Data. In F. N. Dempster & C. J. Brainerd (Eds.), Interference and Inhibition in Cognition, pages 329–361, 1995.
- Learning Without Forgetting. In IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), volume 40(12), pages 2935–2947, Dec. 2018.
- Bayesian Contrastive Learning with Manifold Regularization for Self-Supervised Skeleton Based Action Recognition. In IEEE Intl. Symposium on Circuits and Systems (ISCAS), Monterey, CA, May 2023.
- Bayesian Self-Supervised Contrastive Learning. In arXiv preprint arXiv:2301.11673, Jan. 2024.
- RMM: Reinforced Memory Management for Class-Incremental Learning. In Advances in Neural Information Processing Systems (NIPS), 2021.
- Gradient Episodic Memory for Continual Learning. In Advances in Neural Information Processing Systems (NIPS), Long Beach, CA, 2017.
- Online Continual Learning in Image Classification: An Empirical Survey. In arXiv preprint arXiv:2101.10423, Oct. 2021.
- Class-Incremental Learning: Survey and Performance Evaluation on Image Classification. In IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), volume 45(5), May 2023.
- Few-Shot Learning with Uncertainty-based Quadruplet Selection for Interference Classification in GNSS Data. In Intl. Conf. on Localization and GNSS (ICL GNSS), Feb. 2024.
- Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition. In IEEE Access, volume 11, pages 94148–94172, Aug. 2023.
- Machine Learning-assisted GNSS Interference Monitoring Through Crowdsourcing. In Proc. of the Intl. Technical Meeting of the Satellite Division of the Institute of Navigation (ION GNSS+), pages 1151–1175, Denver, CO, Sept. 2022.
- A New Approach to Cross-Modal Multimedia Retrieval. In Proc. of the ACM Intl. Conf. on Multimedia (ACMMM), pages 251–260, Oct. 2010.
- R. Ratcliff. Connectionist Models of Recognition Memory: Constraints Imposed by Learning and Forgetting Functions. In Psychological Review, volume 97(2), pages 285–308, Apr. 1997.
- iCaRL: Incremental Classifier and Representation Learning. In IEEE Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 2011–2010, Honolulu, HI, July 2017.
- Experience Replay for Continual Learning. In Advances in Neural Information Processing Systems (NIPS), 2019.
- Adversarial Representation Learning for Text-to-Image Matching. In IEEE/CVF Intl. Conf. on Computer Vision (ICCV), pages 5814–5824, Seoul, Korea, 2019.
- FaceNet: A Unified Embedding for Face Recognition and Clustering. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), Boston, MA, June 2015.
- Prototypical Networks for Few-shot Learning. In Advances in Neural Information Processing Systems (NIPS), pages 4080–4090, Dec. 2017.
- Contrastive Domain Adaptation. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, June 2021.
- Brain-inspired Replay for Continual Learning with Artificial Neural Networks. In Nature Communications, volume 11(4069), Aug. 2020.
- Laurens van der Maaten and Geoffrey Hinton. Visualizing Data Using t-SNE. In Journal for Machine Learning Research (JMLR), volume 9(86), pages 2579–2605, Nov. 2008.
- Low-Cost COTS GNSS Interference Monitoring, Detection, and Classification System. In MDPI Sensors, volume 23(7), 3452, Mar. 2023.
- FOSTER: Feature Boosting and Compression for Class-Incremental Learning. In IEEE/CVF Europ. Conf. on Computer Vision (ECCV), volume 13685, pages 398–414, Oct. 2022.
- Learning to Prompt for Continual Learning. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 139–149, 2022.
- Cross-Modal Retrieval with CNN Visual Features: A New Baseline. In Trans. on Cybernetics, volume 47(2), pages 449–460, Mar. 2016.
- Large Scale Incremental Learning. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 374–382, 2019.
- General Incremental Learning with Domain-aware Categorical Representations. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 14351–14360, 2022.
- DER: Dynamically Expandable Representation for Class Incremental Learning. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 3014–3023, 2021.
- Maintaining Discrimination and Fairness in Class Incremental Learning. In IEEE/CVF Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pages 13205–13214, 2020.
- Deep Class-Incremental Learning: A Survey. In arXiv preprint arXiv:2302.03648, Feb. 2023.
- A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning. In arXiv preprint arXiv:2205.13218, Feb. 2023.
- Nisha L. Raichur (3 papers)
- Lucas Heublein (14 papers)
- Tobias Feigl (18 papers)
- Alexander Rügamer (6 papers)
- Christopher Mutschler (59 papers)
- Felix Ott (19 papers)