Dynamic Feature Learning and Matching for Class-Incremental Learning
Abstract: Class-incremental learning (CIL) has emerged as a means to learn new classes incrementally without catastrophic forgetting of previous classes. Recently, CIL has undergone a paradigm shift towards dynamic architectures due to their superior performance. However, these models are still limited in the following aspects: (i) Data augmentation (DA). DA, which is tightly coupled with CIL, remains under-explored in dynamic architecture scenarios. (ii) Feature representation. The discriminativeness of dynamic features is sub-optimal and leaves room for refinement. (iii) Classifier. The misalignment between dynamic features and the classifier constrains the capabilities of the model. To tackle these drawbacks, we propose the Dynamic Feature Learning and Matching (DFLM) model, which addresses all three aspects. Specifically, we first introduce class weight information and non-stationary functions to extend the mix DA method so that the focus on memory is adjusted dynamically during training. Then, a von Mises-Fisher (vMF) classifier is employed to effectively model the dynamic feature distribution and implicitly learn its discriminative properties. Finally, a matching loss is proposed to facilitate the alignment between the learned dynamic features and the classifier by minimizing the distribution distance. Extensive experiments on CIL benchmarks validate that our proposed model achieves significant performance improvements over existing methods.
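As a rough illustration of the vMF classifier idea mentioned in the abstract, the sketch below scores unit-normalized features against unit-normalized class mean directions. It assumes equal class priors and a single concentration parameter shared by all classes, in which case the vMF normalizing constant cancels in the softmax and the class posterior reduces to a scaled cosine softmax. The function names, the value of `kappa`, and the toy data are hypothetical; this is not the DFLM implementation, which the abstract does not specify.

```python
import numpy as np


def l2_normalize(v, axis=-1, eps=1e-12):
    """Project vectors onto the unit hypersphere (eps guards against zero norm)."""
    norm = np.linalg.norm(v, axis=axis, keepdims=True)
    return v / np.maximum(norm, eps)


def vmf_class_probs(features, class_means, kappa=16.0):
    """Class posteriors under vMF class-conditionals with a shared concentration.

    Assumption (not from the paper): every class uses the same kappa and the
    same prior, so the vMF normalizer is identical across classes and drops
    out of the softmax, leaving logits of the form kappa * cos(feature, mean).
    """
    x = l2_normalize(features)       # shape (n, d)
    mu = l2_normalize(class_means)   # shape (c, d)
    logits = kappa * (x @ mu.T)      # shape (n, c)
    # Subtract the row-wise maximum for a numerically stable softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)


# Toy data: two 2-D features and two class mean directions (hypothetical).
features = np.array([[2.0, 0.1], [0.2, 3.0]])
class_means = np.array([[1.0, 0.0], [0.0, 1.0]])

probs = vmf_class_probs(features, class_means)
predictions = probs.argmax(axis=1)
```

A larger `kappa` corresponds to class distributions that are more tightly concentrated around their mean directions, which sharpens the resulting posteriors. Class-specific concentrations would require keeping the normalizing constant, which involves a modified Bessel function and is omitted here.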