Papers
Topics
Authors
Recent
Search
2000 character limit reached

TADIL: Task-Agnostic Domain-Incremental Learning through Task-ID Inference using Transformer Nearest-Centroid Embeddings

Published 21 Jun 2023 in cs.LG and cs.CV | (2306.11955v1)

Abstract: Machine Learning (ML) models struggle with data that changes over time or across domains due to factors such as noise, occlusion, illumination, or frequency, unlike humans who can learn from such non independent and identically distributed data. Consequently, a Continual Learning (CL) approach is indispensable, particularly, Domain-Incremental Learning. In this paper, we propose a novel pipeline for identifying tasks in domain-incremental learning scenarios without supervision. The pipeline comprises four steps. First, we obtain base embeddings from the raw data using an existing transformer-based model. Second, we group the embedding densities based on their similarity to obtain the nearest points to each cluster centroid. Third, we train an incremental task classifier using only these few points. Finally, we leverage the lightweight computational requirements of the pipeline to devise an algorithm that decides in an online fashion when to learn a new task using the task classifier and a drift detector. We conduct experiments using the SODA10M real-world driving dataset and several CL strategies. We demonstrate that the performance of these CL strategies with our pipeline can match the ground-truth approach, both in classical experiments assuming task boundaries, and also in more realistic task-agnostic scenarios that require detecting new tasks on-the-fly

Definition Search Book Streamline Icon: https://streamlinehq.com
References (26)
  1. F. Zenke, B. Poole, and S. Ganguli, “Continual Learning through Synaptic Intelligence,” in Proc. 34th Int. Conf. on Machine Learning (ICML’17), ser. Proceedings of Machine Learning Research, vol. 70.   PMLR, Aug. 6–11 2017, pp. 3987–3995.
  2. D. Lopez-Paz and M. Ranzato, “Gradient Episodic Memory for Continual Learning,” in Advances in Neural Information Processing Systems, vol. 30 (NIPS 2017).   Curran Associates Inc., 2017, pp. 6470–6479.
  3. S.-A. Rebuffi, A. Kolesnikov, G. Sperl, and C. H. Lampert, “iCaRL: Incremental Classifier and Representation Learning,” in Proc. 2017 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR’17), Jul. 21–26 2017, pp. 5533–5542.
  4. R. Aljundi, F. Babiloni, M. Elhoseiny, M. Rohrbach, and T. Tuytelaars, “Memory Aware Synapses: Learning What (not) to Forget,” in Proc. European Conf. on Computer Vision, ECCV 2018.   Springer International Publishing, Sep. 8–14 2018, pp. 144–161.
  5. A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin, J. Clark, G. Krueger, and I. Sutskever, “Learning Transferable Visual Models From Natural Language Supervision,” in Proc. 38th Int. Conf. on Machine Learning (ICML’21), ser. Proceedings of Machine Learning Research, vol. 139.   PMLR, Jul. 18–24 2021, pp. 8748–8763.
  6. A. Prakash, K. Chitta, and A. Geiger, “Multi-Modal Fusion Transformer for End-to-End Autonomous Driving,” in Proc. 2021 IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR’21).   IEEE Computer Society, Jun. 19–25 2021, pp. 7073–7083.
  7. Z. Huang, X. Mo, and C. Lv, “Multi-modal Motion Prediction with Transformer-based Neural Network for Autonomous Driving,” in Proc. 39th Int. Conf. on Robotics and Automation (ICRA), May 23–27 2022, pp. 2605–2611.
  8. M. De Lange, R. Aljundi, M. Masana, S. Parisot, X. Jia, A. Leonardis, G. Slabaugh, and T. Tuytelaars, “A Continual Learning Survey: Defying Forgetting in Classification Tasks,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 44, no. 7, pp. 3366–3385, 2022.
  9. J. Kirkpatrick, R. Pascanu, N. Rabinowitz, J. Veness, G. Desjardins, A. A. Rusu, K. Milan, J. Quan, T. Ramalho, A. Grabska-Barwinska, D. Hassabis, C. Clopath, D. Kumaran, and R. Hadsell, “Overcoming Catastrophic Forgetting in Neural Networks,” Proceedings of the National Academy of Sciences, vol. 114, no. 13, pp. 3521–3526, Mar. 2017.
  10. G. I. Parisi, R. Kemker, J. L. Part, C. Kanan, and S. Wermter, “Continual Lifelong Learning with Neural Networks: A Review,” Neural Networks, vol. 113, pp. 54–71, 2019.
  11. D. Rolnick, A. Ahuja, J. Schwarz, T. Lillicrap, and G. Wayne, “Experience Replay for Continual Learning,” in Advances in Neural Information Processing Systems (NeurIPS 2019), vol. 32.   Curran Associates, Inc., 2019.
  12. M. Mirza, M. Masana, H. Possegger, and H. Bischof, “An Efficient Domain-Incremental Learning Approach to Drive in All Weather Conditions,” in Proc. 2022 IEEE/CVF Conf. on Computer Vision and Pattern Recognition Workshops (CVPRW’22), Jun. 19–20 2022, pp. 3000–3010.
  13. C. González, G. Sakas, and A. Mukhopadhyay, “What is wrong with continual learning in medical image segmentation?” CoRR, vol. abs/2010.11008, 2020. [Online]. Available: https://arxiv.org/abs/2010.11008
  14. J. Xie, S. Yan, and X. He, “General Incremental Learning with Domain-aware Categorical Representations,” in Proc. 2022 IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR’22).   IEEE Computer Society, Jun. 21–24 2022, pp. 14 331–14 340.
  15. H. Zhu, M. Majzoubi, A. Jain, and A. Choromanska, “TAME: Task Agnostic Continual Learning using Multiple Experts,” 2022, arXiv preprint arXiv:2210.03869.
  16. H. Shin, J. K. Lee, J. Kim, and J. Kim, “Continual Learning with Deep Generative Replay,” in Advances in Neural Information Processing Systems, vol. 30 (NIPS 2017).   Curran Associates Inc., 2017, pp. 2994–3003.
  17. Z. Li and D. Hoiem, “Learning without Forgetting,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 40, no. 12, pp. 2935–2947, 2018.
  18. J. Schwarz, W. Czarnecki, J. Luketina, A. Grabska-Barwinska, Y. W. Teh, R. Pascanu, and R. Hadsell, “Progress & Compress: A Scalable Framework for Continual Learning,” in Proc. 35th Int. Conf. on Machine Learning, (ICML’18), ser. Proceedings of Machine Learning Research, vol. 80.   PMLR, Jul. 10–15 2018, pp. 4535–4544.
  19. R. Aljundi, M. Lin, B. Goujaud, and Y. Bengio, “Gradient Based Sample Selection for Online Continual Learning,” in Advances in Neural Information Processing Systems, vol. 32 (NeurIPS 2019), Dec. 8–14 2019, pp. 11 816–11 825.
  20. A. A. Rusu, N. C. Rabinowitz, G. Desjardins, H. Soyer, J. Kirkpatrick, K. Kavukcuoglu, R. Pascanu, and R. Hadsell, “Progressive Neural Networks,” 2022, arXiv preprint arXiv:1606.04671.
  21. Y. Li, J. Yang, Y. Song, L. Cao, J. Luo, and L. Li, “Learning from Noisy Labels with Distillation,” in Proc. 2017 IEEE Int. Conf. on Computer Vision (ICCV).   IEEE Computer Society, Oct. 22–29 2017, pp. 1928–1936.
  22. A. Mallya and S. Lazebnik, “PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning,” in Proc. 2018 IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR’18).   IEEE Computer Society, Jun. 18–23 2018, pp. 7765–7773.
  23. K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in Proc. 2016 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR’16), Jun. 27–30 2016, pp. 770–778.
  24. R. Tibshirani, T. Hastie, B. Narasimhan, and G. Chu, “Diagnosis of Multiple Cancer Types by Shrunken Centroids of Gene Expression,” Proceedings of the National Academy of Sciences of the United States of America, vol. 99, no. 10, pp. 6567–6572, 2002.
  25. J. Han, X. Liang, H. Xu, K. Chen, L. Hong, J. Mao, C. Ye, W. Zhang, Z. Li, X. Liang, and C. Xu, “SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving,” 2021, arXiv preprint arXiv:2106.11118.
  26. E. Verwimp, K. Yang, S. Parisot, L. Hong, S. McDonagh, E. Pérez-Pellitero, M. De Lange, and T. Tuytelaars, “CLAD: A Realistic Continual Learning Benchmark for Autonomous Driving,” Neural Networks, vol. 161, pp. 659–669, 2023.
Citations (1)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.