Recasting Continual Learning as Sequence Modeling (2310.11952v2)

Published 18 Oct 2023 in cs.LG and cs.AI

Abstract: In this work, we aim to establish a strong connection between two significant bodies of machine learning research: continual learning and sequence modeling. That is, we propose to formulate continual learning as a sequence modeling problem, allowing advanced sequence models to be utilized for continual learning. Under this formulation, the continual learning process becomes the forward pass of a sequence model. By adopting the meta-continual learning (MCL) framework, we can train the sequence model at the meta-level, on multiple continual learning episodes. As a specific example of our new formulation, we demonstrate the application of Transformers and their efficient variants as MCL methods. Our experiments on seven benchmarks, covering both classification and regression, show that sequence models can be an attractive solution for general MCL.
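
To make the formulation concrete, the following is a minimal PyTorch sketch of the core idea, not the authors' implementation: a continual-learning episode is serialized as a sequence of (x, y) pairs, the "learning" happens entirely inside the sequence model's forward pass, and an outer loop meta-trains the model across many episodes. The module names, dimensions, synthetic linear-regression episodes, and the unmasked (bidirectional) encoder are all illustrative assumptions.

```python
# Sketch of continual learning as sequence modeling (illustrative, not the
# paper's code). Each episode's training stream is consumed in one forward
# pass; meta-training optimizes the model over many random episodes.
import torch
import torch.nn as nn

class SequenceModelCL(nn.Module):
    def __init__(self, x_dim, y_dim, d_model=64, n_layers=2, n_heads=4):
        super().__init__()
        self.embed_xy = nn.Linear(x_dim + y_dim, d_model)  # embed (x, y) pairs
        self.embed_x = nn.Linear(x_dim, d_model)           # embed query input
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, y_dim)

    def forward(self, xs, ys, x_query):
        # The continual-learning process is this forward pass: the stream of
        # (x, y) tokens conditions the prediction for the held-out query.
        ctx = self.embed_xy(torch.cat([xs, ys], dim=-1))   # (B, T, d_model)
        qry = self.embed_x(x_query).unsqueeze(1)           # (B, 1, d_model)
        h = self.encoder(torch.cat([ctx, qry], dim=1))
        return self.head(h[:, -1])                         # predicted y_query

# Meta-training loop over synthetic regression episodes (assumption: each
# episode is a random linear map, standing in for a task stream).
model = SequenceModelCL(x_dim=4, y_dim=1)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
for step in range(100):
    W = torch.randn(4, 1)          # one episode's ground-truth mapping
    xs = torch.randn(1, 16, 4)     # stream of 16 training inputs
    ys = xs @ W                    # their targets
    x_q = torch.randn(1, 4)        # held-out query input
    loss = nn.functional.mse_loss(model(xs, ys, x_q), x_q @ W)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

After meta-training, "learning" a new episode requires no gradient updates: feeding a fresh stream of (x, y) pairs plus a query through the forward pass yields the prediction, which is what lets efficient Transformer variants serve as MCL methods.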

Authors (3)
  1. Soochan Lee (7 papers)
  2. Jaehyeon Son (5 papers)
  3. Gunhee Kim (74 papers)
Citations (5)
