Continual Prompt Tuning for Dialog State Tracking (2203.06654v1)

Published 13 Mar 2022 in cs.CL

Abstract: A desirable dialog system should be able to continually learn new skills without forgetting old ones, and thereby adapt to new domains or tasks in its life cycle. However, continually training a model often leads to a well-known catastrophic forgetting issue. In this paper, we present Continual Prompt Tuning, a parameter-efficient framework that not only avoids forgetting but also enables knowledge transfer between tasks. To avoid forgetting, we only learn and store a few prompt tokens' embeddings for each task while freezing the backbone pre-trained model. To achieve bi-directional knowledge transfer among tasks, we propose several techniques (continual prompt initialization, query fusion, and memory replay) to transfer knowledge from preceding tasks and a memory-guided technique to transfer knowledge from subsequent tasks. Extensive experiments demonstrate the effectiveness and efficiency of our proposed method on continual learning for dialog state tracking, compared with state-of-the-art baselines.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (5)

Qi Zhu (160 papers)
Bing Li (374 papers)
Fei Mi (56 papers)
Xiaoyan Zhu (54 papers)
Minlie Huang (225 papers)

Citations (53)

View on Semantic Scholar

Continual Prompt Tuning for Dialog State Tracking (2203.06654v1)

Related Papers