SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning (2212.10929v1)

Published 21 Dec 2022 in cs.CL, cs.AI, and cs.LG

Abstract: Pre-trained LLMs can efficiently interpolate human-written prompts in a natural way. Multitask prompted learning can help generalization through a diverse set of tasks at once, thus enhancing the potential for more effective downstream fine-tuning. To perform efficient multitask inference in the same batch, parameter-efficient fine-tuning methods such as prompt tuning have been proposed. However, the existing prompt tuning methods may lack generalization. We propose SPT, a semi-parametric prompt tuning method for multitask prompted learning. The novel component of SPT is a memory bank from which memory prompts are retrieved based on discrete prompts. Extensive experiments, such as (i) fine-tuning a full LLM with SPT on 31 different tasks from 8 different domains and evaluating zero-shot generalization on 9 heldout datasets under 5 NLP task categories and (ii) pretraining SPT on the GLUE datasets and evaluating fine-tuning on the SuperGLUE datasets, demonstrate the effectiveness of SPT.
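
As a rough illustration of the retrieval mechanism described in the abstract, the sketch below shows one plausible way a trainable memory bank of soft prompts could be queried with a pooled representation of the discrete (hand-written) prompt and prepended to the input embeddings of a frozen model. All names, shapes, and the top-k soft-mixing scheme are assumptions for illustration; they are not taken from the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MemoryPromptRetriever(nn.Module):
    """Hypothetical sketch of semi-parametric prompt tuning:
    a memory bank of prompt embeddings is queried with a representation
    of the discrete prompt, and the retrieved memory prompts are
    prepended to the input embeddings before the (frozen) LM."""

    def __init__(self, num_memories: int, prompt_len: int, d_model: int, top_k: int = 2):
        super().__init__()
        # One key vector and one sequence of prompt vectors per memory entry.
        self.memory_keys = nn.Parameter(torch.randn(num_memories, d_model))
        self.memory_prompts = nn.Parameter(torch.randn(num_memories, prompt_len, d_model))
        self.top_k = top_k

    def forward(self, discrete_prompt_repr: torch.Tensor, input_embeds: torch.Tensor) -> torch.Tensor:
        # discrete_prompt_repr: (batch, d_model) pooled encoding of the discrete prompt
        # input_embeds:         (batch, seq_len, d_model) token embeddings of the input
        scores = F.softmax(discrete_prompt_repr @ self.memory_keys.T, dim=-1)  # (batch, num_memories)
        topk_scores, topk_idx = scores.topk(self.top_k, dim=-1)
        topk_scores = topk_scores / topk_scores.sum(dim=-1, keepdim=True)

        # Soft-combine the top-k retrieved memory prompts per example.
        retrieved = self.memory_prompts[topk_idx]                           # (batch, k, prompt_len, d_model)
        retrieved = (topk_scores[..., None, None] * retrieved).sum(dim=1)   # (batch, prompt_len, d_model)

        # Prepend the retrieved prompts to the input embeddings.
        return torch.cat([retrieved, input_embeds], dim=1)
```

Soft-mixing the top-k entries keeps the lookup differentiable, so the memory bank could be trained jointly with the other prompt parameters; a hard nearest-key lookup would be an equally reasonable reading of the abstract.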

Authors (7)
  1. M Saiful Bari (22 papers)
  2. Aston Zhang (48 papers)
  3. Shuai Zheng (67 papers)
  4. Xingjian Shi (35 papers)
  5. Yi Zhu (233 papers)
  6. Shafiq Joty (187 papers)
  7. Mu Li (95 papers)
Citations (5)