Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models (2406.19486v1)

Published 27 Jun 2024 in cs.CL, cs.AI, cs.ET, cs.LG, and eess.SP

Abstract: In prompt tuning, a prefix or suffix text is added to the prompt, and the embeddings (soft prompts) or token indices (hard prompts) of the prefix/suffix are optimized to gain more control over LLMs for specific tasks. This approach eliminates the need for hand-crafted prompt engineering or explicit model fine-tuning. Prompt tuning is significantly more parameter-efficient than model fine-tuning, as it involves optimizing partial inputs of LLMs to produce desired outputs. In this work, we aim to further reduce the amount of trainable parameters required for a LLM to perform well on specific tasks. We propose Low-rank Prompt Tuning (LoPT), a low-rank model for prompts that achieves efficient prompt optimization. The proposed method demonstrates similar outcomes to full parameter prompt tuning while reducing the number of trainable parameters by a factor of 5. It also provides promising results compared to the state-of-the-art methods that would require 10 to 20 times more parameters.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Shouchang Guo (5 papers)
  2. Sonam Damani (5 papers)
  3. Keng-hao Chang (4 papers)