Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond (2402.14522v2)

Published 22 Feb 2024 in cs.CL and cs.LG

Abstract: Task embedding, a meta-learning technique that captures task-specific information, has gained popularity, especially in areas such as multi-task learning, model editing, and interpretability. However, it faces challenges with the emergence of prompt-guided Large Language Models (LLMs) operating in a gradient-free manner. Existing task embedding methods rely on fine-tuned, task-specific language models, which hinders the adaptability of task embeddings across diverse models, especially prompt-based LLMs. To harness the potential of task embeddings in the era of LLMs, we propose a framework for unified task embeddings (FUTE), harmonizing task embeddings from various models, including smaller language models and LLMs with varied prompts, within a single vector space. Such uniformity enables the comparison and analysis of similarities amongst different models, broadening the scope and utility of existing task embedding methods in multi-model scenarios, while keeping their performance comparable to that of architecture-specific methods.
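The abstract states the goal rather than the mechanics, but the core idea — mapping task embeddings produced by heterogeneous models into one shared vector space so that cross-model similarities become directly comparable — can be illustrated with a minimal sketch. Everything below is a hypothetical illustration, not FUTE's actual alignment procedure: the random projection matrices and the `to_unified` helper are stand-ins for whatever the paper uses to unify the spaces.

```python
import numpy as np

# Minimal sketch of the unified-task-embedding idea: task embeddings from
# two different models (with different dimensionalities) are projected into
# one shared space, where cosine similarity is meaningful across models.
# The projections here are random placeholders; FUTE's real alignment
# method is not described in the abstract.

rng = np.random.default_rng(0)

# Pretend task embeddings from two different models for the same two tasks.
emb_model_a = {"sst2": rng.normal(size=768), "mnli": rng.normal(size=768)}
emb_model_b = {"sst2": rng.normal(size=1024), "mnli": rng.normal(size=1024)}

# Per-model projections into a shared 256-dim space (placeholders).
proj_a = rng.normal(size=(768, 256))
proj_b = rng.normal(size=(1024, 256))

def to_unified(vec: np.ndarray, proj: np.ndarray) -> np.ndarray:
    """Map a model-specific task embedding into the shared space and normalize."""
    u = vec @ proj
    return u / np.linalg.norm(u)

def cosine(u: np.ndarray, v: np.ndarray) -> float:
    """Cosine similarity of two unit-normalized vectors."""
    return float(u @ v)

# Once unified, embeddings from different models are directly comparable,
# e.g. for retrieving the most similar source task across model families.
a_sst2 = to_unified(emb_model_a["sst2"], proj_a)
b_sst2 = to_unified(emb_model_b["sst2"], proj_b)
print("cross-model SST-2 similarity:", cosine(a_sst2, b_sst2))
```

In a space like this, downstream uses of task embeddings (task-similarity analysis, transfer-source selection, prompt comparison) no longer depend on which model produced each embedding, which is the multi-model benefit the abstract claims.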

Authors (4)
  1. Xinyu Wang (186 papers)
  2. Hainiu Xu (12 papers)
  3. Lin Gui (66 papers)
  4. Yulan He (113 papers)