LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning (2412.13626v1)

Published 18 Dec 2024 in cs.CL and cs.AI

Abstract: Long context understanding remains challenging for LLMs due to their limited context windows. This paper introduces Long Input Fine-Tuning (LIFT) for long context modeling, a novel framework that enhances LLM performance on long-context tasks by adapting model parameters to the context at test time. LIFT enables efficient processing of lengthy inputs without the computational burden of offline long-context adaptation, and can improve the long-context capabilities of arbitrary short-context models. The framework is further enhanced by integrating in-context learning and pre-LIFT supervised fine-tuning. The combination of in-context learning and LIFT enables short-context models like Llama 3 to handle arbitrarily long contexts and consistently improves their performance on popular long-context benchmarks like LooGLE and LongBench. We also provide a comprehensive analysis of the strengths and limitations of LIFT on long context understanding, offering valuable directions for future research.
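The abstract says LIFT adapts a short-context model's parameters to a long input at test time, but it does not say how that input is turned into training examples. As a minimal sketch of one possible preprocessing step, the hypothetical helper below splits a long token sequence into overlapping windows that each fit a short context window. The function name, the `window` and `overlap` parameters, and the overlapping-window scheme itself are illustrative assumptions, not details taken from the abstract.

```python
from typing import List, Sequence


def split_into_segments(tokens: Sequence[int], window: int, overlap: int) -> List[List[int]]:
    """Split a token sequence into overlapping windows of at most `window` tokens.

    Hypothetical helper: the abstract does not describe how LIFT builds its
    fine-tuning examples, so this windowing scheme is an assumption.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    if not 0 <= overlap < window:
        raise ValueError("overlap must satisfy 0 <= overlap < window")
    if not tokens:
        return []

    stride = window - overlap  # how far each new window advances
    segments: List[List[int]] = []
    start = 0
    while True:
        segments.append(list(tokens[start:start + window]))
        # Stop once the current window reaches the end of the input.
        if start + window >= len(tokens):
            break
        start += stride
    return segments
```

Under this assumption, each returned segment could serve as one short fine-tuning example, so a model whose context window holds only `window` tokens would still see every part of the long input.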


Authors (6)
