Fine-Tuning LLaMA for Multi-Stage Text Retrieval (2310.08319v1)

Published 12 Oct 2023 in cs.IR

Abstract: The effectiveness of multi-stage text retrieval has been solidly demonstrated since before the era of pre-trained language models. However, most existing studies utilize models that predate recent advances in large language models (LLMs). This study seeks to explore potential improvements that state-of-the-art LLMs can bring. We conduct a comprehensive study, fine-tuning the latest LLaMA model both as a dense retriever (RepLLaMA) and as a pointwise reranker (RankLLaMA) for both passage retrieval and document retrieval using the MS MARCO datasets. Our findings demonstrate that the effectiveness of LLMs indeed surpasses that of smaller models. Additionally, since LLMs can inherently handle longer contexts, they can represent entire documents holistically, obviating the need for traditional segmenting and pooling strategies. Furthermore, evaluations on BEIR demonstrate that our RepLLaMA-RankLLaMA pipeline exhibits strong zero-shot effectiveness. Model checkpoints from this study are available on HuggingFace.
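
As a rough illustration of the bi-encoder setup the abstract describes: the page does not include the paper's code, so the base checkpoint name, the "query:"/"passage:" prefixes, and last-token pooling in the sketch below are assumptions rather than the authors' exact recipe. A RepLLaMA-style dense retriever can be approximated with Hugging Face Transformers as follows:

```python
# Minimal sketch of an LLM-based dense retriever in the spirit of RepLLaMA.
# Assumptions (not taken from this page): the base checkpoint, the prompt
# prefixes, and pooling the final token's hidden state as the embedding.
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "meta-llama/Llama-2-7b-hf"  # assumed base model; the paper fine-tunes LLaMA

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME, torch_dtype=torch.float16, device_map="auto")
model.eval()


def embed(text: str, prefix: str) -> torch.Tensor:
    """Encode text and use the hidden state of the final token as its dense vector."""
    # Append the end-of-sequence token so the last position summarizes the whole input.
    inputs = tokenizer(prefix + text + tokenizer.eos_token, return_tensors="pt").to(model.device)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # shape: (1, seq_len, dim)
    return torch.nn.functional.normalize(hidden[0, -1], dim=-1)


query_vec = embed("how do multi-stage retrieval pipelines work", "query: ")
passage_vec = embed("Multi-stage retrieval pairs a fast first-stage retriever "
                    "with a more expensive reranker.", "passage: ")
print(float(query_vec @ passage_vec))  # cosine similarity serves as the relevance score
```

A RankLLaMA-style pointwise reranker would instead feed the concatenated query-document pair through the model and map the final hidden state to a single relevance score, which is then used to rescore the retriever's top candidates.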
