Toward Optimal Search and Retrieval for RAG (2411.07396v1)

Published 11 Nov 2024 in cs.CL

Abstract: Retrieval-augmented generation (RAG) is a promising method for addressing some of the memory-related challenges associated with LLMs. Two separate systems form the RAG pipeline, the retriever and the reader, and the impact of each on downstream task performance is not well understood. Here, we work towards the goal of understanding how retrievers can be optimized for RAG pipelines for common tasks such as Question Answering (QA). We conduct experiments focused on the relationship between retrieval and RAG performance on QA and attributed QA and unveil a number of insights useful to practitioners developing high-performance RAG pipelines. For example, lowering search accuracy has minor implications for RAG performance while potentially increasing retrieval speed and memory efficiency.

Authors (6)

Tweets

https://twitter.com/ipfconline1/status/1881948671291396163

https://twitter.com/ambodj528647/status/1859557857202520442

https://twitter.com/raghavan_anand/status/1863659531793760332

https://twitter.com/GptMaestro/status/1863668970156626205
