Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation (2406.13249v2)

Published 19 Jun 2024 in cs.CL, cs.AI, and cs.IR

Abstract: Retrieval augmented generation (RAG) has been applied in many scenarios to augment LLMs with external documents provided by retrievers. However, a semantic gap exists between LLMs and retrievers due to differences in their training objectives and architectures. This misalignment forces LLMs to passively accept the documents provided by the retrievers, leading to incomprehension in the generation process, where the LLMs are burdened with the task of distinguishing these documents using their inherent knowledge. This paper proposes R$2$AG, a novel enhanced RAG framework to fill this gap by incorporating Retrieval information into Retrieval Augmented Generation. Specifically, R$2$AG utilizes the nuanced features from the retrievers and employs a R$2$-Former to capture retrieval information. Then, a retrieval-aware prompting strategy is designed to integrate retrieval information into LLMs' generation. Notably, R$2$AG suits low-source scenarios where LLMs and retrievers are frozen. Extensive experiments across five datasets validate the effectiveness, robustness, and efficiency of R$2$AG. Our analysis reveals that retrieval information serves as an anchor to aid LLMs in the generation process, thereby filling the semantic gap.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Fuda Ye (1 paper)
  2. Shuangyin Li (14 papers)
  3. Yongqi Zhang (33 papers)
  4. Lei Chen (484 papers)