Mechanism underlying the effectiveness of noisy prompts in RAG
Investigate and elucidate why adding random, unrelated documents to the prompt context of Retrieval-Augmented Generation for open-domain question answering improves large language model accuracy, and identify the properties and mechanisms of the resulting "noisy state" that contribute to its effectiveness.
Sponsor
References
Although these experiments show a pattern, we cannot yet answer this question in a definitive manner. While out of the scope of this work, which focuses on the retriever component of RAG systems, we believe it is highly important to investigate the reasons for which the LLM shows this behavior. Future studies should aim to elucidate why this noisy state is more advantageous and identify the characteristics that contribute to its effectiveness.