Optimizing open-domain question answering with graph-based retrieval augmented generation (2503.02922v1)

Published 4 Mar 2025 in cs.IR

Abstract: In this work, we benchmark various graph-based retrieval-augmented generation (RAG) systems across a broad spectrum of query types, including OLTP-style (fact-based) and OLAP-style (thematic) queries, to address the complex demands of open-domain question answering (QA). Traditional RAG methods often fall short in handling nuanced, multi-document synthesis tasks. By structuring knowledge as graphs, we can facilitate the retrieval of context that captures greater semantic depth and enhances LLM operations. We explore graph-based RAG methodologies and introduce TREX, a novel, cost-effective alternative that combines graph-based and vector-based retrieval techniques. Our benchmarking across four diverse datasets highlights the strengths of different RAG methodologies, demonstrates TREX's ability to handle multiple open-domain QA types, and reveals the limitations of current evaluation methods. In a real-world technical support case study, we demonstrate how TREX solutions can surpass conventional vector-based RAG in efficiently synthesizing data from heterogeneous sources. Our findings underscore the potential of augmenting LLMs with advanced retrieval and orchestration capabilities, advancing scalable, graph-based AI solutions.
