Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs (2506.19967v1)

Published 24 Jun 2025 in cs.CL and cs.AI

Abstract: LLMs have achieved impressive capabilities in language understanding and generation, yet they continue to underperform on knowledge-intensive reasoning tasks due to limited access to structured context and multi-hop information. Retrieval-Augmented Generation (RAG) partially mitigates this by grounding generation in retrieved context, but conventional RAG and GraphRAG methods often fail to capture relational structure across nodes in knowledge graphs. We introduce Inference-Scaled GraphRAG, a novel framework that enhances LLM-based graph reasoning by applying inference-time compute scaling. Our method combines sequential scaling with deep chain-of-thought graph traversal, and parallel scaling with majority voting over sampled trajectories within an interleaved reasoning-execution loop. Experiments on the GRBench benchmark demonstrate that our approach significantly improves multi-hop question answering performance, achieving substantial gains over both traditional GraphRAG and prior graph traversal baselines. These findings suggest that inference-time scaling is a practical and architecture-agnostic solution for structured knowledge reasoning with LLMs

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/Travis_t2/status/1938246430273228886

YouTube

Show All Videos

Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs (2506.19967v1)

Summary

Related Papers

Tweets

YouTube