Papers
Topics
Authors
Recent
Search
2000 character limit reached

Text-Graph Synergy: A Bidirectional Verification and Completion Framework for RAG

Published 7 May 2026 in cs.AI and cs.IR | (2605.05643v1)

Abstract: Retrieval-Augmented Generation (RAG) has become a core paradigm for enhancing factual grounding and multi-hop reasoning in LLMs. Traditional text-based RAG often retrieves logically irrelevant pseudo-evidence, while graph-based RAG is frequently hindered by search-time pruning, which may discard potentially valid reasoning paths. Existing hybrid approaches primarily adopt simple evidence concatenation or unidirectional enhancement, which fails to address the fundamental "Information Island" problem caused by asymmetric reasoning flows between unstructured text and structured graphs. We propose \textbf{TGS-RAG}, a unified framework for \textbf{T}ext-\textbf{G}raph \textbf{S}ynergistic enhancement. TGS-RAG introduces a bidirectional mechanism: (i) a \textbf{Graph-to-Text} channel that employs a Global Voting strategy from visited graph nodes to re-rank and refine textual evidence, filtering out semantic noise; and (ii) a \textbf{Text-to-Graph} channel that utilizes the \textbf{Memory-based Orphan Entity Bridging} algorithm. This algorithm utilizes textual cues to proactively resurrect valid but previously pruned reasoning paths from the search history without additional database overhead. Experimental results on multiple multi-hop reasoning benchmarks demonstrate that TGS-RAG significantly outperforms state-of-the-art baselines, achieving a superior balance between retrieval precision and computational efficiency.

Authors (2)

Summary

  • The paper introduces a bidirectional framework that synergizes text and graph retrieval to overcome isolated retrieval limitations in RAG.
  • It employs global voting re-ranking and memory-based orphan entity bridging to recover reasoning paths and enhance factual consistency.
  • Empirical evaluations on MuSiQue and HotpotQA demonstrate significant performance gains and computational efficiency over baseline hybrid RAG methods.

Bidirectional Synergy in RAG: The TGS-RAG Framework

Motivation and Problematic Aspects of Isolated Retrieval

Despite the wide adoption of Retrieval-Augmented Generation (RAG) for enhancing factuality and multi-hop reasoning in LLMs, existing paradigms—text-based and graph-based retrieval—remain fundamentally siloed. Text-based RAG commonly retrieves pseudo-evidence due to overreliance on semantic similarity, leading to logical irrelevance. Conversely, graph-based RAG excels at logical interpretability but is constrained by search-time pruning, often overlooking valid reasoning paths (the "Information Island" problem). Hybrid and unidirectional methods have attempted to combine these strengths, but their lack of mutual verification and closed-loop integration leaves a critical performance ceiling. Figure 1

Figure 1: Comparison illustrating how traditional text-based and graph-based retrieval paradigms create "Information Islands," whereas TGS-RAG introduces a bidirectional synergy to bridge semantic gaps.

Architecture of TGS-RAG

TGS-RAG introduces a bidirectional enhancement framework that tightly couples unstructured text and structured knowledge graphs at both evidence retrieval and reasoning stages. The architecture involves three main phases:

  1. Knowledge Base Construction: Extraction of entities and relations from documents produces a unified knowledge base where text chunks are cross-referenced with a knowledge graph.
  2. Bidirectional Synergistic Retrieval: A dual-retrieval process generates both candidate textual evidence and structured graph reasoning paths. Semantic Beam Search accumulates a Visited Memory including both active and pruned nodes.
  3. Generation: Selected paths and chunks are synthesized into a coherent answer. Figure 2

    Figure 2: The TGS-RAG architecture highlighting the knowledge base, bidirectional retrieval with Visited Memory, and orchestrated answer generation.

Bidirectional Synergistic Mechanisms

The core innovation of TGS-RAG is the closed-loop interaction between the text and graph channels:

Graph-to-Text (Global Voting and Re-ranking)

All graph entities visited—including those in pruned paths—serve as recommenders, guiding the re-ranking of text chunks through a voting mechanism. The final score for each text chunk is a weighted combination of embedding similarity and graph-driven votes, with a tunable synergy weight (α\alpha). This enables TGS-RAG to recall text that may be structurally relevant but low in surface similarity, thus overcoming the pseudo-evidence problem prevalent in dense retrieval.

Text-to-Graph (Memory-based Orphan Entity Bridging)

To recover reasoning paths discarded by greedy graph pruning, TGS-RAG introduces the Memory-based Orphan Entity Bridging algorithm. Textual context is scanned to identify key entities present in retrieved text but absent from active graph paths. If these "orphan entities" were previously visited and pruned during beam search, their reasoning paths are resurrected, reconstructing logical chains with zero overhead and avoiding the combinatorial explosion of iterative graph expansion. Figure 3

Figure 3: The Orphan Entity Bridging process: semantic beam search generates a Visited Memory; text cues recall relevant but pruned entities, triggering resurrection of lost reasoning chains.

Empirical Evaluation

Experiments on MuSiQue and HotpotQA demonstrate several robust quantitative findings.

  • On MuSiQue, TGS-RAG achieves a Strict Hit Rate of 34.84% and LLM Judge Accuracy of 41.37%, more than doubling baseline hybrid RAG's performance.
  • On HotpotQA, it attains 62.00% Strict Hit Rate and 79.99% Judge Accuracy, setting a new performance state-of-the-art among evaluated systems.

These improvements are attributed specifically to the bidirectional synergy: removal of Graph-to-Text or Text-to-Graph components in ablation studies produces sharp performance drops (e.g., strict hit rate falls to 47.82% without Bridging).

Furthermore, TGS-RAG exhibits a Pareto-efficient computational profile. While methods such as GraphRAG and LightRAG incur exorbitant token usage (e.g., LightRAG uses 757M tokens on HotpotQA), TGS-RAG delivers superior accuracy/recall with 3–5× less computational overhead.

Theoretical and Practical Implications

TGS-RAG’s bidirectional framework directly addresses the core brittleness of hybrid and pipeline retrieval, proving that independent channels are insufficient for robust multi-hop reasoning across disconnected or sparsified evidence. The global voting and memory-based bridging mechanisms establish mutual verification, which is vital not only for factual retrieval but also for robust and interpretable reasoning in LLMs.

In practical terms, the on-demand resurrection of orphan paths enables efficient retrieval from large or dynamically updated knowledge bases, avoiding the scaling bottlenecks of exhaustive graph indexing. This positions TGS-RAG as a scalable, cost-effective solution for deployed knowledge-intensive systems.

Broader Impact and Future Directions

The TGS-RAG approach signals a new direction for RAG systems: beyond shallow hybridization, effective factual reasoning in LLMs demands bidirectional, memory-augmented synergy between symbolic structure and neural evidence. Anticipated advances include tighter integration with graph construction (e.g., continual knowledge extraction) and adaptive retrieval policies that learn optimal synergy parameters in-situ. Furthermore, the memory-based bridging principle is likely extensible to multi-modal RAG, where structured evidence bridges textual, tabular, and visual domains.

Conclusion

TGS-RAG establishes a rigorous, bidirectional methodology for integrating unstructured and structured retrieval in multi-hop reasoning tasks. Via global voting re-ranking and orphan entity bridging, it achieves both higher retrieval fidelity and efficiency compared to prevailing RAG paradigms. These findings substantiate the necessity of deep, closed-loop synergy for robust LLM factual reasoning, providing a scalable blueprint for future cross-modal retrieval and generation frameworks.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 3 likes about this paper.