Papers
Topics
Authors
Recent
Search
2000 character limit reached

BYOKG-RAG: Custom KG QA Framework

Updated 13 March 2026
  • BYOKG-RAG is a framework for question answering over custom knowledge graphs that combines LLM-driven artifact generation with specialized graph retrieval.
  • It employs an iterative two-stage process integrating artifact generation and downstream graph operations to address schema mismatches and entity linking challenges.
  • Experimental evaluations show improved performance across diverse benchmarks, underscoring its effective multi-strategy retrieval and generalization capabilities.

BYOKG-RAG (Bring-Your-Own-Knowledge-Graph Retrieval-Augmented Generation) is a general framework for question answering (QA) over custom or domain-specific knowledge graphs (KGs), using a synergy of LLMs and multi-strategy graph retrieval. It addresses core challenges in knowledge graph QA—heterogeneous schemas, unreliable entity linking, and limited generalization—by interleaving LLM-driven artifact generation and specialized, tool-mediated graph retrieval in an iterative pipeline. Unlike traditional RAG models that rely solely on unstructured text or assume a fixed KG schema, BYOKG-RAG generalizes to arbitrary, user-supplied KGs and supports complex reasoning by decomposing retrieval into multiple complementary phases (Mavromatis et al., 5 Jul 2025).

1. Motivation and Problem Setting

The primary target of BYOKG-RAG is open-domain and domain-specific KGQA, where the input is a natural-language question qq and a user-provided knowledge graph GG of triples T={(h,r,t)}T = \{(h, r, t)\}; the system is tasked with returning the correct answer entities A⊆VA \subseteq V (nodes of GG) (Mavromatis et al., 5 Jul 2025). Existing approaches typically encounter:

  • Schema/surface form mismatches (varying node/edge types, aliases)
  • Inadequate generalization to custom KGs or previously unseen ontologies
  • Agentic LLM traversals prone to entity linking errors and poor multistep compositional reasoning

BYOKG-RAG aims to overcome these limitations by directly leveraging both LLM reasoning and specialized graph operations, iteratively exchanging high-level reasoning artifacts between the LLM and downstream KG tools.

2. System Architecture and Iterative Pipeline

BYOKG-RAG is built around a two-stage loop that continues until convergence:

Stage I: KG-Linker (LLM Generation of Graph Artifacts)

  • From prompt (q,S,Ct−1)(q, S, C^{t-1}), where SS is the KG schema and Ct−1C^{t-1} the prior context, the LLM produces:
    • Extracted question entities (E~q\tilde{E}_q)
    • Candidate answer mentions (E~a\tilde{E}_a)
    • Reasoning/relation paths (P~\tilde{P})
    • Executable OpenCypher queries (Q~\tilde{Q})
    • Draft answers (A~\tilde{A})

Stage II: Graph Retrieval Toolkit

  • Specialized modules resolve LLM outputs to yield relevant KG contexts:
    • Agentic mode: One-hop expansions with iterative LLM filtering for relation/edge relevance.
    • Scoring mode: Retrieves top-kk KG triples by semantic similarity: sim(q,(h,r,t))=Embed(q,h)+Embed(q,r)+Embed(q,t)\text{sim}(q, (h, r, t)) = \text{Embed}(q, h) + \text{Embed}(q, r) + \text{Embed}(q, t).

The union of all retrieved results forms a new context CtC^t; it is added to the LLM input for the next round. The process self-terminates upon stabilization or after TRT_R rounds. The final output is computed as A=LLM(q,C∗)A = \text{LLM}(q, C^*) (Mavromatis et al., 5 Jul 2025).

3. Key Algorithms and Formalism

Formally, BYOKG-RAG uses the following operations per iteration:

  • Entity Linking: For each e^\hat{e} from the LLM, obtain E(e^)=top-mE(\hat{e}) = \text{top-}m nodes according to both string and embedding similarity.
  • Path/Query Retrieval: For each proposed path or query, retrieve corresponding subgraphs/facts from GG.
  • Triplet Retrieval: Either via agentic LLM one-hop expansion and filtering, or via direct scoring by composite semantic similarity.
  • Iteration: Ct=Ct−1∪all new retrievalsC^t = C^{t-1} \cup \text{all new retrievals}.

A crucial insight is to treat the LLM as a generator of diverse "hooks"—string entities, relation paths, executable queries—rather than a full-graph agent. Graph tools resolve these hooks, yielding a more robust and generalizable retrieval compared to agent-only or retriever-only KGQA systems (Mavromatis et al., 5 Jul 2025).

4. Experimental Evaluation

BYOKG-RAG has been evaluated on multiple zero- and few-shot KGQA benchmarks:

Dataset Underlying KG Retrieval Complexity BYOKG-RAG Main Metric 2nd-Best
WebQSP-IH Freebase 1–2-hop QA Hit@1: 86.6% 86.2%
CWQ-IH Freebase up to 4-hop Hit@1: 73.6% 69.3%
CronQuestions Wikidata Temporal multi-entity Hit@1: 65.5% 59.8%
MedQA DiseaseDrugBank Medical domain Hit@2: 65.0% 62.5%
Northwind Enterprise Aggregation/cypher LLMaaJ: 64.9% 55.3%

Averaged over all tasks, BYOKG-RAG exceeds the strongest prior by 4.5 percentage points, while requiring no KG-specific supervision (Mavromatis et al., 5 Jul 2025).

Ablation experiments confirm that each retrieval component (agentic, path, scoring, query generation) provides unique contributions. The LLM-based linking step in particular improves performance over pure string/entity similarity by 6–8 points on compositional tasks. Second-pass refinement yields a further 5–7 point improvement for the most complex benchmarks (Mavromatis et al., 5 Jul 2025).

5. Generalization and Case Analysis

BYOKG-RAG is architecturally agnostic to the KG schema, requiring only schema introspection rather than curated alignment or task-specific templates. It adapts to diverse structural patterns (aggregation, temporal, domain-specialized) through its artifact generation and retrieval mix.

Notable qualitative cases include:

  • For CWQ-style compositional queries, BYOKG-RAG's first iteration proposes plausible relation chains; agentic retrieval then grounds the entities, and subsequent refinement revises the reasoning path to reach the correct target.
  • On temporal reasoning (CronQuestions), BYOKG-RAG leverages both path retrieval and query execution to resolve chronology, outperforming purely agentic traversals that tend to get trapped in local neighborhoods.
  • For enterprise-style aggregations (Text2Cypher), query execution is essential, highlighting the necessity of supporting programmatic KG operations.

6. Relationship to Other KG-Augmented RAG Systems

BYOKG-RAG shares high-level aims with other BYOKG-style approaches such as KG2^2RAG (Zhu et al., 8 Feb 2025) and "KG-Infused RAG" (Wu et al., 11 Jun 2025). A comparison is presented below:

System Main Focus Retrieval Modes LLM Role Empirical Coverage
BYOKG-RAG (Mavromatis et al., 5 Jul 2025) QA over custom KGs; robust, multi-strategy retrieval Entity linking, multi-hop path, agentic, query, scoring Artifact/hook generation; iterative Freebase, Wikidata, MedQA, enterprise
KG2^2RAG (Zhu et al., 8 Feb 2025) Chunk expansion and organization via KG structure Chunk ↔ KG linking, multi-hop expansion, graph-based paragraph organization Entity/relation disambiguation, paragraph assembly HotpotQA and variants
KG-Infused RAG (Wu et al., 11 Jun 2025) Fusing unstructured and KG evidence; spreading activation Dense text, KG spreading, combined ranking Knowledge activation, query rewrite, answer generation Multi-hop Wikipedia QA

A plausible implication is that BYOKG-RAG’s explicit decoupling of LLM-driven artifact generation and downstream graph retrieval enables greater generalization to arbitrary, user-supplied graphs and queries, as well as flexible integration with a range of graph operations.

7. Practical Considerations and Limitations

BYOKG-RAG does not require hand-labeled KGQA data or any graph-specific retriever fine-tuning, reducing onboarding friction for arbitrary domains. The framework's iterative design converges quickly (average ≈2 iterations), resulting in moderate inference overhead (about 2.4× a baseline LLM call), with substantially fewer LLM invocations than multi-step agentic baselines (Mavromatis et al., 5 Jul 2025).

However, trade-offs include additional complexity in integrating and orchestrating multiple retrieval modules, and potential error propagation if initial artifact extraction is misaligned with KG content. The system assumes availability of a graph schema and access to basic KG APIs (entity search, neighbor expansion, path queries, Cypher execution).

References

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to BYOKG-RAG.