Papers
Topics
Authors
Recent
Search
2000 character limit reached

KnowAgent: Knowledge-Augmented LLM Agents

Updated 22 January 2026
  • KnowAgent is a methodology that integrates explicit, action-centric knowledge bases into LLM agents to improve planning and reduce erroneous actions.
  • It employs structured KBs and knowledge graphs to enforce valid action transitions and enhance multi-agent collaborative routing.
  • Empirical results show that KnowAgent decreases hallucination rates and boosts task success across varied domains.

KnowAgent is a methodology for enhancing the planning and reasoning capabilities of LLM-based agents through structured knowledge augmentation. The core principle of KnowAgent is to integrate explicit knowledge—particularly action-centric knowledge bases—into both the planning, execution, and collaborative phases of LLM agent systems, improving grounded decision-making, reducing planning hallucinations, and enabling context-sensitive routing in single- and multi-agent settings. The concept manifests in several prominent lines of research, including knowledge-augmented planning (Zhu et al., 2024), knowledge-guided agent routing (Zhang et al., 6 Oct 2025), and parametric world knowledge models for planning (Qiao et al., 2024).

1. Knowledge-Augmented Planning for LLM Agents

The foundational KnowAgent approach introduces explicit action knowledge bases (KBs) and knowledgeable self-learning strategies to restrict the action space and guide agent planning (Zhu et al., 2024).

Task Formalization

Given:

  • State space S\mathcal{S}, with state st=Ht=(T0,A0,O0,,Tt1,At1,Ot1)s_t = \mathcal{H}_t = (\mathcal{T}_0,\mathcal{A}_0,\mathcal{O}_0,\ldots,\mathcal{T}_{t-1},\mathcal{A}_{t-1},\mathcal{O}_{t-1}) representing the full episodic history up to time tt (thoughts Ti\mathcal{T}_i, actions Ai\mathcal{A}_i, observations Oi\mathcal{O}_i).
  • Action space Ea={a1,,aN}E_a = \{a_1, \ldots, a_N\}, comprising high-level, executable action schemas.
  • Transition function T:S×AST: \mathcal{S} \times \mathcal{A} \to \mathcal{S} mapping states and actions to new states (via the environment).
  • Terminal goal predicate G:S{true,false}G: \mathcal{S} \to \{\mathrm{true}, \mathrm{false}\}.

The agent’s objective is to synthesize trajectories τ=(T0,A0,O0,)\tau=(\mathcal{T}_0, \mathcal{A}_0, \mathcal{O}_0, \ldots) maximizing G(sT)=trueG(s_T)=\mathrm{true}, subject to structured KB constraints.

Action Knowledge Base and Constrained Planning

The action KB is a directed graph G=(Ea,R)G=(E_a, \mathcal{R}) with nodes EaE_a and edges R={rk:aiaj}\mathcal{R}=\{r_k:a_i \to a_j\} encoding permitted action transitions. During planning, the next action at each step is selected only among valid out-neighbors of the current action node in GG, restricting the agent to plausible action paths and reducing hallucinations.

This constraint is incorporated via a masked beam search, with next-token distributions zeroed out for invalid transitions and renormalized at each planning step.

Knowledgeable Self-Learning

Iterative self-learning proceeds by: (i) generating trajectories with the current model under KB constraints, (ii) filtering/merging to select efficient, valid trajectories, and (iii) fine-tuning the model (e.g., via LoRA) on this filtered set. This process is repeated until performance stabilizes.

Performance is measured via F1 (HotpotQA) or task success rate (ALFWorld), with explicit error metrics:

  • Invalid action rate: the proportion of actions violating KB transitions.
  • Misordered action rate: proportion of semantically valid, but logically misordered actions.
  • Combined hallucination score H=H = InvRate ++ MisorderRate.

Empirically, KnowAgent attains lower HH (e.g., on HotpotQA, 0.35% invalid and 1.23% misordered on Llama-2-13B, outperforming ReAct and Reflexion), and superior F1 and success rates across all tested LLM backbones (Zhu et al., 2024).

2. Knowledge Graph–Guided Multi-Agent Routing

The AgentRouter framework extends the KnowAgent paradigm to collaborative multi-agent question answering by formulating agent selection and aggregation as heterogeneous graph reasoning over a knowledge graph (KG) encoding the query, entities, and agent candidates (Zhang et al., 6 Oct 2025).

Knowledge Graph Construction

For each question qq and context CC, a heterogeneous KG G=(V,E)\mathcal{G} = (\mathcal{V}, \mathcal{E}) is constructed:

  • Query node VQ\mathcal{V}_Q: embedding of the question via a contextual encoder.
  • Entity nodes VE\mathcal{V}_E: extracted via spaCy NER, storing embeddings, types, and mention frequencies.
  • Agent nodes VA\mathcal{V}_A: each representing a candidate KnowAgent (distinct LLM+prompting style), embedding the agent strategy description.

Edge types:

  1. Query–Entity (qeq \to e): connections to entities mentioned in context.
  2. Entity–Entity (eee \to e): dependency-derived relations.
  3. Agent–Entity (aea \to e): fixed, "manage" edges based on agent-declared attention.
  4. Query–Agent (qaq \to a): trainable edges encoding routing strength.

RouterGNN for Task-Aware Routing

A specialized heterogeneous GNN (RouterGNN) processes G\mathcal{G}. At each layer \ell:

  • Edge-type-specific transforms propagate messages between node types, followed by type-specific aggregation and gating.
  • After LL layers, query and agent node embeddings encode rich, context-dependent state.

Routing probabilities over agents,

pθ(aq,G)=exp(s(q,a))aexp(s(q,a)),p_\theta(a|q, \mathcal{G}) = \frac{\exp(s(q, a))}{\sum_{a'} \exp(s(q, a'))},

are produced via MLP over concatenated question and agent embeddings.

Supervision is provided via soft targets p(aq)p^*(a|q) (softmax over empirical F1 scores for each agent on the current qq), optimized with KL divergence loss.

Output Aggregation and Complementarity

Final answer aggregation employs a fusion rule (weighted majority/confidence voting) over candidate agent answers, with routing weights modulating influence. The system dynamically allocates probability mass to agents whose strategy best matches the question-context KG, e.g., emphasizing summary agents for complex subgraph reasoning and CoT agents for direct queries.

Quantitatively, AgentRouter consistently outperforms both strongest single agents and prior routers on 2WikiMultihopQA, HotpotQA, NewsQA, and TriviaQA, with F1 improvements of up to +4.03 points on TriviaQA (Zhang et al., 6 Oct 2025).

3. Parametric World Knowledge Model for Planning

The World Knowledge Model (WKM) framework generalizes KnowAgent principles by providing both global (task-level) and local (state-level) knowledge to guide agent planning (Qiao et al., 2024).

Architectural Components

  • Global knowledge generator πϕ(g)\pi_{\phi}^{(g)}: synthesizes task knowledge κ\kappa based only on the task instruction uu, steering high-level planning away from blind trial-and-error.
  • Local knowledge generator πϕ(l)\pi_{\phi}^{(l)}: at each time step tt, produces a summary sts_t of the agent's current state, which is used to query a database of state/action triplets for locally valid next action distribution pknow(a)p_{know}(a).

Agent decisions are made as:

at+1=argmaxaAu[γπθ(aht)+(1γ)pknow(a)],a_{t+1} = \arg\max_{a \in A_u} \left[ \gamma \, \pi_\theta(a|h_t) + (1 - \gamma) \, p_{know}(a) \right],

where hth_t is the current history, and γ\gamma interpolates between the agent’s policy and knowledge-informed preferences.

Training and Knowledge Synthesis

The world knowledge model is trained to reproduce synthesized task knowledge and state summaries via standard likelihood objectives over expert trajectories. Instance-level synthesis (per instruction) achieves superior generalization to unseen environments compared to static, dataset-level rules. Knowledge can be distilled from both expert and sampled trajectories.

Empirically, WKM boosts average reward (e.g., ALFWorld seen 66.8 → 73.6, unseen 71.4 → 76.9 with Mistral-7B+ETO), improves planning efficiency (fewer steps to goal), and reduces hallucination rates (e.g., ALFWorld seen hallucinatory actions 34.3% → 32.9%) (Qiao et al., 2024).

4. Empirical Results and Comparative Analysis

Experimental validation across knowledge-augmented planning, routing, and world knowledge modeling consistently demonstrates:

  • Significant reductions in invalid or misordered planning steps compared to unconstrained or prompting-based baselines.
  • Superior or comparable answer accuracy and success rates to methods relying on supervised or GPT-4-synthesized data.
  • Generalization of instance-level knowledge synthesis to novel instructions and environments.
  • The ability for small or "weak" world knowledge modules to guide "strong" agent models, improving performance via knowledge-informed control.

Ablation studies show that removing KB constraints or world knowledge components (either global or local) deteriorates both F1 and planning reliability (Zhu et al., 2024, Qiao et al., 2024).

Method Invalid (%) Misordered (%) F1 (HotpotQA, 13B) ALFWorld Success (%)
ReAct 2.08 3.54 24.17 20.90
Reflexion 6.87 3.87 36.70 34.33
FiReAct 38.26 50.37
KnowAgent 0.35 1.23 39.26 58.71

5. Limitations and Future Directions

Current KnowAgent systems require:

Instance-level knowledge synthesis and unified multi-task training indicate promising generalization and extensibility; further work on dynamic experience-based updating and tighter integration of knowledge and search (e.g., MCTS) is suggested (Qiao et al., 2024).

6. Significance and Broader Impact

The KnowAgent paradigm demonstrates that action-centric knowledge formalization—whether in the form of explicit, programmatic KBs, structured KGs, or parametric world models—substantially and reliably improves the actionability, reliability, and transfer of LLM-based agents. It bridges the gap between rich language representations and the precise, grounded requirements of sequential planning and collaborative reasoning. The approach is foundational for agentic AI systems seeking robust, explainable, and context-sensitive behavior across varied domains (Zhu et al., 2024, Zhang et al., 6 Oct 2025, Qiao et al., 2024).

Topic to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to KnowAgent.