Schema-based Hierarchical Active Inference
- Schema-based Hierarchical Active Inference is a computational framework that integrates schema theory, predictive coding, and active inference to support rapid generalization and flexible behavior.
- It employs a hierarchical generative model with abstract schema layers and concrete sensorimotor levels, enabling rapid schema remapping in complex tasks like navigation and robotics.
- The framework minimizes variational free energy using Dirichlet updates for grounding likelihoods, demonstrating both neural plausibility and robust performance across diverse domains.
Schema-based Hierarchical Active Inference (S-HAI) is a computational framework integrating schema theory, hierarchical predictive coding, and active inference to support rapid generalization, flexible behavior, and neural coding of abstract task structures. S-HAI provides a normative, mechanistic account for how abstract relational structures—schemas—can be formed, reused, and remapped to concrete experiences via hierarchical generative modeling and free-energy–minimizing inference. The architecture has demonstrated behavioral and neural plausibility across domains such as spatial navigation, control, robotics, and dynamic multi-entity interaction (Maele et al., 26 Jan 2026, Collis et al., 2024, Pezzato et al., 23 Jul 2025, Priorelli et al., 2024, Tinguy et al., 2023).
1. Conceptual Foundations and Hierarchical Generative Model
S-HAI models cognition and control as inference under hierarchical generative models, with schemas at higher levels encoding abstract relational or structural knowledge, and subordinate levels grounding these abstractions in sensorimotor or spatial specifics. A minimal S-HAI comprises at least two levels:
- Level 1 (concrete navigation or control): Discrete hidden states (e.g., spatial locations), observations (e.g., tile color), and atomic actions (e.g., up, down, left, right), governed by POMDP dynamics.
- Level 2 (schema): Abstract hidden states representing schema phases or relational goals (e.g., A, B, C, D), observations identifying the current schema phase or goal, and abstract actions (e.g., "advance phase").
Crucially, S-HAI introduces a grounding likelihood, , mapping abstract schema observations to specific states at the lower level, parameterized as a Dirichlet-matrix and learned online (Maele et al., 26 Jan 2026). This mapping supports "schema remapping," allowing abstract task structure to be rapidly rebound to new physical contexts.
For domains with both continuous and discrete states, recent S-HAI variants use rSLDS or discrete-continuous hybrid modules, with schemas emerging as dynamical switching modes (Collis et al., 2024, Priorelli et al., 2024).
2. Variational Inference, Learning Rules, and Policy Selection
Inference in S-HAI minimizes variational free energy separately at each hierarchical level. The variational posterior approximates , with free energy
Beliefs about hidden states are updated via message passing, and the grounding likelihood parameters are updated with conjugate Dirichlet increments: where is the inferred spatial reward location, and is a learning rate (Maele et al., 26 Jan 2026).
Policies (action sequences) are sampled according to their expected free energy: with reflecting epistemic (information gain) and extrinsic (reward) terms, and imposing inductive costs such as target-seeking regularization.
In hybrid continuous-discrete S-HAI, schema transitions are chosen to minimize a free-energy bound integrating discrete information gain, utility, and cached subgoal costs, with rSLDS-based message passing for mode inference and Laplace-EM for continuous state estimation (Collis et al., 2024, Priorelli et al., 2024).
3. Schema Formation, Remapping, and Generalization
S-HAI operationalizes schema formation and reuse through hierarchical POMDPs and nonparametric mixture models over grounding likelihoods:
- Fast schema remapping: Abstract schema phases can be rebound to entirely novel lower-level arrangements with minimal data, by updating after encountering new reward-location pairings (Maele et al., 26 Jan 2026).
- Clone-structured schema graphs: In tasks with ambiguous goal identities (e.g., repeated goals in one location), S-HAI-2C extensions use a clone-structured causal graph (CSCG) to maintain phase-specific state distinctions for disambiguation (Maele et al., 26 Jan 2026).
- Assimilation vs. accommodation: A Mixture of Grounding Likelihoods (MoGL) extension allows the agent to maintain multiple components, with nonparametric spawning of new mappings when likelihood drops below a threshold. This enables reuse of old schemata on repeated blocks and creation of new mappings for novel contexts (Maele et al., 26 Jan 2026).
In hybrid domains, rSLDS architectures discover schema-like dynamical modes, supporting temporally-abstracted subgoals, information-theoretic exploration, and rapid "lifting" from low-level control to abstract planning (Collis et al., 2024, Priorelli et al., 2024).
4. Behavioral and Cognitive Validation
S-HAI has been evaluated in tasks requiring generalization, remapping, and hierarchy:
- Spatial navigation ("ABCD" task): S-HAI achieves near-optimal rapid generalization, outperforming hierarchical active inference without schema mechanisms (HAI), especially on blocks with novel reward-well mappings (Maele et al., 26 Jan 2026).
- Alias-disambiguation ("ABCB" tasks): S-HAI-2C agents uniquely disambiguate phase-aliased goals via cloned schema states, rapidly converging to optimal performance (Maele et al., 26 Jan 2026).
- Sparse-reward control: S-HAI with rSLDS mode discovery solves sparse-reward continuous control benchmarks (e.g., Continuous Mountain Car) more efficiently than standard RL, exploiting schema subdivisions for exploration and goal-reaching (Collis et al., 2024).
- Hierarchical navigation in vision-based domains: Multi-layer S-HAI achieves >90% coverage and high success rates in multi-room grid environments, outperforming count-based and curiosity-driven RL (Tinguy et al., 2023).
- Long-horizon mobile manipulation: S-HAI architectures integrating skill-level schema planning, retry modules, and continuous active inference achieve top performance in realistic rearrangement tasks, demonstrating flexibility, adaptability, and on-the-fly composition (Pezzato et al., 23 Jul 2025).
5. Neural Correspondence and Theoretical Implications
S-HAI explicitly predicts and replicates neural codes observed in mammalian frontal cortex and hippocampus:
- Goal-progress cells: The expectation of the inductive cost ramps as the agent approaches goals, reproducing ramping activity in medial prefrontal cortex populations (Maele et al., 26 Jan 2026).
- Goal-identity cells: Schema-level posteriors remain stable during traversal to each abstract goal, generating task-invariant, goal-selective activity.
- Conjunctive cells: Joint encoding via parameters yields neurons selective for both abstract goal identity and spatial location, paralleling mPFC–hippocampal conjunction cells.
- Place-like cells: One-hot Level-1 state encodings replicate classic hippocampal place cell responses.
The framework aligns with data showing both structure-invariant and context-specific neural populations, and supports the hypothesis that predictive processing and active inference, implemented hierarchically, underlie schema-based learning, abstraction, and generalization (Maele et al., 26 Jan 2026).
6. Extensions and Applications
S-HAI generalizes to a spectrum of schema-rich domains:
- Hybrid body–tool–object systems: Deep S-HAI layers support composition of body schemas, tool affordances, and object-action relationships, integrating discrete policy inference with fast, continuous predictive-coding in multiple kinematic chains (Priorelli et al., 2024).
- Robotic control: S-HAI enables hierarchical skill selection, robust failure handling, and whole-body continuous inference, facilitating complex manipulation and rearrangement in realistic, online robotic settings (Pezzato et al., 23 Jul 2025).
- Spatial and temporal map learning: Cognitive map–based S-HAI architectures autonomously grow a graph of allocentric schemas, associating allocentric and egocentric representations, supporting efficient exploration, generalization, and scalable memory usage (Tinguy et al., 2023).
A summary of hierarchical organizations in representative S-HAI implementations:
| Paper | Level 1 | Level 2 / 3 | Schema Mechanism |
|---|---|---|---|
| (Maele et al., 26 Jan 2026) | Spatial navigation | Schema phase | Grounded POMDP + A{(2→1)} |
| (Collis et al., 2024) | rSLDS continuous | Discrete schema | Learned mode switching |
| (Pezzato et al., 23 Jul 2025) | Whole-body control | Skill/retry/nav | Discrete skill POMDP, retry |
| (Tinguy et al., 2023) | Egocentric state | Alloc/Map node | Growing graph of schemas |
| (Priorelli et al., 2024) | Hybrid dynamics | Discrete policy | Schema function composition |
7. Broader Significance and Future Directions
S-HAI bridges behavioral, computational, and neural levels of analysis by making schemas explicit, hierarchically embedded computational objects subject to online Bayesian inference and learning. The framework provides:
- Rapid, flexible generalization by decoupling abstract structure from context-specific realization,
- Efficient structure learning and memory use by modularizing, growing, and merging schematic representations,
- Interpretability through mapping between computational states and observed neural codes,
- Scalability to hybrid, multi-entity, and continuous action-perception domains.
Current research extends S-HAI to deeper hierarchies, nonparametric schema discovery, and complex multi-agent cognition (Maele et al., 26 Jan 2026, Collis et al., 2024, Tinguy et al., 2023). Empirically validated by behavioral and neural data, S-HAI establishes a unifying computational paradigm for schema-driven, hierarchical inference across cognitive science, neuroscience, AI, and robotics.