Bayesian Symbolic Agent (EMPA)
- Bayesian Symbolic Agent (EMPA) is a computational framework that integrates symbolic reasoning with Bayesian inference by modeling logical formulas as random variables influenced by empirical data.
- The architecture leverages a tunable abstraction parameter to interpolate between classical, empirical, and paraconsistent reasoning, enhancing flexibility across various applications.
- EMPA employs scalable Bayesian updates and active exploration, with neuro-symbolic variants demonstrating robust performance in cognitive modeling and robotics.
A Bayesian Symbolic Agent (EMPA) is a computational framework unifying symbolic reasoning and Bayesian probabilistic inference. EMPA treats logical formulas as random variables whose truth depends on latent, data-driven models. By tuning a single abstraction parameter, EMPA interpolates between classical logic, empirical generalization, paraconsistent reasoning, and fully probabilistic forms of inference. This architecture provides a generic foundation for constructing agents capable of learning and robust reasoning over symbolic structures derived from empirical data, applicable across domains from cognitive modeling to robotics (Kido, 2024, Arriaga et al., 10 Jun 2025, Andersen et al., 2017).
1. Theoretical Foundations and Formalism
The conceptual core of EMPA is a single probabilistic abstraction principle: symbolic knowledge arises by treating the truth of logical formulas as random variables determined by latent states (models) supported by observed data (Kido, 2024). The basic setting consists of:
- A finite propositional language whose atoms admit complete truth-value assignments (models ).
- A corpus of data , where each datum uniquely supports a model .
- Logical formulas are associated with random variables whose value is $1$ (true) or $0$ (false), depending on the sampled model.
EMPA defines several consequence relations, all unified under its probabilistic abstraction:
- Classical consequence: if every model satisfying 0 satisfies 1.
- Empirical consequence: 2 if all possible models of 3 (i.e., those with 4) are also possible models of 5.
- Paraconsistent and parapossible reasoning: When 6 is inconsistent or impossible, inference is performed over all maximal consistent (MCS) or maximal possible subsets (MPS) of 7 respectively, to recover robust reasoning in the presence of contradiction or impossibility.
2. Mathematical and Algorithmic Structure
The generative model underpinning EMPA introduces:
- A hidden world-state random variable 8 with empirical likelihood 9, where 0 is the number of data supporting 1.
- Formula likelihood: for each formula 2, 3, where 4 is 5 if 6, 7 otherwise.
- Marginals and conditionals: 8; 9.
The key abstraction parameter 0 offers a spectrum:
- 1 recovers deterministic logic;
- 2 yields statistical inference.
Bayesian update is achieved via 3 on observing new data (Kido, 2024).
The high-level loop comprises:
- Data acquisition (observe 4).
- Model update (5; 6).
- Formula update (compute 7).
- Query answering (compute 8, applying special-casing for paraconsistency/parapossibility).
3. Reasoning Modes and Unification
By tuning 9 and considering different consistency regimes, EMPA integrates several reasoning paradigms:
- Classical entailment via supported and consistent premises (0).
- Statistical inference when 1.
- Paraconsistent reasoning (contradictory premises, 2) via MCS.
- Parapossible reasoning (impossible premises, 3) via MPS.
This flexibility allows graded, robust, and data-driven symbolic inference, with inference over largest coherent subsets providing resilience to contradiction and incompleteness. Selective attention and abstraction via maximal subsets is analogous to cognitive strategies observed in humans (Kido, 2024).
4. EMPA Variants and Symbolic Model Learning
Multiple instantiations of EMPA have been proposed:
A. Active Exploration for Symbolic Representation Learning
EMPA supports online construction and Bayesian refinement of symbolic state spaces and option models. The agent maintains a posterior 4 over symbolic models; each model 5 specifies clustered symbolic state factors and action precondition/effect distributions. Bayesian parameter updates employ Beta and sparse Dirichlet priors, with posterior factorization across options and symbolic state partitions.
Active exploration is guided by maximizing expected information gain (reduction in posterior standard deviation 6), implemented via Monte-Carlo Tree Search (MCTS) for tractable policy optimization (Andersen et al., 2017).
Empirical evaluation in benchmark domains (Asteroids, Treasure Game) demonstrates data efficiency and coverage that significantly surpass random or greedy exploration baselines, requiring about one-third as many action executions to achieve equivalent symbolic model completeness.
B. Neuro-symbolic Hybrid EMPA
In robotic learning contexts, EMPA is instantiated as a modular agent unifying:
- Differentiable-physics world model 7,
- Bayesian inference over physical parameters 8,
- Meta-learning for rapid task adaptation (MAML-style optimization over task batches),
- Symbolic program synthesis layer for high-level action composition.
Probabilistic evaluation of symbolic preconditions and postconditions is grounded in current posteriors over continuous latent states, and symbolic planning is adapted to explicit model uncertainty (e.g., via RJMCMC-based synthesis). The integrated planning/inference loop propagates Bayesian posteriors as signals for uncertainty-aware high-level control (Arriaga et al., 10 Jun 2025).
5. Scalability and Computational Features
Algorithmic efficiency in EMPA is achieved by structuring probabilistic inference around data-supported models rather than all possible truth assignments, reducing the computational burden from 9 to 0, where 1 is the number of observed data points and 2 the formula set size (Kido, 2024). In symbolic model construction, the factorization of posteriors and the use of conjugate updates enable closed-form Bayesian refinement, scalable to online learning scenarios (Andersen et al., 2017).
Meta-learning variants leverage batch optimization across tasks and amortized inference, facilitating rapid adaptation without excessive retraining (Arriaga et al., 10 Jun 2025).
6. Extensions and Practical Implications
EMPA accommodates several extensions:
- Lifted inference over first-order logics via tractable model-counting.
- Continuous latent-state handling through approximate inference techniques (e.g., variational Bayes, HMC).
- Integration with utility-theoretic decision-making, facilitating probabilistic symbolic planning and active data acquisition.
- Domain-specific leveraging of symbolic abstraction for efficient robotic world modeling and adaptive planning (Kido, 2024, Arriaga et al., 10 Jun 2025).
A plausible implication is that, by formalizing symbolic abstraction as a statistical process over data-dependent latent states, EMPA provides a unified substrate for constructing agents with flexible, scalable, and robust symbolic reasoning, directly linking empirical observations to high-level logical operations.
7. Empirical Results and Domain Applications
EMPA’s empirical performance has been quantitatively validated in symbolic exploration tasks and conceptualized for use in robotics:
- In game domains, EMPA achieves faster coverage of symbolic transition spaces and more efficient exploration than undirected baselines, as measured by fraction of unobserved symbolic transitions and successful attainment of rare object states (Andersen et al., 2017).
- In neuro-symbolic robotics, EMPA-based architectures are proposed to enable uncertainty-aware symbolic planning and adaptive generalization, integrating learned physical models, meta-learned residuals, and probabilistic symbolic program synthesis for interpretable and data-efficient control in novel environments (Arriaga et al., 10 Jun 2025).
These results support EMPA’s suitability as a principled and extensible agent framework for diverse symbolic reasoning and learning tasks.