Satisficing Mentalizing in Bayesian ToM

Updated 19 May 2026

Satisficing mentalizing is a framework that simplifies Bayesian inference by using computationally efficient heuristic models to infer mental states under bounded rationality.
The approach reduces complexity by clamping certain latent variables, with specialized models and an adaptive switching strategy that dynamically responds to prediction errors.
Empirical studies show that switching models achieve near-optimal predictive performance at significantly reduced runtimes compared to full Bayesian methods.

Satisficing mentalizing refers to an inference regime within Bayesian Theory of Mind (BToM) that prioritizes computational efficiency by balancing accuracy and runtime requirements when attributing mental states to agents based on observed behavior. This approach is motivated by the recognition that both humans and artificial systems often operate under bounded rationality, rendering fully Bayesian inference over all possible mental-state configurations intractable. Instead, satisficing mentalizing employs simplified or heuristic models that are computationally tractable and empirically sufficient for decision-making, particularly in real-time or resource-constrained settings (Pöppel et al., 2019).

1. Conceptual Foundations and Motivation

Satisficing mentalizing builds directly on the concept of bounded rationality as articulated by Simon, emphasizing the need to reconcile ideal inference with practical tractability. In BToM, the goal is to infer latent mental-state variables—such as an agent’s goal, belief about the world, and belief about locations or assignments—by observing their actions. Because the number of possible combinations of these latent states grows exponentially, exact Bayesian inference rapidly becomes infeasible. Satisficing mentalizing therefore comprises strategies that deliver inference quality “good enough” to guide behavior effectively, at a fraction of the computational cost required for full Bayesian updating. This is analogous to the use of heuristics in human cognition and practical artificial reasoning systems.

2. The Full Bayesian ToM Model

The full BToM model formalizes mentalizing inference as a joint posterior over the set of possible goals $G$ , goal-beliefs $B_g$ , and world-beliefs $B_w$ , conditioned on observed action histories:

$G$ : possible goals (e.g., four colored exits)
$B_g$ : all possible assignments of colors to exit locations (24 configurations)
$B_w$ : world-beliefs—either true-layout or an unknown layout (freespace assumption)
$a_t$ : action at time $t$ ; $\vec{a}_t = (a_1, \ldots, a_t)$

The model employs uniform priors and a Boltzmann policy for action likelihood:

$P(a_{t+1} \mid g, b_g, b_w) = \frac{\exp[\beta U(a_{t+1}; g, b_g, b_w)]}{\sum_{a' \in A} \exp[\beta U(a'; g, b_g, b_w)]}$

where $B_g$ 0 is negative distance-to-goal (using the agent’s current belief about the environment), and $B_g$ 1 is an inverse temperature parameter controlling rationality.

Posterior inference follows by Bayes’ rule:

$B_g$ 2

and marginalization yields the next-action prediction:

$B_g$ 3

Although statistically optimal, this workflow incurs exponential complexity with respect to the number of latent variable combinations ((Pöppel et al., 2019), Section 2).

3. Specialized Simplified Bayesian Models

To achieve tractability, specialized models clamp one or more mental-state variables to their true or default values, dramatically reducing inference complexity:

Model	Clamped Variable(s)	Complexity
True World & Goal (TWG)	$B_g$ 4	$B_g$ 5 per action
True World (TW)	$B_g$ 6	$B_g$ 7
True Goal (TG)	$B_g$ 8	$B_g$ 9

TWG: Assumes agent holds true beliefs about both world and goal; sums only over possible goals.
TW: Agent’s belief about world is accurate, but goal-belief is variable.
TG: Agent’s goal-belief is accurate, but world-belief may be inaccurate; in path uncertainty conditions, always assumes a freespace world-belief model.

Under these restrictions, inference and action prediction entail only linear or bilinear summations, as opposed to full joint enumeration ((Pöppel et al., 2019), Section 3).

4. The Switching Approach and Surprise-Driven Adaptation

The switching approach dynamically selects between specialized models by monitoring prediction error (“surprise”) over observed actions:

Maintains a current model $B_w$ 0
After each action $B_w$ $B_{w}$ 1, computes $B_w$ $B_{w}$ 2, with options:
- $B_w$ 3 (self-information)
- $B_w$ 4 (Itti–Baldi style)
Cumulative surprise $B_w$ 5 is compared to a threshold $B_w$ 6; exceeding $B_w$ 7 triggers reevaluation of all specialized models’ cumulative surprise on the full action history, switching to the one with minimal total surprise.
The threshold $B_w$ 8 is increased (e.g., $B_w$ 9) after each switch to prevent flip-flopping.

Pseudocode for the switching protocol is explicitly detailed in (Pöppel et al., 2019) (Section 4). This mechanism avoids full Bayesian inversion while maintaining responsiveness to changes in uncertainty by adapting model complexity to observed behavior.

5. Computational Complexity and Predictive Performance

Empirical analysis demonstrates substantial computational gains for satisficing models, particularly the switching approach, with corresponding performance:

Model	Mean Runtime (ms/action)	Rel. Runtime to TWG	Avg. Neg. Log-Likelihood $G$ 0
TWG	$G$ 1	$G$ 2	$G$ 3
TG	$G$ 4	$G$ 5	$G$ 6
TW	$G$ 7	$G$ 8	$G$ 9
Switching	$B_g$ 0	$B_g$ 1	$B_g$ 2
Full BToM	$B_g$ 3	$B_g$ 4	$B_g$ 5

The switching approach achieves the lowest average negative log-likelihood of $B_g$ 6 across all 687 observed trajectories, outperforming the full model, with runtime an order of magnitude faster than the full Bayesian ToM. In head-to-head comparisons, the switching model outperforms the full model on $B_g$ 7 of trajectories, TG and TW on $B_g$ 8 each, and TWG on $B_g$ 9 ((Pöppel et al., 2019), Section 8).

6. Uncertainty Scenarios and Human Behavioral Correlates

Empirical validation relies on a human study with 110 participants solving maze tasks under three uncertainty conditions:

No Uncertainty (NU): Agent knows the maze layout and correct exit location; true beliefs about goal and world.
Destination Uncertainty (DU): Maze layout known, but exit color-location unknown until sighted; uncertainty over $B_w$ 0.
Path Uncertainty (PU): Exit known, but layout beyond a local radius hidden; uncertainty over $B_w$ 1.

Model fit (average $B_w$ 2) in each scenario:

Condition	TWG	TW	TG	Switching	Full
NU	0.59	0.62	0.63	0.59	0.60
DU	0.80	0.65	0.61	0.61	0.68
PU	1.12	0.91	0.74	0.73	1.08

The switching model consistently achieves best or near-best fit, while maintaining efficient runtime.

Participants’ trajectories exhibited path optimality within 20% of the shortest possible path at rates of $B_w$ 3 (NU), $B_w$ 4 (DU), and $B_w$ 5 (PU), with step count over optimal at $B_w$ 6, $B_w$ 7, and $B_w$ 8 excess, respectively ((Pöppel et al., 2019), Section 7).

7. Satisficing Outcome and Implications

The empirical results indicate that satisficing mentalizing via the switching approach provides a compromise solution: it yields predictive accuracy closely matching that of the most specialized models for each uncertainty regime, adapts automatically to shifting uncertainty, and avoids the exponential computational burden of full-state Bayesian inference. This suggests that both human mentalizing and practical artificial systems may benefit from modular, adaptive inference architectures that leverage situation-dependent model selection without sacrificing responsiveness. The strong performance of the switching approach underpins its potential as a satisficing strategy for efficient, context-sensitive social reasoning (Pöppel et al., 2019).

Markdown Report Issue Upgrade to Chat

References (1)

Satisficing Mentalizing: Bayesian Models of Theory of Mind Reasoning in Scenarios with Different Uncertainties (2019)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Satisficing Mentalizing.

Satisficing Mentalizing in Bayesian ToM

1. Conceptual Foundations and Motivation

2. The Full Bayesian ToM Model

3. Specialized Simplified Bayesian Models

4. The Switching Approach and Surprise-Driven Adaptation

5. Computational Complexity and Predictive Performance

6. Uncertainty Scenarios and Human Behavioral Correlates

7. Satisficing Outcome and Implications

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

Satisficing Mentalizing in Bayesian ToM

1. Conceptual Foundations and Motivation

2. The Full Bayesian ToM Model

3. Specialized Simplified Bayesian Models

4. The Switching Approach and Surprise-Driven Adaptation

5. Computational Complexity and Predictive Performance

6. Uncertainty Scenarios and Human Behavioral Correlates

7. Satisficing Outcome and Implications

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research