Papers
Topics
Authors
Recent
Search
2000 character limit reached

Satisficing Mentalizing in Bayesian ToM

Updated 19 May 2026
  • Satisficing mentalizing is a framework that simplifies Bayesian inference by using computationally efficient heuristic models to infer mental states under bounded rationality.
  • The approach reduces complexity by clamping certain latent variables, with specialized models and an adaptive switching strategy that dynamically responds to prediction errors.
  • Empirical studies show that switching models achieve near-optimal predictive performance at significantly reduced runtimes compared to full Bayesian methods.

Satisficing mentalizing refers to an inference regime within Bayesian Theory of Mind (BToM) that prioritizes computational efficiency by balancing accuracy and runtime requirements when attributing mental states to agents based on observed behavior. This approach is motivated by the recognition that both humans and artificial systems often operate under bounded rationality, rendering fully Bayesian inference over all possible mental-state configurations intractable. Instead, satisficing mentalizing employs simplified or heuristic models that are computationally tractable and empirically sufficient for decision-making, particularly in real-time or resource-constrained settings (Pöppel et al., 2019).

1. Conceptual Foundations and Motivation

Satisficing mentalizing builds directly on the concept of bounded rationality as articulated by Simon, emphasizing the need to reconcile ideal inference with practical tractability. In BToM, the goal is to infer latent mental-state variables—such as an agent’s goal, belief about the world, and belief about locations or assignments—by observing their actions. Because the number of possible combinations of these latent states grows exponentially, exact Bayesian inference rapidly becomes infeasible. Satisficing mentalizing therefore comprises strategies that deliver inference quality “good enough” to guide behavior effectively, at a fraction of the computational cost required for full Bayesian updating. This is analogous to the use of heuristics in human cognition and practical artificial reasoning systems.

2. The Full Bayesian ToM Model

The full BToM model formalizes mentalizing inference as a joint posterior over the set of possible goals GG, goal-beliefs BgB_g, and world-beliefs BwB_w, conditioned on observed action histories:

  • GG: possible goals (e.g., four colored exits)
  • BgB_g: all possible assignments of colors to exit locations (24 configurations)
  • BwB_w: world-beliefs—either true-layout or an unknown layout (freespace assumption)
  • ata_t: action at time tt; at=(a1,,at)\vec{a}_t = (a_1, \ldots, a_t)

The model employs uniform priors and a Boltzmann policy for action likelihood:

P(at+1g,bg,bw)=exp[βU(at+1;g,bg,bw)]aAexp[βU(a;g,bg,bw)]P(a_{t+1} \mid g, b_g, b_w) = \frac{\exp[\beta U(a_{t+1}; g, b_g, b_w)]}{\sum_{a' \in A} \exp[\beta U(a'; g, b_g, b_w)]}

where BgB_g0 is negative distance-to-goal (using the agent’s current belief about the environment), and BgB_g1 is an inverse temperature parameter controlling rationality.

Posterior inference follows by Bayes’ rule:

BgB_g2

and marginalization yields the next-action prediction:

BgB_g3

Although statistically optimal, this workflow incurs exponential complexity with respect to the number of latent variable combinations ((Pöppel et al., 2019), Section 2).

3. Specialized Simplified Bayesian Models

To achieve tractability, specialized models clamp one or more mental-state variables to their true or default values, dramatically reducing inference complexity:

Model Clamped Variable(s) Complexity
True World & Goal (TWG) BgB_g4 BgB_g5 per action
True World (TW) BgB_g6 BgB_g7
True Goal (TG) BgB_g8 BgB_g9
  • TWG: Assumes agent holds true beliefs about both world and goal; sums only over possible goals.
  • TW: Agent’s belief about world is accurate, but goal-belief is variable.
  • TG: Agent’s goal-belief is accurate, but world-belief may be inaccurate; in path uncertainty conditions, always assumes a freespace world-belief model.

Under these restrictions, inference and action prediction entail only linear or bilinear summations, as opposed to full joint enumeration ((Pöppel et al., 2019), Section 3).

4. The Switching Approach and Surprise-Driven Adaptation

The switching approach dynamically selects between specialized models by monitoring prediction error (“surprise”) over observed actions:

  • Maintains a current model BwB_w0
  • After each action BwB_w1, computes BwB_w2, with options:
    • BwB_w3 (self-information)
    • BwB_w4 (Itti–Baldi style)
  • Cumulative surprise BwB_w5 is compared to a threshold BwB_w6; exceeding BwB_w7 triggers reevaluation of all specialized models’ cumulative surprise on the full action history, switching to the one with minimal total surprise.
  • The threshold BwB_w8 is increased (e.g., BwB_w9) after each switch to prevent flip-flopping.

Pseudocode for the switching protocol is explicitly detailed in (Pöppel et al., 2019) (Section 4). This mechanism avoids full Bayesian inversion while maintaining responsiveness to changes in uncertainty by adapting model complexity to observed behavior.

5. Computational Complexity and Predictive Performance

Empirical analysis demonstrates substantial computational gains for satisficing models, particularly the switching approach, with corresponding performance:

Model Mean Runtime (ms/action) Rel. Runtime to TWG Avg. Neg. Log-Likelihood GG0
TWG GG1 GG2 GG3
TG GG4 GG5 GG6
TW GG7 GG8 GG9
Switching BgB_g0 BgB_g1 BgB_g2
Full BToM BgB_g3 BgB_g4 BgB_g5

The switching approach achieves the lowest average negative log-likelihood of BgB_g6 across all 687 observed trajectories, outperforming the full model, with runtime an order of magnitude faster than the full Bayesian ToM. In head-to-head comparisons, the switching model outperforms the full model on BgB_g7 of trajectories, TG and TW on BgB_g8 each, and TWG on BgB_g9 ((Pöppel et al., 2019), Section 8).

6. Uncertainty Scenarios and Human Behavioral Correlates

Empirical validation relies on a human study with 110 participants solving maze tasks under three uncertainty conditions:

  • No Uncertainty (NU): Agent knows the maze layout and correct exit location; true beliefs about goal and world.
  • Destination Uncertainty (DU): Maze layout known, but exit color-location unknown until sighted; uncertainty over BwB_w0.
  • Path Uncertainty (PU): Exit known, but layout beyond a local radius hidden; uncertainty over BwB_w1.

Model fit (average BwB_w2) in each scenario:

Condition TWG TW TG Switching Full
NU 0.59 0.62 0.63 0.59 0.60
DU 0.80 0.65 0.61 0.61 0.68
PU 1.12 0.91 0.74 0.73 1.08

The switching model consistently achieves best or near-best fit, while maintaining efficient runtime.

Participants’ trajectories exhibited path optimality within 20% of the shortest possible path at rates of BwB_w3 (NU), BwB_w4 (DU), and BwB_w5 (PU), with step count over optimal at BwB_w6, BwB_w7, and BwB_w8 excess, respectively ((Pöppel et al., 2019), Section 7).

7. Satisficing Outcome and Implications

The empirical results indicate that satisficing mentalizing via the switching approach provides a compromise solution: it yields predictive accuracy closely matching that of the most specialized models for each uncertainty regime, adapts automatically to shifting uncertainty, and avoids the exponential computational burden of full-state Bayesian inference. This suggests that both human mentalizing and practical artificial systems may benefit from modular, adaptive inference architectures that leverage situation-dependent model selection without sacrificing responsiveness. The strong performance of the switching approach underpins its potential as a satisficing strategy for efficient, context-sensitive social reasoning (Pöppel et al., 2019).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Satisficing Mentalizing.