Papers
Topics
Authors
Recent
Search
2000 character limit reached

Zero Belief History (ZBH)

Updated 19 May 2026
  • Zero Belief History (ZBH) is defined as a belief state derived exclusively from the current context, omitting any reliance on past observations.
  • ZBH is applied in machine Theory of Mind benchmarks, such as the 'Pick the Right Stuff' task, to assess immediate perspective-taking in LLMs.
  • In dynamic economic models, ZBH highlights scenarios where agents’ zero-probability events drive subjective asset price bubbles due to belief heterogeneity.

Zero Belief History (ZBH) denotes settings in which an agent’s current belief state is conditioned solely on present context, with no access to or requirement for prior observation history. ZBH is foundational in theoretical models spanning machine Theory of Mind (ToM) and economic equilibrium under belief heterogeneity. In the context of LLMs, ZBH tasks operationalize the minimal challenge for perspective-taking: inferring what another entity believes “right now,” without reference to any past sequence of observations or events. In financial equilibrium theory, ZBH identifies the set of scenarios an agent considers impossible under their subjective probability measure, directly generating subjective asset price bubbles when agents disagree on zero-probability events.

1. Formal Definition and Mathematical Foundations

ZBH is defined over environments with discrete time steps t=0,1,2,t = 0,1,2,\ldots. At each time tt, the world is described by state sts_t, context CtC_t (all perceptual or observational data available at tt), and an agent AA with belief btab_t^a, which is a probability distribution or proposition set over possible world states. For LLMs, btLb_t^L is the model’s internal “belief” state.

In full generality, agent AA’s belief depends on the entire observation history Hta={o0a,o1a,,ota}H_t^a = \{o_0^a, o_1^a, \dots, o_t^a\}:

tt0

where tt1 is the observation at time tt2.

ZBH imposes the constraint:

tt3

that is, tt4 is computable solely from tt5; tt6 is independent of all prior contexts tt7.

A query in ZBH asks, for proposition tt8: “Does agent tt9 believe sts_t0 at sts_t1?”—with the answer sts_t2 computed from sts_t3 alone. There is no dependence on the “belief gap” created by unobserved or recalled prior observations (Tang et al., 2024).

2. Relation to Finite and Infinite Belief Histories

ZBH contrasts sharply with two broader classes:

Setting Data Used for Belief Inference Formal Belief Function
ZBH Only current context sts_t4 sts_t5
Finite Belief History (FBH) Bounded sts_t6-length prefix sts_t7 sts_t8
Infinite Belief History (IBH) Entire history sts_t9 or unbounded implicit structure CtC_t0

In FBH, the agent’s belief is chained through a fixed window of CtC_t1 previous observations. In IBH, the agent integrates information over unbounded or procedural pasts. ZBH is uniquely characterized by the absence of any explicit history term in the updating mechanism (Tang et al., 2024).

3. ZBH in Machine Theory of Mind: The "Pick the Right Stuff" Benchmark

Tang & Belle (Tang et al., 2024) instantiate ZBH using a multi-round text-based task—“Pick the Right Stuff”—where the LLM acts as a warehouse manager tracking users’ beliefs about object locations. Each round contains:

  • Initial setup: N users each place items in numbered slots (Room 1: Opaque Locker); Room 2 (Monitoring Room) contains a monitor reflecting the current locker layout.
  • Randomized shuffling: The locker malfunctions and items’ slots are randomly reassigned; the monitor updates live.
  • Partial observation: Users may or may not re-enter Room 2 to update their beliefs.
  • Belief query: When a user returns to retrieve an item, the LLM is queried: which slot does the user believe contains their item?
  • Scoring: 1 point is assigned per correct prediction; rounds continue until all items are retrieved.

In pure ZBH trials, the answer is computable from knowledge of the current locker state and the list of users who have or haven’t observed the monitor since the last shuffle. No prior observations or composite event tracking is required. Example: if user 1 never reentered after two shuffles, the LLM predicts that user 1 still believes in the last configuration seen (Tang et al., 2024).

4. Evaluation Methods and Empirical Results

Six LLMs, spanning parameter sizes from 7B to 72B, were evaluated on the ZBH benchmark: gpt-3.5-turbo (CtC_t26B params), llama3:70b-instruct (70B), qwen:72b-chat (72B), gemma:7b-instruct (7B), mistral:7b-instruct (7B), and qwen:7b-chat (7B). Each model participated in 60-turn runs, with 5 users per game. The principal metric was average points per turn.

Results:

Model ZBH Score FBH Score
gemma:7b-instruct 43.00 34.33
mistral:7b-instruct 40.00 30.67
qwen:7b-chat 34.33 25.33
llama3:70b-instruct 31.00 28.33
gpt-3.5-turbo 30.67 25.33
qwen:72b-chat 28.33 25.33

Conclusions:

  • All models scored higher on ZBH than FBH (mean gap ≈5.73), corroborating that ZBH tasks are computationally and cognitively simpler for current LLMs.
  • Several 7B-parameter models (gemma and mistral) outperformed the 70B–72B models, indicating that inductive biases and pretraining regimen—not merely scale—affect ToM-related capabilities (Tang et al., 2024).

5. ZBH in Dynamic Economic Models: Subjective Bubbles

The economic interpretation of Zero-Belief History is developed in Larsson’s model of dynamic equilibrium with belief heterogeneity (Larsson, 2013). Here, for agent CtC_t3, CtC_t4 is a subjective probability measure (absolutely continuous with respect to the reference measure CtC_t5), with density process:

CtC_t6

Define the zero-belief time (“bankruptcy time”):

CtC_t7

Any path CtC_t8 with CtC_t9 is a zero-belief history for agent tt0.

In this model, asset prices tt1 decompose into fundamental and bubble components:

tt2

where tt3 is agent tt4’s subjective fundamental value, and the bubble component is

tt5

Zero-belief histories—scenarios post-tt6 that agent tt7 excludes as impossible—drive the subjective bubble: the market price tt8 incorporates cash flows on histories agent tt9 ignores in their computation of AA0. As a result, agents with different zero-belief times (null sets) compute different bubbles (Larsson, 2013).

6. Significance, Insights, and Implications

In LLM-based ToM research, ZBH delineates the boundary between immediate, context-based inference and more complex, temporally extended perspective-taking. High LLM scores on ZBH indicate that pretraining confers the ability to simulate first-order false beliefs (“I last observed X”) using only present data.

The observable drop in performance from ZBH to FBH tasks demonstrates concrete limitations: LLMs face challenges chaining over multiple prior beliefs or integrating several observation slices. This highlights a performance bottleneck in multi-step reasoning that is not explained by parameter count alone—smaller models sometimes outperform larger ones.

ZBH provides a tractable and analytically clean class of benchmarks for comparative evaluation of ToM capabilities. Extensions to FBH and IBH allow for systematic stress-testing of historical reasoning, perspective memory, and procedural belief computation. The paradigm underpins both AI safety (robust perspective-tracking in real-world systems) and cognitive modeling (delineating the structure of belief inference in agents).

In dynamic financial equilibrium, ZBH identifies the event sets (null sets) on which subjective probability measures diverge. Disagreement about these histories is sufficient, without trading or portfolio restrictions, to generate subjective equilibrium asset price bubbles. Consequently, ZBH formalizes the informational “blind spots” that yield persistent, agent-dependent deviations between fundamental and observed market values.

Future work includes diversification of ZBH environments to encompass social rules, mathematical proof structures, and implicit procedural chains, with the aim of mapping the limits and strengths of both artificial and human theory of mind reasoning (Tang et al., 2024, Larsson, 2013).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Zero Belief History (ZBH).