- The paper introduces a novel Meta-Causal Graph formalism that integrates latent meta states with distinct causal subgraphs to capture context-dependent dynamics in RL.
- It employs a curiosity-driven, intervention-based strategy that actively explores high-uncertainty regions, enabling effective learning under noisy observations and partial observability.
- Empirical results demonstrate improved out-of-distribution prediction accuracy and higher RL rewards, confirming the theoretical guarantees and practical benefits of the framework.
This paper introduces the Meta-Causal Graph (MCG) as a new formalism for world modeling in reinforcement learning, addressing the challenge of context-dependent causal dynamics in open-ended environments. The authors propose a curiosity-driven, intervention-based agent architecture that actively discovers and refines a set of latent meta states, each associated with a distinct causal subgraph, thereby enabling robust generalization and adaptability to previously unseen contexts.
Traditional world models in RL often assume a single, stationary causal structure, which is inadequate for environments where causal mechanisms shift as a function of latent or unobserved state variables. The paper formalizes this as the "narrow observational window" problem: what appears as non-stationary or drifting causality is often the result of observing only a subset of the true, context-dependent causal mechanisms. For example, the causal effect of "push" on "open" is contingent on the latent "locked/unlocked" state of a door.
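The door example can be made concrete as a data structure: a meta-causal graph is essentially a mapping from latent meta state to a causal adjacency matrix. The following is a minimal illustrative sketch (variable and function names are ours, not the paper's):

```python
import numpy as np

# Variables: index 0 = "push", index 1 = "open".
# Entry [i, j] = 1 means edge X_i -> X_j is active.
# Each latent meta state ("locked"/"unlocked") triggers its own subgraph.
meta_causal_graph = {
    "unlocked": np.array([[0, 1],   # push -> open holds
                          [0, 0]]),
    "locked":   np.array([[0, 0],   # push has no effect on open
                          [0, 0]]),
}

def active_subgraph(meta_state):
    """Look up the causal skeleton triggered by the current meta state."""
    return meta_causal_graph[meta_state]

# An observer who only ever sees the "locked" context would wrongly
# conclude that push never causes open.
print(active_subgraph("unlocked")[0, 1])  # 1: edge present
print(active_subgraph("locked")[0, 1])    # 0: edge absent
```

Observing trajectories from only one meta state is exactly the "narrow observational window": the pooled data looks non-stationary even though each subgraph is fixed.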
The authors identify two key limitations in prior work:
- Uniform causal graphs cannot capture context-dependent causal relationships.
- Lack of active, curiosity-driven exploration impedes the discovery of global causal rules in open-ended environments.
The Meta-Causal Graph is defined as a minimal, unified representation comprising multiple causal subgraphs, each triggered by a latent meta state. The mapping from observed state to meta state is learned, and each meta state is associated with a distinct causal skeleton matrix. Theoretical results establish that:
- Meta states are identifiable up to label permutation (swap-label equivalence) under mild assumptions on the data-generating process.
- Overparameterization of the meta state space does not harm identifiability; redundant meta states are consolidated, and the true causal subgraphs are still recovered.
- Causal subgraphs are uniquely identifiable if interventions are chosen such that, for every edge, there exists an intervention targeting only one of its endpoints.
These results are supported by rigorous proofs and illustrative examples, including reachability analysis for interventions under practical constraints.
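The edge-identifiability condition can be checked mechanically: every edge needs some intervention set containing exactly one of its endpoints. A hedged sketch (the helper `covers_all_edges` is our illustration, not the paper's algorithm):

```python
def covers_all_edges(edges, interventions):
    """Return True if every edge (i, j) has some intervention set that
    contains exactly one of its endpoints, which is what lets the edge
    be disambiguated from interventional data."""
    for i, j in edges:
        if not any((i in s) != (j in s) for s in interventions):
            return False
    return True

edges = [(0, 1), (1, 2)]
# Single-node interventions on X_0 and X_1 cover both edges.
print(covers_all_edges(edges, [{0}, {1}]))  # True
# A joint intervention on {0, 1} hits both endpoints of edge (0, 1),
# so that edge is not isolated.
print(covers_all_edges(edges, [{0, 1}]))    # False
```

This also makes the reachability caveat visible: if some required single-endpoint intervention is infeasible in practice, the corresponding edge stays unidentified.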
Curious Causality-Seeking Agent: Architecture and Learning
The agent architecture consists of three main components:
- Meta-State Discovery: Observed states are embedded and assigned to meta states via vector quantization. Each meta state is represented by a learnable codebook embedding.
- Causal Subgraph Learning: For each meta state, a decoder predicts the probability matrix of the causal skeleton, with discrete structure learning enabled by Gumbel-Softmax reparameterization. The learning objective includes a sparsity regularizer to encourage parsimonious graphs.
- Curiosity-Driven Interventional Exploration: The agent is rewarded for interventions that maximize the entropy of the posterior over causal relationships, driving exploration toward regions of high causal uncertainty. Interventional verification is used to estimate causal effects and refine the learned graphs.
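The first two components can be sketched in NumPy. This is a toy illustration under our own simplifications (random arrays stand in for the paper's learned encoder and decoder networks):

```python
import numpy as np

rng = np.random.default_rng(0)
K, d, n = 4, 8, 3                    # codebook size, embedding dim, num variables
codebook = rng.normal(size=(K, d))   # learnable meta-state embeddings e_u

def assign_meta_state(z):
    """Vector quantization: index of the nearest codebook entry to
    the encoded state z."""
    return int(np.argmin(((codebook - z) ** 2).sum(axis=1)))

def sample_skeleton(logits, tau=0.5):
    """Gumbel-Softmax relaxation over each edge's {absent, present}
    logits, yielding a soft binary causal skeleton matrix."""
    g = -np.log(-np.log(rng.uniform(size=logits.shape)))  # Gumbel noise
    soft = np.exp((logits + g) / tau)
    soft /= soft.sum(axis=-1, keepdims=True)
    return soft[..., 1]              # mass assigned to "edge present"

z = rng.normal(size=d)               # stand-in for the encoder output E(x)
u = assign_meta_state(z)             # discrete meta-state index
edge_logits = rng.normal(size=(n, n, 2))  # stand-in for the decoder D(e_u)
M = sample_skeleton(edge_logits)     # relaxed n x n skeleton for meta state u
```

The Gumbel-Softmax relaxation keeps skeleton sampling differentiable during training; at low temperature `tau` the soft matrix approaches a hard 0/1 structure.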
The overall optimization objective integrates maximum likelihood estimation for transition dynamics, sparsity regularization, causal mask loss from interventions, and quantization loss for codebook updates. Training alternates between exploration, intervention, graph updating, and world model learning.
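The curiosity reward described above can be illustrated as the entropy of a Bernoulli posterior over each candidate edge, summed over the graph. This is a simplified sketch (the paper defines the reward over interventional posteriors, which we approximate here with per-edge existence probabilities):

```python
import numpy as np

def edge_entropy(p):
    """Entropy of a Bernoulli edge-existence posterior, in nats."""
    p = np.clip(p, 1e-12, 1 - 1e-12)
    return -(p * np.log(p) + (1 - p) * np.log(1 - p))

def curiosity_reward(edge_probs):
    """Total epistemic uncertainty of the causal skeleton. Interventions
    expected to reduce this quantity most are preferred."""
    return float(edge_entropy(edge_probs).sum())

confident = np.array([[0.01, 0.99], [0.99, 0.01]])  # near-resolved edges
uncertain = np.array([[0.5, 0.5], [0.5, 0.5]])      # maximally unresolved
print(curiosity_reward(uncertain) > curiosity_reward(confident))  # True
```

The reward is maximal when every edge posterior sits at 0.5, which is what drives the agent toward regions whose causal structure it has not yet resolved.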
Pseudocode Outline
```
while not converged:
    # 1. Assign meta state via vector quantization
    u = argmin_u ||E(x) - e_u||^2
    # 2. Predict causal skeleton for meta state
    M_hat = D(e_u)
    M = sample_gumbel_softmax(M_hat)
    # 3. Select intervention maximizing curiosity reward
    i = select_intervention(R(do(X_i = x_i')))
    # 4. Perform intervention, observe transitions
    x_next = environment.step(do(X_i = x_i'))
    # 5. Estimate causal effects, update mask loss
    delta_ij = log P(X_j' | do(X_i)) - log P(X_j' | X)
    update_mask_loss(delta_ij, M_hat)
    # 6. Update world model and codebook
    update_world_model(M, x, x_next)
    update_codebook(E, e_u)
```
Empirical Evaluation
Experiments are conducted on both synthetic (Chemical) and robotic (Magnetic) environments, with the following findings:
- Prediction Accuracy: The proposed method achieves the highest out-of-distribution prediction accuracy under increasing noise and partial observability, outperforming GNN, MLP, and recent causal discovery baselines.
- Downstream RL Performance: The agent achieves higher average episode rewards in model-based RL tasks, demonstrating the practical utility of accurate causal world modeling.
- Ablation Studies: Removal of the curiosity-driven reward or intervention verification leads to significant performance degradation, confirming the necessity of both components.
- Overparameterization: Empirical results confirm the theoretical claim that overparameterizing the meta state space does not degrade performance; redundant codes are mapped to similar causal subgraphs.
Implications and Future Directions
Practical Implications:
- The MCG framework enables agents to autonomously discover and adapt to context-dependent causal structures, a critical capability for robust RL in non-stationary, open-ended environments.
- The curiosity-driven intervention policy provides a principled mechanism for efficient exploration, focusing data collection on regions of high epistemic uncertainty in the causal model.
- The approach is compatible with high-dimensional, partially observed environments, as demonstrated in robotic manipulation tasks.
Theoretical Implications:
- The identifiability results provide guarantees for causal discovery in the presence of latent context variables and overparameterized representations.
- The formalization of intervention reachability offers a foundation for analyzing the limits of causal discovery under practical constraints.
Limitations and Open Problems:
- The framework assumes that all relevant meta states are reachable via interventions, which may not hold in real-world settings with irreversible or inaccessible states.
- Interventions incur costs, and the current method does not explicitly optimize for budgeted exploration.
- The approach relies on discrete meta state assignments; extending to continuous or hierarchical meta-causal structures is a promising direction.
Speculation on Future Developments:
- Integration with large-scale, foundation world models and language agents could enable more generalizable and interpretable causal reasoning in embodied AI.
- Budget-aware and risk-sensitive exploration strategies will be necessary for deployment in safety-critical domains.
- The MCG formalism may serve as a foundation for open-ended, self-improving agents capable of continual causal discovery and adaptation.
Conclusion
The Meta-Causal Graph and Curious Causality-Seeking Agent framework represent a significant advance in causal world modeling, providing both theoretical guarantees and practical algorithms for context-dependent causal discovery. The empirical results demonstrate strong performance in both synthetic and real-world tasks, and the formalism opens new avenues for research in robust, adaptive, and interpretable reinforcement learning.