Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 71 tok/s
Gemini 2.5 Pro 48 tok/s Pro
GPT-5 Medium 23 tok/s Pro
GPT-5 High 17 tok/s Pro
GPT-4o 111 tok/s Pro
Kimi K2 161 tok/s Pro
GPT OSS 120B 412 tok/s Pro
Claude Sonnet 4 35 tok/s Pro
2000 character limit reached

Contingency-Aware Diffusion Planner (CoPlanner)

Updated 28 September 2025
  • The paper introduces a contingency-aware planner that integrates a pivot-conditioned diffusion mechanism with multi-scenario risk scoring for robust multi-agent trajectories.
  • It employs a two-stage process by first generating a safety-validated pivot segment and then branching into diverse, long-term trajectories to ensure interaction-aware, socially consistent planning.
  • Empirical evaluations on nuPlan show improved trajectory smoothness, reduced collisions, and enhanced safety in complex driving scenarios.

A contingency-aware diffusion planner (“CoPlanner” as described in (Zhong et al., 21 Sep 2025)) is an integrated generative planning framework that jointly addresses multi-agent interactive trajectory generation and robust decision-making under scenario uncertainty. CoPlanner is designed to address critical limitations in existing generation-then-evaluation motion planning systems, particularly their tendency to adopt a single most likely outcome and their inability to retain fallback trajectories essential for safety in rare but critical scenarios. CoPlanner incorporates a pivot-conditioned diffusion mechanism to ensure consistent, validated immediate behavior, and evaluates candidate trajectories across a spectrum of plausible long-term outcomes using a multi-scenario scoring strategy. This elevates robustness against multi-modal future uncertainty and enables interaction-aware, socially consistent planning for autonomous driving in complex environments.

1. Pivot-Conditioned Diffusion Mechanism

At the heart of CoPlanner is the pivot-conditioned diffusion process that divides planning into two temporally coupled components:

  • Short-Term Shared “Pivot” Segment: For all agents, the planner first generates a short-term segment (0 … t_b) that is validated to satisfy safety and traffic rule compliance. This segment forms an “anchor” for all downstream plans.
  • Branching Long-Term Futures: Conditional on the selected validated pivot, CoPlanner stochastically generates a set of diverse, long-horizon branches for both the ego-agent and surrounding agents. These diverse futures capture the multi-modality of plausible scene evolutions beyond the immediate past.

Mathematically, the joint distribution over multi-agent trajectories Λ is factorized as:

p(ΛC)=pθ(SC)pϕ(BS,C)p(\Lambda \mid C) = p_\theta(S \mid C) \, p_\phi(B \mid S, C)

where SS (the pivot) is the joint short-term segment, BB the set of long-term branches, and CC the conditioning context.

Sampling employs a temporal mask m(t)=1[ttb]m(t) = 1[t \leq t_b] during reverse diffusion. At each step:

  • For ttbt \leq t_b, the known anchor is re-noised (to preserve the pivot).
  • For t>tbt > t_b, the normal denoising process is applied, allowing multimodal branching.

This technique ensures all candidates remain temporally anchored to a safety-validated, interaction-consistent short-term evolution, while retaining diversity over plausible long-term developments.

2. Contingency-Aware Multi-Scenario Scoring

To select robust plans, CoPlanner abandons single-hypothesis selection in favor of multi-scenario evaluation:

  • For each candidate short-term anchor, multiple long-term branches are generated via the pivot-conditioned diffusion model.
  • Each complete candidate plan is evaluated both on the shared segment and across all generated branches, using a composite score that captures safety, progress, comfort, and rule compliance.

Let J(τ0Λ,C)J(\tau^0 \mid \Lambda, C) be the cost for ego plan τ0\tau^0 under long-term multi-agent realization Λ\Lambda and context CC, decomposed as:

J(τ0Λ,C)=Jshared(τ0:tb0Λ,C)+Jbranch(τtb:T0Λ,C)J(\tau^0 \mid \Lambda, C) = J_{shared}(\tau^0_{0:t_b} \mid \Lambda, C) + J_{branch}(\tau^0_{t_b:T} \mid \Lambda, C)

For each candidate, the risk-aggregated score across NN branches is:

Jk(τ0)=Jshared()+R({Jbranch()}n)J_k(\tau^0) = J_{shared}(\cdot) + R(\{J_{branch}(\cdot)\}_n)

where RR is a risk aggregator (e.g., equal-weight mean). The plan minimizing this overall score is selected as the robust, contingency-aware trajectory.

3. Technical Architecture and Diffusion Model

CoPlanner employs a joint generative model for multi-agent trajectories in a discrete-time, variance-preserving diffusion process. The model backbone is a diffusion transformer (DiT), which enables joint modeling of all dynamic agents (ego plus surroundings).

The training objective is an x(0)x^{(0)}-prediction loss under the VP schedule:

Lθ=Ex(0),h,x(h)μθ(x(h),h,C)x(0)22L_\theta = \mathbb{E}_{x^{(0)}, h, x^{(h)}} \left\| \mu_\theta(x^{(h)}, h, C) - x^{(0)} \right\|_2^2

Reverse diffusion is adapted to accept the pivot-masked temporal conditioning. The model ensures that the trajectory generation for all agents is coupled during the anchor segment and allowed to diversify beyond.

4. Integration of Prediction and Planning Under Interaction

Crucially, CoPlanner integrates multi-agent prediction and ego motion planning into a unified generative framework:

  • All joint trajectories (ego + agents) are sampled together, ensuring social consistency (i.e., the planned ego action does not conflict with the simultaneously forecasted neighbor actions).
  • The pivot segment validation guarantees that near-term execution is robust—no plan can “drift away” into unsafe or implausible behavior immediately after re-planning, a critical requirement in highly interactive, uncertain road scenes.
  • The diverse branch generation preserves fallback options: if the realized future diverges from the likely prediction, the system can rely on pre-evaluated alternative continuations.

Unlike conventional decoupled architectures—where planning passively reacts to exogenous predictions—CoPlanner’s joint modeling and evaluation actively anticipates interactive evolution across the ensemble of sampled futures.

5. Empirical Performance on nuPlan and Safety Outcomes

CoPlanner was benchmarked in closed-loop (planning-in-the-loop) evaluations on the nuPlan driving dataset. Performance results highlight the utility of the contingency-aware, pivot-conditioned approach:

  • On Val14 and Test14, as well as Test14-hard subsets, CoPlanner achieved higher overall scores across both non-reactive (log-replay) and reactive (IDM controller) settings than state-of-the-art baselines (Diffusion Planner, UrbanDriver, PDM).
  • Notable improvements were observed in:
    • Fewer collisions and increased time-to-collision (TTC)
    • Greater trajectory smoothness and driver comfort metrics
    • Robust operation in challenging scenarios involving rare and adversarial behaviors (due to fallback-preserving branch structure)
  • The two-stage sampling and risk-aggregated evaluation directly contributed to these robustness gains across both familiar and previously unseen test conditions.

6. Practical Implications and Extensions

CoPlanner’s architecture provides several practical advantages for safety-critical real-world deployment:

  • Preservation of fallback options: The ensemble of plausible long-term continuations ensures that if traffic evolves unexpectedly, alternative, pre-validated strategies remain available.
  • Social consistency: By jointly generating multi-agent scenarios, the system avoids common conflicts and unrealistic downstream plans introduced by exogenously predicted agent behavior.
  • Flexible risk aggregation: The scoring framework supports diverse risk measures and could, with extension, incorporate distributional uncertainty, learned risk attitudes, or dynamically chosen scenario weights.
  • Compatibility with physical constraints: The design can accommodate future improvements—such as incorporating feasibility-aware guidance, physics-informed priors, or adaptive branching-points (t_b)—to further align generated plans with vehicle dynamics and scene semantics.

Potential future directions include adaptively setting the short-term branching time per situation, learning scenario weights for better uncertainty calibration, and broader closed-loop evaluation with diverse and more adversarial reactive agent sets.

7. Mathematical Formulation Table

Component Mathematical Expression Description
Short/Long Decomposition p(ΛC)=pθ(SC)pϕ(BS,C)p(\Lambda | C) = p_\theta(S | C) \, p_\phi(B | S, C) Joint future as anchor + branches
Risk-Aggregated Cost Jk(τ0)=Jshared()+R({Jbranch()}n)J_k(\tau^0) = J_{shared}(\cdot) + R(\{J_{branch}(\cdot)\}_n) Selection over multi-scenario cost
Diffusion Loss Lθ=Ex(0),h,x(h)μθ(x(h),h,C)x(0)22L_\theta = \mathbb{E}_{x^{(0)}, h, x^{(h)}} \left\| \mu_\theta(x^{(h)}, h, C) - x^{(0)} \right\|_2^2 Model objective under VP schedule
Pivot Masking m(t)=1[ttb]m(t) = 1[t \leq t_b] Time mask for anchor/branch split

These formalizations anchor CoPlanner’s technical contributions and establish its foundation as a contingency-aware, interaction-consistent, and robust framework for autonomous motion planning.


CoPlanner’s interactive, scenario-robust design represents a major advancement toward closing the gap between offline generative modeling and online, safety-critical execution in autonomous driving systems (Zhong et al., 21 Sep 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)
Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Contingency-Aware Diffusion Planner (CoPlanner).