Scope of modeling future uncertainty in video understanding

Determine how far video understanding models should extend beyond deterministic prediction to represent and reason about future uncertainty, including modeling multiple plausible futures and integrating uncertainty into planning and decision-making processes.

Background

The paper argues that real-world environments are fundamentally uncertain and that video understanding should support decision-making under uncertainty, not just deterministic prediction.

This motivates extending models to reason over multiple plausible futures, evaluate goals and costs, and operate effectively in embodied, online settings.

References

Beyond deterministic objectives such as video labeling, an open question is how far video understanding should extend toward understanding future uncertainty.

Video Understanding: From Geometry and Semantics to Unified Models  (2603.17840 - An et al., 18 Mar 2026) in Third outlook point, Section 6 (Conclusion and Outlook)