Dice Question Streamline Icon: https://streamlinehq.com

Experimental confirmation of compute-bound scaling on non-H100 accelerators

Establish experimentally whether inference for the WAN2.1-T2V text-to-video diffusion model exhibits a compute-bound regime and the same quadratic (in spatial resolution and frame count) and linear (in denoising steps) scaling laws on hardware accelerators other than the NVIDIA H100 SXM, for realistic token lengths.

Information Square Streamline Icon: https://streamlinehq.com

Background

The paper validates a compute-bound analytical model and associated scaling laws for WAN2.1-T2V on an NVIDIA H100 SXM, showing quadratic scaling with spatial and temporal dimensions and linear scaling with denoising steps.

Although Appendix results suggest these behaviors should generalize across accelerators, the authors explicitly acknowledge that such generalization has not been experimentally confirmed, identifying a gap that requires multi-accelerator measurement to verify the model’s portability.

References

Energy measurements were conducted on a single hardware platform (NVIDIA H100 SXM). While Appendix~\ref{sec:compute_bound_threshold} shows that the compute-bound regime and associated scaling trends should extend to other accelerators for realistic token lengths, this remains to be confirmed experimenta.

Video Killed the Energy Budget: Characterizing the Latency and Power Regimes of Open Text-to-Video Models (2509.19222 - Delavande et al., 23 Sep 2025) in Section: Limitations and Conclusion — Limitations paragraph