Measuring the energy contribution of audio generation in production T2V systems
Ascertain the energy cost attributable to audio generation components within production text-to-video systems that also synthesize audio, and quantify their contribution relative to the video generation pipeline under realistic usage.
References
Finally, many production T2V systems (e.g., Veo) also generate audio, whose contribution to energy cost remains unexplored.
— Video Killed the Energy Budget: Characterizing the Latency and Power Regimes of Open Text-to-Video Models
(2509.19222 - Delavande et al., 23 Sep 2025) in Section: Limitations and Conclusion — Limitations paragraph