Direct comparison with Tango-2 (DPO)
Determine the comparative performance between EzAudio and Tango-2, a diffusion-based text-to-audio model optimized via Direct Preference Optimization (DPO), by conducting a direct empirical comparison of the two systems.
Sponsor
References
Since our model does not use Direct Preference Optimization (DPO), we leave a comparison with Tango-2 for future work.
— EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer
(2409.10819 - Hai et al., 17 Sep 2024) in Experiments, Subsection "Comparison with State-of-the-art" (footnote attached to "Tango")