Improve adversarial post-training for audio generative models
Determine how adversarial post-training can be improved and effectively applied to text-conditional audio generation using gaussian flow-based generative models (diffusion models and rectified flows).
References
How to improve adversarial post-training and apply it to audio remains an open question.
— Fast Text-to-Audio Generation with Adversarial Post-Training
(2505.08175 - Novack et al., 13 May 2025) in Section 1 (Introduction)