Does the lack of frontier expansion with increased NFE persist at larger scales?
Ascertain whether the observed absence of entropy–perplexity frontier expansion as the number of function evaluations (NFE) increases for discrete diffusion language models is merely an artifact of the model scale used in the experiments, or whether this behavior persists for larger diffusion models.
References
We observe that the frontiers are largely the same, implying that the performance improvements obtained by increasing NFE can be achieved via temperature tuning. We do not know whether this is simply an artifact of this specific scale of models, or whether this is a persistent phenomenon for larger diffusion models.
— CANDI: Hybrid Discrete-Continuous Diffusion Models
(2510.22510 - Pynadath et al., 26 Oct 2025) in Appendix, Section 6: OWT Text Generation, Frontier Curves for Higher NFE