Cause of flat/increasing TV vs guidance at large w under Tau-leaping

Determine whether the flat or increasing region in the curve of total variation (TV) distance versus guidance strength w, observed for 1D masked discrete diffusion reverse dynamics simulated via Tau-leaping, is primarily caused by the sharp transition of the reverse sampling dynamics at large w, which makes the Tau-leaping scheme less efficient and less stable.

Background

The authors empirically plot the total variation distance as a function of guidance strength w for 1D guided masked discrete diffusion and note agreement with theory at small w but a flattening or slight increase at large w. They attribute this behavior to numerical issues rather than theoretical disagreement.
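For reference, the quantity on the vertical axis of such a plot can be estimated from samples. The following is a minimal sketch, assuming a finite state space and a known target distribution given as a dictionary; the function name `empirical_tv` and the input format are illustrative choices, not taken from the paper.

```python
from collections import Counter

def empirical_tv(samples, target):
    """Estimate TV(p_hat, target) = 0.5 * sum_x |p_hat(x) - target(x)|.

    samples: non-empty list of hashable states drawn from the sampler (assumed input format).
    target:  dict mapping each state to its target probability (assumed input format).
    """
    counts = Counter(samples)
    n = len(samples)
    # Sum over every state that appears in either distribution.
    states = set(counts) | set(target)
    return 0.5 * sum(abs(counts.get(x, 0) / n - target.get(x, 0.0)) for x in states)

# Toy usage with a two-state target.
print(empirical_tv([0, 0, 1, 1], {0: 0.5, 1: 0.5}))
```

With a finite number of samples this estimate carries its own Monte Carlo error, which is one more effect to separate from discretization error when reading a TV-versus-w curve.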

Given their theoretical result that the decay rate of total variation along the reverse dynamics depends double-exponentially on w for large w, they conjecture that Tau-leaping struggles with the resulting sharp temporal transition, leading to inefficiency and instability. Confirming this causal mechanism would clarify practical limitations and inform more robust numerical strategies.
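The conjectured mechanism can be illustrated on a toy problem. The sketch below is not the paper's reverse dynamics: it assumes a single masked token that unmasks at a hypothetical rate r(t) = a·k·exp(−k·t), where the parameter k is only a stand-in for how sharply the transition is concentrated in time. Tau-leaping freezes the rate at the start of each step, so the probability of remaining masked becomes exp(−Σ r(t_j)·τ), a left Riemann sum in place of the exact integral. Because the state space has two states, the TV distance between the exact and Tau-leaping laws reduces to the absolute difference of the masked probabilities.

```python
import math

def exact_masked_prob(a, k, T):
    # Exact survival: exp(-integral_0^T a*k*exp(-k*t) dt) = exp(-a*(1 - exp(-k*T))).
    return math.exp(-a * (1.0 - math.exp(-k * T)))

def tau_leaping_masked_prob(a, k, T, n_steps):
    # Tau-leaping holds the rate fixed at the left endpoint of each step;
    # the token stays masked in a step with probability exp(-rate * tau).
    tau = T / n_steps
    log_p = 0.0
    for j in range(n_steps):
        rate = a * k * math.exp(-k * j * tau)
        log_p -= rate * tau
    return math.exp(log_p)

def tv_gap(a, k, T, n_steps):
    # Two-state TV distance between the exact and Tau-leaping laws at time T.
    return abs(exact_masked_prob(a, k, T) - tau_leaping_masked_prob(a, k, T, n_steps))

# Hypothetical parameters: fixed step budget, increasingly sharp transition.
for k in (1.0, 10.0, 100.0):
    print(k, tv_gap(a=1.0, k=k, T=1.0, n_steps=20))
```

In this toy, the discretization error at a fixed step budget grows as the transition sharpens and shrinks again when the step size is refined. That is consistent with the conjecture, but it does not confirm it for the actual guided dynamics.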

References

For large w, we observe a flat/increasing region in the plot. We conjecture that this is mainly due to the sharp transition of the reverse sampling dynamics for large w (as shown in the paper's remark on the 1D convergence rate), which makes the Tau-leaping scheme less efficient and less stable.

What Exactly Does Guidance Do in Masked Discrete Diffusion Models (2506.10971 - Ye et al., 12 Jun 2025) in Section 5 (Numerical Examples), Experiments in 1D