Closed-form characterization of the velocity field in LPIPS-regularized flow-matching decoders
Derive a closed-form characterization of the non-straight, training-dependent velocity field learned by pixel-space diffusion decoders trained with flow matching augmented by LPIPS loss (i.e., optimizing L_λ(\hat{\nu}|\nu)=\|\hat{\nu}-\nu\|^2+\lambda L(x_0,\hat{x}_0)), where the effective target field is \nu - (\lambda t/2) \nabla L and depends on the noise level t and the LPIPS gradient. The goal is to obtain an explicit analytic expression for this velocity field that explains its shift during training and enables principled sampling schedule design.
References
This analysis reveals that LPIPS-regularized Flow-Matching decoders learn a non-straight velocity field that shifts during their training, and for which we do not have a closed form.