Insensitivity of KEEP performance to optical flow estimator accuracy
Investigate whether the limited effect of optical flow estimator accuracy on the performance of the Kalman-Inspired fEaturE Propagation (KEEP) framework for video face super-resolution arises from two specific factors: (i) attenuation of minor pixel-space misalignments due to the 32× downsampling of latent codes, which makes latent representations less sensitive to small spatial discrepancies, and (ii) compensation of flow estimation inaccuracies by jointly trained modules within KEEP. Establish the validity of these two factors as the explanation for the observed insensitivity to flow estimator choice.
References
We conjecture this can be attributed to two factors: 1) Minor misalignment in pixel space can be reasonably diminished as the latent code is highly downsampled by a factor of 32×, at which level the latent representations are less sensitive to small spatial discrepancies present in the pixel space. 2) Other modules can compensate for the inaccuracy caused by flow estimators in a joint training fashion.