Theoretical guarantees for Monte Carlo loss estimation under change of measure in TPPF
Establish whether Monte Carlo estimators of the loss functions for the Twisted-Path Particle Filter—specifically the KL-divergence losses D_KL(P^φ || P^{φ*}), D_KL(P^{φ*} || P^φ), and their sum—together with their gradients, deteriorate when computed via the untwisted change-of-measure representation (equation (eq:untwisted) in Section 4.2) instead of sampling from the twisted Markov chain; rigorously determine conditions under which these estimators are consistent and quantify any bias or variance introduced by the change of measure.
References
Moreover, it is natural to doubt whether the Monte Carlo approximation for the loss (and its gradient) would deteriorate after a change of measure. So far we have got no theoretical guarantee for this, but empirically, the experiments show that eq:untwisted can approximate the loss well.