Dice Question Streamline Icon: https://streamlinehq.com

Theoretical guarantees for Monte Carlo loss estimation under change of measure in TPPF

Establish whether Monte Carlo estimators of the loss functions for the Twisted-Path Particle Filter—specifically the KL-divergence losses D_KL(P^φ || P^{φ*}), D_KL(P^{φ*} || P^φ), and their sum—together with their gradients, deteriorate when computed via the untwisted change-of-measure representation (equation (eq:untwisted) in Section 4.2) instead of sampling from the twisted Markov chain; rigorously determine conditions under which these estimators are consistent and quantify any bias or variance introduced by the change of measure.

Information Square Streamline Icon: https://streamlinehq.com

Background

In Section 4.2 the authors propose training the twisting function for their Twisted-Path Particle Filter (TPPF) using KL-divergence losses between path measures. Because sampling from the twisted transition kernel can be difficult, they provide an alternative "untwisted" change-of-measure expression (equation (eq:untwisted)) to compute the loss and its gradient using samples from the original, untwisted Markov chain.

This practical workaround raises a methodological concern: whether Monte Carlo approximation of the loss and its gradient remains reliable after such a change of measure. The authors explicitly note that they lack a theoretical guarantee for the accuracy or stability of this approximation, even though empirical results are encouraging. Formalizing guarantees would strengthen the theoretical foundation of TPPF and inform its use across models and dimensions.

References

Moreover, it is natural to doubt whether the Monte Carlo approximation for the loss (and its gradient) would deteriorate after a change of measure. So far we have got no theoretical guarantee for this, but empirically, the experiments show that eq:untwisted can approximate the loss well.

Guidance for twisted particle filter: a continuous-time perspective (2409.02399 - Lu et al., 4 Sep 2024) in Section 4.2 (Approximating the optimal twisting function in discrete-time setting), paragraph following equation (eq:untwisted)