Optimal Weight Selection for Weighted Goal Loss in Gradient-Based Planning
Determine the optimal sequence of weights {w_i}_{i=2}^{H+1} in the Weighted Goal Loss objective used for gradient-based planning with latent world models f_θ and a fixed encoder Φ_μ. Specifically, select the weighting scheme for L_WGL, defined as the average over timesteps of w_i times the squared L2 distance between the predicted latent state at timestep i and the goal latent state, so as to most effectively guide optimization of action sequences under a planning horizon H.
Sponsor
References
We leave the optimal selection of this sequence of weights as future work.
— Closing the Train-Test Gap in World Models for Gradient-Based Planning
(2512.09929 - Parthasarathy et al., 10 Dec 2025) in Appendix, Section "Additional Experiment Results", Subsection "Weighted Goal Loss"