Optimal scaling theory for information‑gain controlled updates in VTI
Derive mathematical properties characterizing the optimal scaling of the gradient update applied to the parameters of the model‑weights distribution q_η in Variational Transdimensional Inference when regulating updates via an information‑gain threshold IG(q_η^(t+1) || q_η^(t)). Specifically, ascertain conditions, stability criteria, and convergence behavior for step‑size scaling of η under the entropy‑based information‑gain constraint to ensure stable joint optimization of (η, φ).
Sponsor
References
We show empirical results for controlling this rate and leave any mathematical properties for the optimal scaling to future research.
— Amortized variational transdimensional inference
(2506.04749 - Davies et al., 5 Jun 2025) in Appendix, Subsection “Controlling the learning rate via the information gain” (apdx:ig_threshold)