Study of the κ hyperparameter in Sven
Investigate the influence of the hyperparameter $\kappa > 0$ in the generalized decomposition $L(\theta) = \sum_\alpha \left( (\ell_\alpha(\theta))^{\kappa/2} \right)^{2/\kappa}$ used by Sven. Specifically, analyze how defining the effective residuals $\mathcal{R}^{\mathrm{eff}}_\alpha = (\ell_\alpha(\theta))^{\kappa/2}$ and the corresponding Jacobian $M$ affects the pseudoinverse-based update, convergence behavior, and performance across regimes and losses.
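To make the role of κ concrete, here is a minimal NumPy sketch of one pseudoinverse-based step built from the effective residuals. It is an illustration under stated assumptions, not the paper's implementation: the update form θ ← θ − η M⁺ ℛ^eff, the chain-rule Jacobian M_{αi} = ∂ℛ^eff_α/∂θ_i, the toy squared-error loss, and the names `per_sample_losses`, `per_sample_loss_grads`, `kappa_step`, `lr`, `rcond`, and `eps` are all hypothetical choices made for this example.

```python
import numpy as np


def per_sample_losses(theta, X, y):
    """Toy per-sample loss (assumption): l_alpha = (x_alpha . theta - y_alpha)^2."""
    r = X @ theta - y
    return r ** 2


def per_sample_loss_grads(theta, X, y):
    """Rows are d l_alpha / d theta = 2 (x_alpha . theta - y_alpha) x_alpha."""
    r = X @ theta - y
    return 2.0 * r[:, None] * X


def kappa_step(theta, X, y, kappa=2.0, lr=1.0, rcond=1e-10, eps=1e-12):
    """One assumed update theta <- theta - lr * pinv(M) @ R_eff."""
    if kappa <= 0:
        raise ValueError("kappa must be positive")
    # Clip losses away from zero so negative powers (kappa < 2) stay finite.
    ell = np.maximum(per_sample_losses(theta, X, y), eps)
    grads = per_sample_loss_grads(theta, X, y)
    # Effective residuals R_eff_alpha = l_alpha^(kappa/2).
    r_eff = ell ** (kappa / 2.0)
    # Chain rule: dR_eff/dtheta = (kappa/2) * l^(kappa/2 - 1) * dl/dtheta.
    M = (kappa / 2.0) * ell[:, None] ** (kappa / 2.0 - 1.0) * grads
    # Moore-Penrose pseudoinverse; rcond discards tiny singular values.
    delta = np.linalg.pinv(M, rcond=rcond) @ r_eff
    return theta - lr * delta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(20, 3))
    y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=20)
    for kappa in (0.5, 1.0, 2.0):
        theta = np.zeros(3)
        for _ in range(5):
            theta = kappa_step(theta, X, y, kappa=kappa, lr=0.5)
        print(kappa, per_sample_losses(theta, X, y).sum())
```

Under these assumptions, κ = 2 uses the per-sample losses themselves as residuals, while κ = 1 uses their square roots. For this toy squared-error loss, the κ = 1 step with `lr=1.0` coincides with a Gauss-Newton step on a linear least-squares problem, which gives one simple reference point when comparing regimes.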
References
We leave a detailed study of the κ hyperparameter to future work.
— Sven: Singular Value Descent as a Computationally Efficient Natural Gradient Method
(2604.01279 - Bright-Thonney et al., 1 Apr 2026) in Section 2 (Methodology), discussion following the κ-generalization of the loss decomposition