Reconciling the proper bases for spectral normalization and variance adaptation
Investigate how to reconcile the bases in which spectral normalization and variance adaptation are applied within matrix-whitening optimizers. Specifically, determine whether variance adaptation should be performed in the rotated eigenbasis (as in SOAP) or in the original elementwise basis after orthogonalization (as in AdaMuon).
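The two placements of variance adaptation can be contrasted in a minimal NumPy sketch. This is an illustration under simplifying assumptions, not the reference implementation of either optimizer: the function names (orthogonalize, soap_style_step, adamuon_style_step) are hypothetical, the eigenbases are computed from a single gradient rather than from running averages, an exact SVD stands in for an iterative orthogonalization such as Newton-Schulz, and momentum, bias correction, and any final rescaling of the update are omitted.

```python
import numpy as np


def orthogonalize(G):
    """Spectral normalization: set every singular value of G to 1 (exact SVD, an assumption)."""
    U, _, Vt = np.linalg.svd(G, full_matrices=False)
    return U @ Vt


def soap_style_step(G, V_rot, beta2=0.999, eps=1e-8):
    """Variance adaptation applied in the rotated eigenbasis (SOAP-like sketch)."""
    # Eigenvectors of the left/right Gram matrices define the rotation.
    # Assumption: a practical optimizer would use running averages of G G^T and G^T G.
    _, QL = np.linalg.eigh(G @ G.T)
    _, QR = np.linalg.eigh(G.T @ G)
    G_rot = QL.T @ G @ QR                              # gradient in the rotated basis
    V_rot = beta2 * V_rot + (1.0 - beta2) * G_rot**2   # second moment tracked in that basis
    update_rot = G_rot / (np.sqrt(V_rot) + eps)        # Adam-style normalization, still rotated
    return QL @ update_rot @ QR.T, V_rot               # rotate the update back


def adamuon_style_step(G, V_elem, beta2=0.999, eps=1e-8):
    """Variance adaptation applied elementwise after orthogonalization (AdaMuon-like sketch)."""
    O = orthogonalize(G)                               # spectral normalization first
    V_elem = beta2 * V_elem + (1.0 - beta2) * O**2     # second moment in the original basis
    return O / (np.sqrt(V_elem) + eps), V_elem


# Toy usage on a random 4x3 "gradient" with zero-initialized second moments.
rng = np.random.default_rng(0)
G = rng.standard_normal((4, 3))
soap_update, _ = soap_style_step(G, np.zeros_like(G))
adamuon_update, _ = adamuon_style_step(G, np.zeros_like(G))
```

In the first function the second-moment buffer lives in the rotated coordinates, while in the second it lives in the original parameter coordinates and only sees the already orthogonalized matrix. The open question is which of these placements is the principled one.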
References
We leave further examination on how to reconcile the proper bases for spectral-normalization and variance-adaptation to future investigation.
— What Really Matters in Matrix-Whitening Optimizers?
(2510.25000 - Frans et al., 28 Oct 2025) in Subsection “Why does variance adaptation still work when done after orthogonalization?”