Conditions under which normalization layers substitute for volume control
Determine conditions under which normalization layers in neural networks effectively substitute for volume control in distance-based log-sum-exp objectives, and characterize when they fail to do so, with respect to preventing collapse in implicit expectation-maximization dynamics.
Sponsor
References
Several directions remain open. Understanding when normalization layers substitute for volume control, and when they do not, would connect the implicit EM framework to practical stability concerns.
— Gradient Descent as Implicit EM in Distance-Based Neural Models
(2512.24780 - Oursland, 31 Dec 2025) in Discussion, Open Directions (Section 7, Open Directions)