Extension beyond Gaussian supervised inputs to natural self-supervised settings
Extend the high-dimensional analysis of empirical risk minimization for single-head tied attention from Gaussian input embeddings with supervised targets to natural structured inputs learned in a self-supervised manner, determining whether the derived generalization and spectral predictions continue to hold.
References
Second, we assume Gaussian input embeddings with structured input-output relationship under supervised learning, leaving open the extension to natural structured inputs from which one learns in self-supervised manner.
— Inductive Bias and Spectral Properties of Single-Head Attention in High Dimensions
(2509.24914 - Boncoraglio et al., 29 Sep 2025) in Section 6, Conclusion and limitations