Conjecture: Convergence to a Shared Platonic Representation
Establish whether representation learning algorithms trained on diverse data modalities (e.g., images and text), objectives, and tasks converge to a shared representation of $Z$, the latent variable describing the underlying reality, and determine whether increasing model size, data scale, and task diversity causally drives this convergence.
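Testing this requires a way to score how similar two models' representations are. The sketch below shows one possible choice, a mutual k-nearest-neighbor overlap computed on the same set of inputs. The statement above does not prescribe this metric. The function names, the cosine-similarity kernel, and the default `k` are all assumptions made for illustration.

```python
import numpy as np


def knn_indices(feats: np.ndarray, k: int) -> np.ndarray:
    """Indices of the k nearest neighbors of each row under cosine similarity."""
    unit = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    sim = unit @ unit.T
    np.fill_diagonal(sim, -np.inf)  # a sample is never its own neighbor
    return np.argsort(-sim, axis=1)[:, :k]


def mutual_knn_alignment(feats_a: np.ndarray, feats_b: np.ndarray, k: int = 5) -> float:
    """Mean fraction of shared k-nearest neighbors between two representations.

    Row i of feats_a and row i of feats_b must describe the same input
    (e.g., an image and its caption). Returns a value in [0, 1].
    """
    nn_a = knn_indices(feats_a, k)
    nn_b = knn_indices(feats_b, k)
    overlap = [len(set(a) & set(b)) / k for a, b in zip(nn_a, nn_b)]
    return float(np.mean(overlap))


# Toy illustration with synthetic data (no real models involved).
rng = np.random.default_rng(0)
z = rng.normal(size=(200, 8))                    # stand-in for the latent Z
rotation, _ = np.linalg.qr(rng.normal(size=(8, 8)))
model_a = z                                      # one hypothetical "model"
model_b = z @ rotation                           # same geometry, rotated basis
unrelated = rng.normal(size=(200, 8))            # features ignoring Z

aligned_score = mutual_knn_alignment(model_a, model_b)
unrelated_score = mutual_knn_alignment(model_a, unrelated)
print(aligned_score, unrelated_score)
```

In this toy setup the rotated copy should score near 1 and the unrelated features near chance level. A real test of the conjecture would track such a score across models of increasing size, data scale, and task diversity. A rising trend alone would show correlation, not the causal link the problem asks about.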
References
We conjecture that representation learning algorithms will converge on a shared representation of $Z$, and scaling model size, as well as data and task diversity, drives this convergence.
Source: The Platonic Representation Hypothesis (arXiv:2405.07987, Huh et al., 13 May 2024), Figure 1 caption and Section 1 (Introduction)