Predicting the scale at which emergent capabilities appear in large language models
Determine how to reliably predict, as a function of model scale (e.g., parameter count, training data size, and training compute), the thresholds at which new qualitative “breakthrough” capabilities emerge in large language models, in order to anticipate capability onset and associated risks.
References
These breakthrough capabilities have been observed empirically, but we are unable to reliably predict the scale at which new breakthroughs will happen.
— Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
(2206.04615 - Srivastava et al., 2022) in Subsection “Quantity has a quality all its own,” Section 1 (Introduction)