Scaling of intermediate-temperature sampling results to larger transformers and ambitious protein-structure tasks
Determine how the empirical findings obtained by sampling transformer parameter spaces at intermediate temperatures in small one- and four-block transformers trained on synthetic protein sequences scale to larger transformer architectures and to more ambitious tasks, such as predicting the structures of all known proteins.
References
Of course, we cannot yet determine how our results scale for larger transformers and more ambitious tasks, such as learning the structure of all known proteins.
— Sampling at intermediate temperatures is optimal for training large language models in protein structure prediction
(2603.29529 - Ghiringhelli et al., 31 Mar 2026) in Discussion and Conclusions