Scaling neural networks on astronomical observation data
Determine whether neural networks trained on astronomical observation data can be scaled in a manner analogous to established scaling behavior in the textual domain (i.e., whether increasing model parameters and data leads to predictable performance improvements as observed for large language models).
References
While a robust and innovative approach, \citet{ref_leung2023} leave some open questions which we hope to complement with this work: that is, can we scale neural networks on astronomical observation data just as we have done in the textual domain, and do we need the computational and architectural overhead of pretraining a full encoder-decoder transformer architecture to teach our models scientifically useful information?
— AstroPT: Scaling Large Observation Models for Astronomy
(2405.14930 - Smith et al., 23 May 2024) in Section 1, On 'Large Observation Models' (Introduction)