Tokenization for unevenly-sampled light curves
Develop a tokenization scheme for unevenly-sampled stellar light curves that enables efficient representation learning for transformer-based autoregressive generative modeling, addressing strong local correlations between adjacent observations and reducing sequence length and training/inference cost.
References
The tokenization of unevenly-sampled light curves, however, remains an open question.
— The Scaling Law in Stellar Light Curves
(2405.17156 - Pan et al., 27 May 2024) in Section Discussion