Effectiveness of DNALM pre-training in eukaryotes
Determine whether the pre-training strategies used in DNA language models (DNALMs) for eukaryotic genomes effectively capture key biological properties and consistently outperform traditional approaches.
References
However, in eukaryotes, questions remain about whether the pre-training strategies of DNA LLMs (DNALMs) effectively capture key biological properties and consistently outperform traditional approaches .
                — BMFM-DNA: A SNP-aware DNA foundation model to capture variant effects
                
                (2507.05265 - Li et al., 26 Jun 2025) in Section 1 (Introduction)