Demonstrate suitability of OMol25-trained models for underrepresented chemistries
Determine whether machine learning interatomic potentials trained on the Open Molecules 2025 (OMol25) dataset achieve sufficient accuracy and generalization for chemical classes with limited or absent coverage in OMol25, specifically lanthanide complexes, multimetallic structures, solvated protonated organic molecules and metal complexes, polymeric materials, and actinide-containing compounds, by rigorously evaluating their performance on these domains.
References
Furthermore, the coverage of certain classes of materials such as lanthanides complexes, multimetallic structures, and solvated protonated organic molecules and metal complexes are relatively limited. Although baseline models trained on OMol25 may still be suitable for these applications, it has yet to be demonstrated.