Dice Question Streamline Icon: https://streamlinehq.com

Annotation of MS/MS spectra remains highly unsolved

Establish computational methods that achieve reliable, high-accuracy annotation of tandem mass spectra (MS/MS) with correct molecular structures at scale, closing the performance gaps observed for baseline models in the MassSpecGym benchmark.

Information Square Streamline Icon: https://streamlinehq.com

Background

The MassSpecGym benchmark evaluates de novo structure generation, molecule retrieval, and spectrum simulation under a generalization-demanding split. Baseline models show limited performance, particularly for de novo generation where accuracy is zero, indicating substantial room for improvement.

Based on these results, the authors explicitly state that annotating MS/MS spectra with molecular structures remains a highly unsolved problem, motivating further methodological advances and standardized benchmarking.

References

We evaluated a series of baseline methods and demonstrated that the annotation of MS/MS spectra remains a highly unsolved problem.

MassSpecGym: A benchmark for the discovery and identification of molecules (2410.23326 - Bushuiev et al., 30 Oct 2024) in Conclusions