Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 83 tok/s
Gemini 2.5 Pro 54 tok/s Pro
GPT-5 Medium 21 tok/s Pro
GPT-5 High 20 tok/s Pro
GPT-4o 103 tok/s Pro
Kimi K2 205 tok/s Pro
GPT OSS 120B 456 tok/s Pro
Claude Sonnet 4 35 tok/s Pro
2000 character limit reached

A Review on Fragment-based De Novo 2D Molecule Generation (2405.05293v1)

Published 8 May 2024 in q-bio.BM and cs.LG

Abstract: In the field of computational molecule generation, an essential task in the discovery of new chemical compounds, fragment-based deep generative models are a leading approach, consistently achieving state-of-the-art results in molecular design benchmarks as of 2023. We present a detailed comparative assessment of their architectures, highlighting their unique approaches to molecular fragmentation and generative modeling. This review also includes comparisons of output quality, generation speed, and the current limitations of specific models. We also highlight promising avenues for future research that could bridge fragment-based models to real-world applications.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (36)
  1. Generative models for molecular discovery: Recent advances and challenges. Wiley Interdisciplinary Reviews: Computational Molecular Science, 12, 2022.
  2. Guacamol: benchmarking models for de novo molecular design. Journal of chemical information and modeling, 59(3):1096–1108, 2019.
  3. MolGAN: An implicit generative model for small molecular graphs. ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018.
  4. Artificial intelligence in drug discovery: Applications and techniques. Briefings in bioinformatics, 2021.
  5. Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. Journal of Cheminformatics, 1:8 – 8, 2009.
  6. Deep graph generators: A survey. IEEE Access, 9:106675–106702, 2020.
  7. Gage, P. A new algorithm for data compression. The C Users Journal archive, 12:23–38, 1994.
  8. De novo molecular generation via connection-aware motif mining. In The Eleventh International Conference on Learning Representations, 2023.
  9. Automatic chemical design using a data-driven continuous representation of molecules. ACS Central Science, 4:268 – 276, 2016.
  10. Magnet: Motif-agnostic generation of molecules from shapes. arXiv preprint arXiv:2305.19303, 2023.
  11. Equivariant diffusion for molecule generation in 3d. In International conference on machine learning, pp. 8867–8887. PMLR, 2022.
  12. Deep learning methods for small molecule drug discovery: A survey. IEEE Transactions on Artificial Intelligence, 2023.
  13. Zinc20—a free ultralarge-scale chemical database for ligand discovery. Journal of chemical information and modeling, 60(12):6065–6073, 2020.
  14. Junction tree variational autoencoder for molecular graph generation. In International conference on machine learning, pp. 2323–2332. PMLR, 2018.
  15. Multi-objective molecule generation using interpretable substructures. In International Conference on Machine Learning, 2020a.
  16. Hierarchical generation of molecular graphs using structural motifs. In International conference on machine learning, pp. 4839–4848. PMLR, 2020b.
  17. Auto-encoding variational bayes. CoRR, abs/1312.6114, 2013.
  18. Molecule generation by principal subgraph mining and assembling. Advances in Neural Information Processing Systems, 35:2550–2563, 2022.
  19. Grammar variational autoencoder. In International Conference on Machine Learning, 2017.
  20. Learning to extend molecular scaffolds with structural motifs. In International Conference on Learning Representations, 2022.
  21. Chembl: towards direct deposition of bioassay data. Nucleic acids research, 47(D1):D930–D940, 2019.
  22. Augmenting genetic algorithms with deep neural networks for exploring the chemical space. In International Conference on Learning Representations, 2020.
  23. Fréchet chemnet distance: a metric for generative models for molecules in drug discovery. Journal of chemical information and modeling, 58(9):1736–1741, 2018.
  24. Coarse-to-fine: a hierarchical diffusion model for molecule generation in 3d. In International Conference on Machine Learning, pp. 28277–28299. PMLR, 2023.
  25. Enumeration of 166 billion organic small molecules in the chemical universe database gdb-17. Journal of chemical information and modeling, 52(11):2864–2875, 2012.
  26. Computer-based de novo design of drug-like molecules. Nature Reviews Drug Discovery, 4:649–663, 2005.
  27. Graphaf: a flow-based autoregressive model for molecular graph generation. In International Conference on Learning Representations, 2020.
  28. The power of motifs as inductive bias for learning molecular distributions. In ICLR 2023 - Machine Learning for Drug Discovery workshop, 2023.
  29. Top-n: Equivariant set and graph generation without exchangeability. In International Conference on Learning Representations, 2022.
  30. Weininger, D. Smiles, a chemical language and information system. 1. introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci., 28:31–36, 1988.
  31. Efficient multi-objective molecular optimization in a continuous latent space. Chemical science, 10(34):8016–8024, 2019.
  32. {MARS}: Markov molecular sampling for multi-objective drug discovery. In International Conference on Learning Representations, 2021.
  33. Geometric latent diffusion models for 3d molecule generation. In International Conference on Machine Learning, pp. 38592–38610. PMLR, 2023.
  34. Molecule generation for drug design: a graph learning perspective. arXiv preprint arXiv:2202.09212, 2022.
  35. Hit and lead discovery with explorative RL and fragment-based molecule generation. In Beygelzimer, A., Dauphin, Y., Liang, P., and Vaughan, J. W. (eds.), Advances in Neural Information Processing Systems, 2021.
  36. Deep set prediction networks. Advances in Neural Information Processing Systems, 32, 2019.

Summary

We haven't generated a summary for this paper yet.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (1)

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube