MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space (2404.12141v4)
Abstract: Generative models for structure-based drug design (SBDD) have shown promising results in recent years. Existing works mainly focus on how to generate molecules with higher binding affinity, ignoring the feasibility prerequisites for generated 3D poses and resulting in false positives. We conduct thorough studies on key factors of ill-conformational problems when applying autoregressive methods and diffusion to SBDD, including mode collapse and hybrid continuous-discrete space. In this paper, we introduce MolCRAFT, the first SBDD model that operates in the continuous parameter space, together with a novel noise reduced sampling strategy. Empirical results show that our model consistently achieves superior performance in binding affinity with more stable 3D structure, demonstrating our ability to accurately model interatomic interactions. To our best knowledge, MolCRAFT is the first to achieve reference-level Vina Scores (-6.59 kcal/mol) with comparable molecular size, outperforming other strong baselines by a wide margin (-0.84 kcal/mol). Code is available at https://github.com/AlgoMole/MolCRAFT.
- Fast, accurate, and reliable molecular docking with quickvina 2. Bioinformatics, 31(13):2214–2216, 2015.
- Layer normalization. arXiv preprint arXiv:1607.06450, 2016.
- Molecular generation with recurrent neural networks (rnns). arXiv preprint arXiv:1705.04612, 2017.
- Three-dimensional convolutional neural networks and a cross-docked data set for structure-based drug design. Journal of Chemical Information and Modeling, 60(9):4200–4215, 2020a. doi: 10.1021/acs.jcim.0c00411. URL https://doi.org/10.1021/acs.jcim.0c00411. PMID: 32865404.
- Three-dimensional convolutional neural networks and a cross-docked data set for structure-based drug design. Journal of chemical information and modeling, 60(9):4200–4215, 2020b.
- Automatic chemical design using a data-driven continuous representation of molecules. ACS central science, 4(2):268–276, 2018.
- Bayesian flow networks. arXiv preprint arXiv:2308.07037, 2023.
- Energy-inspired molecular conformation optimization. In international conference on learning representations, 2021.
- 3d equivariant diffusion for target-aware molecule generation and affinity prediction. In The Eleventh International Conference on Learning Representations, 2022.
- DecompDiff: Diffusion models with decomposed priors for structure-based drug design. In Krause, A., Brunskill, E., Cho, K., Engelhardt, B., Sabato, S., and Scarlett, J. (eds.), Proceedings of the 40th International Conference on Machine Learning, volume 202 of Proceedings of Machine Learning Research, pp. 11827–11846. PMLR, 23–29 Jul 2023. URL https://proceedings.mlr.press/v202/guan23a.html.
- Benchmarking generated poses: How rational is structure-based drug design with generative models? arXiv preprint arXiv:2308.07413, 2023.
- Protein-ligand blind docking using quickvina-w with inter-process spatio-temporal integration. Scientific reports, 7(1):15451, 2017.
- Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
- Equivariant diffusion for molecule generation in 3d. In International conference on machine learning, pp. 8867–8887. PMLR, 2022.
- Structure-based drug design with geometric deep learning. Current Opinion in Structural Biology, 79:102548, April 2023. ISSN 0959440X. doi: 10.1016/j.sbi.2023.102548. URL https://linkinghub.elsevier.com/retrieve/pii/S0959440X23000222.
- Pocketflow is a data-and-knowledge-driven structure-based molecular generative model. Nature Machine Intelligence, pp. 1–12, 2024.
- Alphaspace 2.0: Representing concave biomolecular surfaces using beta-clusters. Journal of Chemical Information and Modeling, 60(3):1494–1508, 2020. doi: 10.1021/acs.jcim.9b00652. URL https://doi.org/10.1021/acs.jcim.9b00652. PMID: 31995373.
- Generating 3D Molecules for Target Protein Binding, May 2022. URL http://arxiv.org/abs/2204.09410. arXiv:2204.09410 [cs, q-bio].
- Zero-shot 3d drug design by sketching and generating. In NeurIPS, 2022.
- A 3D Generative Model for Structure-Based Drug Design. Advances in Neural Information Processing Systems, 34:6229–6239, 2021. URL http://arxiv.org/abs/2203.10446.
- Generating 3D Molecular Structures Conditional on a Receptor Binding Site with Deep Generative Models, November 2020. URL http://arxiv.org/abs/2010.14442. arXiv:2010.14442 [physics, q-bio].
- Gnina 1.0: molecular docking with deep learning. Journal of cheminformatics, 13(1):1–20, 2021.
- Pocket2Mol: Efficient molecular sampling based on 3D protein pockets. In Chaudhuri, K., Jegelka, S., Song, L., Szepesvari, C., Niu, G., and Sabato, S. (eds.), Proceedings of the 39th International Conference on Machine Learning, volume 162 of Proceedings of Machine Learning Research, pp. 17644–17655. PMLR, 17–23 Jul 2022. URL https://proceedings.mlr.press/v162/peng22b.html.
- Moldiff: Addressing the atom-bond inconsistency problem in 3d molecule diffusion generation. In International Conference on Machine Learning, pp. 27611–27629. PMLR, 2023.
- Fragment-based ligand generation guided by geometric deep learning on protein-ligand structures. In ICLR2022 Machine Learning for Drug Discovery, 2022. URL https://openreview.net/forum?id=192L9cr-8HU.
- E (n) equivariant graph neural networks. In International conference on machine learning, pp. 9323–9332. PMLR, 2021.
- Structure-based Drug Design with Equivariant Diffusion Models, October 2022. URL http://arxiv.org/abs/2210.13695. arXiv:2210.13695 [cs, q-bio].
- Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS central science, 4(1):120–131, 2018.
- Unified generative modeling of 3d molecules via bayesian flow networks. arXiv preprint arXiv:2403.15441, 2024a.
- Equivariant flow matching with hybrid probability transport for 3d molecule generation. Advances in Neural Information Processing Systems, 36, 2024b.
- The application of in silico drug-likeness predictions in pharmaceutical research. Advanced drug delivery reviews, 86:2–10, 2015.
- Autodock vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry, 31(2):455–461, 2010. doi: https://doi.org/10.1002/jcc.21334. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/jcc.21334.
- Understanding drug-likeness. Wiley Interdisciplinary Reviews: Computational Molecular Science, 1(5):760–781, 2011.
- Walters, W. P. Virtual chemical libraries. Journal of Medicinal Chemistry, 62(3):1116–1124, 2019. doi: 10.1021/acs.jmedchem.8b01048. URL https://doi.org/10.1021/acs.jmedchem.8b01048. PMID: 30148631.
- Deep learning approaches for de novo drug design: An overview. Current Opinion in Structural Biology, 72:135–144, February 2022. ISSN 0959440X. doi: 10.1016/j.sbi.2021.10.001. URL https://linkinghub.elsevier.com/retrieve/pii/S0959440X21001433.
- Geodiff: A geometric diffusion model for molecular conformation generation. In International Conference on Learning Representations, 2021.
- Learning Subpocket Prototypes for Generalizable Structure-based Drug Design, May 2023. URL http://arxiv.org/abs/2305.13997. arXiv:2305.13997 [cs, q-bio].
- Molecule Generation For Target Protein Binding with Structural Motifs. 2023.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.