Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion (2403.12987v1)
Abstract: In the field of Structure-based Drug Design (SBDD), deep learning-based generative models have achieved outstanding performance in terms of docking score. However, further study shows that the existing molecular generative methods and docking scores both have lacked consideration in terms of specificity, which means that generated molecules bind to almost every protein pocket with high affinity. To address this, we introduce the Delta Score, a new metric for evaluating the specificity of molecular binding. To further incorporate this insight for generation, we develop an innovative energy-guided approach using contrastive learning, with active compounds as decoys, to direct generative models toward creating molecules with high specificity. Our empirical results show that this method not only enhances the delta score but also maintains or improves traditional docking scores, successfully bridging the gap between SBDD and real-world needs.
- Equivariant energy-guided sde for inverse molecular design. arXiv preprint arXiv:2209.15408, 2022.
- Posebusters: Ai-based docking methods fail to generate physically valid poses or generalise to novel sequences. Chemical Science, 2024.
- Diffusion models beat gans on image synthesis. Advances in neural information processing systems, 34:8780–8794, 2021.
- Empirical scoring functions: I. the development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes. Journal of computer-aided molecular design, 11:425–445, 1997.
- Molecular docking and structure-based drug design strategies. Molecules, 20(7):13384–13421, 2015.
- Three-dimensional convolutional neural networks and a cross-docked data set for structure-based drug design. Journal of chemical information and modeling, 60(9):4200–4215, 2020.
- Glide: a new approach for rapid, accurate docking and scoring. 1. method and assessment of docking accuracy. Journal of medicinal chemistry, 47(7):1739–1749, 2004.
- DrugCLIP: Contrasive protein-molecule representation learning for virtual screening. In Thirty-seventh Conference on Neural Information Processing Systems, 2023. URL https://openreview.net/forum?id=lAbCgNcxm7.
- e3nn: Euclidean neural networks, 2022.
- 3d equivariant diffusion for target-aware molecule generation and affinity prediction. In International Conference on Learning Representations, 2023a.
- Decompdiff: Diffusion models with decomposed priors for structure-based drug design. 2023b.
- Benchmarking generated poses: How rational is structure-based drug design with generative models? arXiv preprint arXiv:2308.07413, 2023.
- Harrison, R. K. Phase ii and phase iii failures: 2013–2015. Nat Rev Drug Discov, 15(12):817–818, 2016.
- Equivariant diffusion for molecule generation in 3d. In International conference on machine learning, pp. 8867–8887. PMLR, 2022.
- Docking and scoring in virtual screening for drug discovery: methods and applications. Nature reviews Drug discovery, 3(11):935–949, 2004.
- Off-target toxicity is a common mechanism of action of cancer drugs undergoing clinical trials. sci. transl. med. 11: eaaw8412, 2019.
- Zero-shot 3d drug design by sketching and generating. In NeurIPS, 2022.
- A 3d generative model for structure-based drug design. In Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P., and Vaughan, J. W. (eds.), Advances in Neural Information Processing Systems, volume 34, pp. 6229–6239. Curran Associates, Inc., 2021. URL https://proceedings.neurips.cc/paper_files/paper/2021/file/314450613369e0ee72d0da7f6fee773c-Paper.pdf.
- Modeling the expansion of virtual screening libraries. Nature Chemical Biology, pp. 1–7, 2023.
- Generating 3d molecular structures conditional on a receptor binding site with deep generative models, 2020.
- Rapid context-dependent ligand desolvation in molecular docking. Journal of chemical information and modeling, 50(9):1561–1573, 2010.
- Pocket2mol: Efficient molecular sampling based on 3d protein pockets. In International Conference on Machine Learning, 2022.
- Vinardo: A scoring function based on autodock vina improves scoring, docking, and virtual screening. PloS one, 11(5):e0155183, 2016.
- Learning transferable visual models from natural language supervision. In International Conference on Machine Learning, 2021. URL https://api.semanticscholar.org/CorpusID:231591445.
- E (n) equivariant graph neural networks. In International conference on machine learning, pp. 9323–9332. PMLR, 2021.
- De novo design at the edge of chaos: Miniperspective. Journal of medicinal chemistry, 59(9):4077–4086, 2016.
- Autodock vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of computational chemistry, 31(2):455–461, 2010.
- Improved protein–ligand docking using gold. Proteins: Structure, Function, and Bioinformatics, 52(4):609–623, 2003.
- Further development and validation of empirical scoring functions for structure-based binding affinity prediction. Journal of computer-aided molecular design, 16:11–26, 2002.
- Estimation of clinical trial success rates and related parameters. Biostatistics, 20(2):273–286, 2019.
- Planet: A multi-objective graph neural network model for protein–ligand binding affinity prediction. Journal of Chemical Information and Modeling, 2023.
- Learning subpocket prototypes for generalizable structure-based drug design. ICML, 2023.
- Egsde: Unpaired image-to-image translation via energy-guided stochastic differential equations. Advances in Neural Information Processing Systems, 35:3609–3623, 2022.
- Improving protein–ligand docking and screening accuracies by incorporating a scoring function correction term. Briefings in Bioinformatics, 23(3):bbac051, 2022.
- Bowen Gao (14 papers)
- Minsi Ren (4 papers)
- Yuyan Ni (14 papers)
- Yanwen Huang (12 papers)
- Bo Qiang (10 papers)
- Zhi-Ming Ma (56 papers)
- Wei-Ying Ma (39 papers)
- Yanyan Lan (87 papers)