Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation (2401.02683v2)
Abstract: Denoising diffusion models have shown great potential in multiple research areas. Existing diffusion-based generative methods for de novo 3D molecule generation face two major challenges. First, since the majority of heavy atoms in molecules can connect to multiple atoms through single bonds, modeling molecular geometries solely with pairwise distances is insufficient. The first challenge is therefore to design an effective neural network as the denoising kernel, one capable of capturing complex multi-body interatomic relationships and learning high-quality features. Second, due to the discrete nature of graphs, mainstream diffusion-based methods for molecules rely heavily on predefined rules and generate edges indirectly. The second challenge is therefore to adapt molecule generation to diffusion and to accurately predict the existence of bonds. In our research, we view the iterative updating of molecular conformations in the diffusion process as consistent with molecular dynamics, and we introduce a novel molecule generation method named Geometric-Facilitated Molecular Diffusion (GFMDiff). To address the first challenge, we introduce a Dual-Track Transformer Network (DTN) to fully excavate global spatial relationships and learn high-quality representations, which contribute to accurate predictions of features and geometries. To address the second challenge, we design the Geometric-Facilitated Loss (GFLoss), which intervenes in the formation of bonds during training instead of directly embedding edges into the latent space. Comprehensive experiments on current benchmarks demonstrate the superiority of GFMDiff.
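To make the distinction between pairwise and multi-body geometry concrete, the following is a minimal sketch and not the authors' DTN or GFLoss. It computes two-body quantities (interatomic distances) and one three-body quantity (the angle at a central atom) for a water-like toy geometry. The function names `distance` and `bond_angle`, the coordinates, and the 0.96 Å / 104.5° values are illustrative assumptions, not taken from the paper.

```python
import math

def distance(a, b):
    """Two-body feature: Euclidean distance between two atoms."""
    return math.dist(a, b)

def bond_angle(center, a, b):
    """Three-body feature: angle (degrees) at `center` formed by atoms a and b."""
    u = [x - c for x, c in zip(a, center)]
    v = [x - c for x, c in zip(b, center)]
    dot = sum(p * q for p, q in zip(u, v))
    cos_theta = dot / (math.hypot(*u) * math.hypot(*v))
    # Clamp to guard against floating-point drift outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))

# Hypothetical water-like geometry: O at the origin, two H atoms 0.96 A away.
theta = math.radians(104.5)
oxygen = (0.0, 0.0, 0.0)
hydrogen_1 = (0.96, 0.0, 0.0)
hydrogen_2 = (0.96 * math.cos(theta), 0.96 * math.sin(theta), 0.0)

d_oh = distance(oxygen, hydrogen_1)        # pairwise (two-body)
d_hh = distance(hydrogen_1, hydrogen_2)    # pairwise (two-body)
angle = bond_angle(oxygen, hydrogen_1, hydrogen_2)  # three-body

print(f"O-H distance: {d_oh:.3f}")
print(f"H-H distance: {d_hh:.3f}")
print(f"H-O-H angle:  {angle:.1f} degrees")
```

A network that only consumes the distance from each atom to its neighbors sees the same O-H distances regardless of the H-O-H angle; an explicit angular term is one simple way to expose a multi-body relationship of the kind the abstract refers to.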
- Can Xu
- Haosen Wang
- Weigang Wang
- Pengfei Zheng
- Hongyang Chen