3M-Diffusion: Latent Multi-Modal Diffusion for Language-Guided Molecular Structure Generation (2403.07179v2)
Abstract: Generating molecular structures with desired properties is a critical task with broad applications in drug discovery and materials design. We propose 3M-Diffusion, a novel multi-modal molecular graph generation method, to generate diverse, ideally novel molecular structures with desired properties. 3M-Diffusion encodes molecular graphs into a graph latent space which it then aligns with the text space learned by encoder-based LLMs from textual descriptions. It then reconstructs the molecular structure and atomic attributes based on the given text descriptions using the molecule decoder. It then learns a probabilistic mapping from the text space to the latent molecular graph space using a diffusion model. The results of our extensive experiments on several datasets demonstrate that 3M-Diffusion can generate high-quality, novel and diverse molecular graphs that semantically match the textual description provided.
- Scibert: A pretrained language model for scientific text. arXiv preprint arXiv:1903.10676, 2019.
- Quantifying the chemical beauty of drugs. Nature chemistry, 4(2):90–98, 2012.
- Molecular generation with recurrent neural networks (rnns). arXiv preprint arXiv:1705.04612, 2017.
- 970 million druglike small molecules for virtual screening in the chemical universe database gdb-13. Journal of the American Chemical Society, 131(25):8732–8733, 2009.
- Guacamol: benchmarking models for de novo molecular design. Journal of chemical information and modeling, 59(3):1096–1108, 2019.
- Molecule optimization by explainable evolution. In International conference on learning representation (ICLR), 2021.
- Unifying molecular and textual representations via multi-task language modelling. arXiv preprint arXiv:2301.12586, 2023.
- Improving graph generation by restricting graph bandwidth. In International Conference on Machine Learning, pp. 7939–7959. PMLR, 2023.
- Molgensurvey: A systematic survey in machine learning models for molecule design. arXiv preprint arXiv:2203.14500, 2022.
- Reoptimization of mdl keys for use in drug discovery. Journal of chemical information and computer sciences, 42(6):1273–1280, 2002.
- Text2mol: Cross-modal molecule retrieval with natural language queries. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 595–607, 2021.
- Translation between molecules and natural language. arXiv preprint arXiv:2204.11817, 2022.
- Mol-instructions: A large-scale biomolecular instruction dataset for large language models. arXiv preprint arXiv:2306.08018, 2023a.
- Molecular language model as multi-task generator. arXiv preprint arXiv:2301.11259, 2023b.
- Language models can learn complex molecular distributions. Nature Communications, 13(1):3293, 2022.
- Differentiable scaffolding tree for molecular optimization. arXiv preprint arXiv:2109.10469, 2021.
- Automatic chemical design using a data-driven continuous representation of molecules. ACS central science, 4(2):268–276, 2018.
- Graphgen: A scalable approach to domain-agnostic labeled graph generation. In Proceedings of The Web Conference 2020, pp. 1253–1263, 2020.
- Grisoni, F. Chemical language models for de novo drug design: Challenges and opportunities. Current Opinion in Structural Biology, 79:102527, 2023.
- Graphite: Iterative generative modeling of graphs. In International conference on machine learning, pp. 2434–2444. PMLR, 2019.
- Dipol-gan: Generating molecular graphs adversarially with relational differentiable pooling. Under review, 2017.
- A decade of fragment-based drug design: strategic advances and lessons learned. Nature reviews Drug discovery, 6(3):211–219, 2007.
- Chebi in 2016: Improved services and an expanding collection of metabolites. Nucleic acids research, 44(D1):D1214–D1219, 2016.
- Classifier-free diffusion guidance. arXiv preprint arXiv:2207.12598, 2022.
- Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
- Strategies for pre-training graph neural networks. arXiv preprint arXiv:1905.12265, 2019.
- Zinc: a free tool to discover chemistry for biology. Journal of chemical information and modeling, 52(7):1757–1768, 2012.
- Junction tree variational autoencoder for molecular graph generation. In International conference on machine learning, pp. 2323–2332. PMLR, 2018.
- Hierarchical generation of molecular graphs using structural motifs. In International conference on machine learning, pp. 4839–4848. PMLR, 2020.
- Score-based generative modeling of graphs via the system of stochastic differential equations. In International Conference on Machine Learning, pp. 10362–10383. PMLR, 2022.
- Pubchem substance and compound databases. Nucleic acids research, 44(D1):D1202–D1213, 2016.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013.
- Molecule generation by principal subgraph mining and assembling. Advances in Neural Information Processing Systems, 35:2550–2563, 2022.
- Selfies and the future of molecular string representations. Patterns, 3(10), 2022.
- Grammar variational autoencoder. In International conference on machine learning, pp. 1945–1954. PMLR, 2017.
- Learning deep generative models of graphs. arXiv preprint arXiv:1803.03324, 2018.
- Constrained graph variational autoencoders for molecule design. Advances in neural information processing systems, 31, 2018.
- Molca: Molecular graph-language modeling with cross-modal projector and uni-modal adapter. arXiv preprint arXiv:2310.12798, 2023.
- S2orc: The semantic scholar open research corpus. arXiv preprint arXiv:1911.02782, 2019.
- Graphdf: A discrete flow model for molecular graph generation. In International Conference on Machine Learning, pp. 7192–7203. PMLR, 2021.
- Graphnvp: An invertible flow model for generating molecular graphs. arXiv preprint arXiv:1905.11600, 2019.
- Rational drug design. European journal of pharmacology, pp. 90–100, 2009.
- Mol-cyclegan: a generative model for molecular optimization. Journal of Cheminformatics, 12(1):1–18, 2020.
- Improved denoising diffusion probabilistic models. In International Conference on Machine Learning, pp. 8162–8171. PMLR, 2021.
- Permutation invariant graph generation via score-based generative modeling. In International Conference on Artificial Intelligence and Statistics, pp. 4474–4484. PMLR, 2020.
- OpenAI, R. Gpt-4 technical report. arxiv 2303.08774. View in Article, 2:13, 2023.
- Molecularrnn: Generating realistic molecular graphs with optimized properties. arXiv preprint arXiv:1905.13372, 2019.
- Fréchet chemnet distance: a metric for generative models for molecules in drug discovery. Journal of chemical information and modeling, 58(9):1736–1741, 2018.
- What is high-throughput virtual screening? a perspective from organic materials discovery. Annual Review of Materials Research, 45:195–216, 2015.
- Learning transferable visual models from natural language supervision. In International conference on machine learning, pp. 8748–8763. PMLR, 2021.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, 21(1):5485–5551, 2020.
- Rascal: Calculation of graph similarity using maximum common edge subgraphs. The Computer Journal, 45(6):631–644, 2002.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10684–10695, 2022.
- Fast and accurate modeling of molecular atomization energies with machine learning. Physical review letters, 108(5):058301, 2012.
- Molecular transformer: a model for uncertainty-calibrated chemical reaction prediction. ACS central science, 5(9):1572–1583, 2019.
- Mapping the space of chemical reactions using attention-based neural networks. Nature machine intelligence, 3(2):144–152, 2021.
- Enhancing activity prediction models in drug discovery with the ability to understand human language. arXiv preprint arXiv:2303.03363, 2023.
- Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020.
- A molecular multimodal foundation model associating molecule graphs with natural language. arXiv preprint arXiv:2209.05481, 2022.
- Unassisted noise reduction of chemical reaction datasets. Nature Machine Intelligence, pp. 485–494, 2021.
- Automated extraction of chemical synthesis actions from experimental procedures. Nature communications, 11(1):3601, 2020.
- Digress: Discrete denoising diffusion for graph generation. arXiv preprint, 2022.
- Retrieval-based controllable molecule generation. arXiv preprint arXiv:2208.11126, 2022.
- Weininger, D. Smiles, a chemical language and information system. 1. introduction to methodology and encoding rules. Journal of chemical information and computer sciences, 28(1):31–36, 1988.
- A general offline reinforcement learning framework for interactive recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, pp. 4512–4520, 2021.
- Learning how to propagate messages in graph neural networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 1894–1903, 2021.
- Molbind: Multimodal alignment of language, molecules, and proteins. arXiv preprint, 2023.
- Simple and asymmetric graph contrastive learning without augmentations. Advances in Neural Information Processing Systems, 36, 2024.
- A survey on multimodal large language models. arXiv preprint arXiv:2306.13549, 2023.
- Graph convolutional policy network for goal-directed molecular graph generation. Advances in neural information processing systems, 31, 2018.
- Moflow: an invertible flow model for generating molecuclar graphs. In Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp. 617–626, 2020.
- A deep-learning system bridging molecule structure and biomedical text with comprehension comparable to human professionals. Nature communications, 13(1):862, 2022.
- Gimlet: A unified graph-text model for instruction-based molecule zero-shot learning. bioRxiv, pp. 2023–05, 2023a.
- Michelangelo: Conditional 3d shape generation based on shape-image-text aligned latent representation. arXiv preprint arXiv:2306.17115, 2023b.
- Huaisheng Zhu (13 papers)
- Teng Xiao (40 papers)
- Vasant G Honavar (11 papers)