
Learning Multimodal Graph-to-Graph Translation for Molecular Optimization (1812.01070v3)

Published 3 Dec 2018 in cs.LG, cs.AI, cs.NE, and stat.ML

Abstract: We view molecular optimization as a graph-to-graph translation problem. The goal is to learn to map from one molecular graph to another with better properties based on an available corpus of paired molecules. Since molecules can be optimized in different ways, there are multiple viable translations for each input graph. A key challenge is therefore to model diverse translation outputs. Our primary contributions include a junction tree encoder-decoder for learning diverse graph translations along with a novel adversarial training method for aligning distributions of molecules. Diverse output distributions in our model are explicitly realized by low-dimensional latent vectors that modulate the translation process. We evaluate our model on multiple molecular optimization tasks and show that our model outperforms previous state-of-the-art baselines.

Citations (215)

Summary

  • The paper introduces a novel junction tree encoder-decoder architecture with stochastic latent codes to enable diverse molecular translations.
  • The paper employs adversarial training to align generated molecular distributions with real data, ensuring valid and applicable outputs.
  • The paper demonstrates superior performance over state-of-the-art models on benchmarks for drug-likeness, biological activity, and logP optimization.

Multimodal Graph-to-Graph Translation for Molecular Optimization

The paper "Learning Multimodal Graph-to-Graph Translation for Molecular Optimization" tackles molecular optimization as a graph-to-graph translation problem. The approach rests on the idea that molecules can be represented as graphs, so optimization amounts to converting one molecular graph into another with superior properties. The paper's main contributions are a junction tree encoder-decoder and an adversarial training method that together support this translation task.

Key Contributions

The core challenge addressed here is the multimodal nature of molecular optimization: a single molecular structure can be optimized in multiple ways, so the model must capture diverse translation possibilities. The researchers introduce several techniques to tackle this:

  1. Junction Tree Encoder-Decoder: This architecture encodes molecular graphs into compact tree-structured components, allowing for efficient and flexible graph-to-graph mappings. The model uses neural attention mechanisms to decode these components back into optimized molecular graphs.
  2. Stochastic Latent Codes: To embed this diversity directly into the translation process, the model conditions the decoder on low-dimensional latent vectors. Sampling different vectors yields different translations of the same input, without substantially increasing model complexity or inference cost.
  3. Adversarial Training: The adversarial component ensures the reliability of molecular translations by aligning generated molecular distributions with observed ones. This alignment is crucial to maintain the validity and applicability of the model outputs.
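The role of the latent codes can be illustrated with a deliberately simplified sketch. This is a toy stand-in, not the paper's actual architecture: `encode` and `decode` are placeholder functions, and the "molecule" is just a list of numbers. The point is the mechanism — the source embedding is combined with a sampled latent vector `z`, and different draws of `z` steer the decoder toward different outputs for the same input.

```python
import random

random.seed(0)

def encode(graph_features):
    """Toy stand-in for the junction tree encoder: returns a fixed embedding."""
    mean = sum(graph_features) / len(graph_features)
    return [mean] * 4

def decode(embedding, z):
    """Toy stand-in for the decoder: the output depends on both the source
    embedding and the sampled latent code z."""
    return tuple(round(e + zi, 2) for e, zi in zip(embedding, z))

source = [0.1, 0.4, 0.7]  # stand-in for a molecular graph
emb = encode(source)

# Sampling several low-dimensional latent vectors yields several distinct
# "translations" of the same source -- the mechanism behind diverse outputs.
translations = set()
for _ in range(5):
    z = [random.gauss(0.0, 1.0) for _ in range(4)]
    translations.add(decode(emb, z))

print(len(translations))  # one input, multiple distinct outputs
```

In the actual model the deterministic encoder-decoder is replaced by graph and tree neural networks, and the latent code is trained (variationally) so that it captures the residual information distinguishing one valid target from another.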

Experimental Evaluation

The model was tested on three molecular optimization tasks: enhancing drug-likeness (QED), improving biological activity against the dopamine D2 receptor (DRD2), and increasing penalized logP scores. Across all benchmarks, the proposed model outperforms state-of-the-art baselines such as the Junction Tree VAE and variational sequence-to-sequence models. Particularly notable is its ability to discover molecules that combine the desired properties with high output diversity, both critical in drug discovery applications.
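Output diversity is commonly measured as the average pairwise Tanimoto distance between fingerprints of the generated molecules. The sketch below shows that computation with fingerprints mocked as plain Python sets of feature bits (real evaluations would use a cheminformatics toolkit such as RDKit to compute Morgan fingerprints):

```python
from itertools import combinations

def tanimoto_distance(fp_a, fp_b):
    """1 - Tanimoto (Jaccard) similarity, with fingerprints as sets of bits."""
    union = len(fp_a | fp_b)
    if union == 0:
        return 0.0
    return 1.0 - len(fp_a & fp_b) / union

def diversity(fingerprints):
    """Average pairwise Tanimoto distance over a batch of generated outputs."""
    pairs = list(combinations(fingerprints, 2))
    if not pairs:
        return 0.0
    return sum(tanimoto_distance(a, b) for a, b in pairs) / len(pairs)

# Hypothetical fingerprints for three molecules generated from one input.
outputs = [{1, 2, 3, 4}, {1, 2, 5, 6}, {3, 4, 5, 6}]
print(round(diversity(outputs), 3))  # prints 0.667
```

A diversity near 0 means the model collapses to essentially one translation per input; higher values indicate it genuinely exploits the multimodal output space.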

The model's performance, quantified by property improvement and output diversity, indicates higher efficacy in translating one molecular configuration into multiple desirable variants. Its ability to generalize well and consistently generate novel molecules not seen in the training data underscores the practical value of the methodology.

Implications and Future Directions

The implications of this research are substantial in the realms of drug discovery and chemical synthesis. By effectively capturing the multimodal nature of chemical optimization tasks, this model presents a promising tool that pharmaceutical researchers can use to explore the vast chemical space more thoroughly. The application of graph neural networks to encode complex molecular transformations could significantly aid the efficiency and effectiveness of discovering new therapeutic agents.

For future developments, the integration of more refined adversarial techniques might reduce the dependency on hand-crafted transformations. Additionally, expanding this framework to capture even broader chemical properties could further enhance its applicability. Exploring the use of this model in conjunction with reinforcement learning strategies could also unlock new avenues for automated drug design and optimization.

In conclusion, this paper offers an advanced and nuanced approach to the graph-to-graph translation problem in molecular optimization, promising not just improvements in performance metrics, but also significant contributions toward the practical viability of AI-driven chemical synthesis tools.