Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MolecularRNN: Generating realistic molecular graphs with optimized properties (1905.13372v1)

Published 31 May 2019 in cs.LG, cs.AI, q-bio.MN, q-bio.QM, and stat.ML

Abstract: Designing new molecules with a set of predefined properties is a core problem in modern drug discovery and development. There is a growing need for de-novo design methods that would address this problem. We present MolecularRNN, the graph recurrent generative model for molecular structures. Our model generates diverse realistic molecular graphs after likelihood pretraining on a big database of molecules. We perform an analysis of our pretrained models on large-scale generated datasets of 1 million samples. Further, the model is tuned with policy gradient algorithm, provided a critic that estimates the reward for the property of interest. We show a significant distribution shift to the desired range for lipophilicity, drug-likeness, and melting point outperforming state-of-the-art works. With the use of rejection sampling based on valency constraints, our model yields 100% validity. Moreover, we show that invalid molecules provide a rich signal to the model through the use of structure penalty in our reinforcement learning pipeline.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Mariya Popova (4 papers)
  2. Mykhailo Shvets (5 papers)
  3. Junier Oliva (18 papers)
  4. Olexandr Isayev (20 papers)
Citations (153)

Summary

We haven't generated a summary for this paper yet.