Generation of 3D Molecules in Pockets via Language Model (2305.10133v3)
Abstract: Generative models for molecules based on sequential line notation (e.g. SMILES) or graph representation have attracted an increasing interest in the field of structure-based drug design, but they struggle to capture important 3D spatial interactions and often produce undesirable molecular structures. To address these challenges, we introduce Lingo3DMol, a pocket-based 3D molecule generation method that combines LLMs and geometric deep learning technology. A new molecular representation, fragment-based SMILES with local and global coordinates, was developed to assist the model in learning molecular topologies and atomic spatial positions. Additionally, we trained a separate noncovalent interaction predictor to provide essential binding pattern information for the generative model. Lingo3DMol can efficiently traverse drug-like chemical spaces, preventing the formation of unusual structures. The Directory of Useful Decoys-Enhanced (DUD-E) dataset was used for evaluation. Lingo3DMol outperformed state-of-the-art methods in terms of drug-likeness, synthetic accessibility, pocket binding mode, and molecule generation speed.
- Wei Feng (208 papers)
- Lvwei Wang (1 paper)
- Zaiyun Lin (2 papers)
- Yanhao Zhu (3 papers)
- Han Wang (418 papers)
- Jianqiang Dong (1 paper)
- Rong Bai (2 papers)
- Huting Wang (1 paper)
- Jielong Zhou (2 papers)
- Wei Peng (164 papers)
- Bo Huang (66 papers)
- Wenbiao Zhou (2 papers)