SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction (2408.05696v1)

Published 11 Aug 2024 in cs.LG and q-bio.QM

Abstract: In drug discovery, predicting the absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties of small-molecule drugs is critical for ensuring safety and efficacy. However, the process of accurately predicting these properties is often resource-intensive and requires extensive experimental data. To address this challenge, we propose SMILES-Mamba, a two-stage model that leverages both unlabeled and labeled data through a combination of self-supervised pretraining and fine-tuning strategies. The model first pre-trains on a large corpus of unlabeled SMILES strings to capture the underlying chemical structure and relationships, before being fine-tuned on smaller, labeled datasets specific to ADMET tasks. Our results demonstrate that SMILES-Mamba exhibits competitive performance across 22 ADMET datasets, achieving the highest score in 14 tasks, highlighting the potential of self-supervised learning in improving molecular property prediction. This approach not only enhances prediction accuracy but also reduces the dependence on large, labeled datasets, offering a promising direction for future research in drug discovery.
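The abstract outlines a two-stage workflow: self-supervised pretraining on unlabeled SMILES strings, followed by fine-tuning on small labeled ADMET datasets. The sketch below is a minimal illustration of that workflow, not the authors' code: a GRU stands in for the Mamba state-space blocks, and the vocabulary, model sizes, and the `unlabeled_smiles` / `labeled_pairs` data are illustrative placeholders.

```python
# Minimal sketch of the two-stage pretrain/fine-tune workflow described in the
# abstract. NOT the authors' implementation: a GRU replaces the Mamba blocks,
# and all data/hyperparameters are placeholders.
import torch
import torch.nn as nn

VOCAB = {ch: i + 1 for i, ch in enumerate("CNOFPSclBrI()=#123456[]+-@/\\")}  # toy SMILES vocab
PAD, VOCAB_SIZE = 0, len(VOCAB) + 1

def encode(smiles, max_len=64):
    # Character-level tokenization with padding (illustrative only).
    ids = [VOCAB.get(ch, PAD) for ch in smiles[:max_len]]
    return torch.tensor(ids + [PAD] * (max_len - len(ids)))

class SmilesEncoder(nn.Module):
    """Token embedding + sequence model (GRU here; Mamba blocks in the paper)."""
    def __init__(self, d_model=128):
        super().__init__()
        self.embed = nn.Embedding(VOCAB_SIZE, d_model, padding_idx=PAD)
        self.seq = nn.GRU(d_model, d_model, batch_first=True)

    def forward(self, x):
        h, _ = self.seq(self.embed(x))
        return h  # (batch, seq_len, d_model)

# Stage 1: self-supervised pretraining -- predict the next token of each SMILES.
encoder = SmilesEncoder()
lm_head = nn.Linear(128, VOCAB_SIZE)
opt = torch.optim.Adam(list(encoder.parameters()) + list(lm_head.parameters()), lr=1e-3)
unlabeled_smiles = ["CCO", "c1ccccc1", "CC(=O)O"]  # placeholder unlabeled corpus
for smi in unlabeled_smiles:
    tokens = encode(smi).unsqueeze(0)
    logits = lm_head(encoder(tokens[:, :-1]))
    loss = nn.functional.cross_entropy(
        logits.reshape(-1, VOCAB_SIZE), tokens[:, 1:].reshape(-1), ignore_index=PAD)
    opt.zero_grad(); loss.backward(); opt.step()

# Stage 2: fine-tune the pretrained encoder on a small labeled ADMET endpoint
# (shown as regression; classification tasks would swap the head and loss).
head = nn.Linear(128, 1)
ft_opt = torch.optim.Adam(list(encoder.parameters()) + list(head.parameters()), lr=1e-4)
labeled_pairs = [("CCO", 0.3), ("CC(=O)O", 1.2)]  # placeholder (SMILES, property) data
for smi, y in labeled_pairs:
    tokens = encode(smi).unsqueeze(0)
    pred = head(encoder(tokens).mean(dim=1)).squeeze()
    loss = nn.functional.mse_loss(pred, torch.tensor(y))
    ft_opt.zero_grad(); loss.backward(); ft_opt.step()
```

The point of the sketch is the division of labor: the encoder's weights are shaped by the large unlabeled corpus in stage 1, so the small labeled ADMET sets in stage 2 only need to adapt them, which is how the paper reduces dependence on extensive experimental data.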

Authors (8)
  1. Bohao Xu (3 papers)
  2. Yingzhou Lu (15 papers)
  3. Chenhao Li (27 papers)
  4. Ling Yue (13 papers)
  5. Xiao Wang (507 papers)
  6. Nan Hao (3 papers)
  7. Tianfan Fu (53 papers)
  8. Jim Chen (3 papers)
Citations (8)
