Protein-Ligand Scoring with Convolutional Neural Networks (1612.02751v1)

Published 8 Dec 2016 in stat.ML, cs.LG, and q-bio.BM

Abstract: Computational approaches to drug discovery can reduce the time and cost associated with experimental assays and enable the screening of novel chemotypes. Structure-based drug design methods rely on scoring functions to rank and predict binding affinities and poses. The ever-expanding amount of protein-ligand binding and structural data enables the use of deep machine learning techniques for protein-ligand scoring. We describe convolutional neural network (CNN) scoring functions that take as input a comprehensive 3D representation of a protein-ligand interaction. A CNN scoring function automatically learns the key features of protein-ligand interactions that correlate with binding. We train and optimize our CNN scoring functions to discriminate between correct and incorrect binding poses and known binders and non-binders. We find that our CNN scoring function outperforms the AutoDock Vina scoring function when ranking poses both for pose prediction and virtual screening.

Citations (221)

View on Semantic Scholar

Summary

The paper introduces a novel CNN architecture that transforms 3D protein-ligand grids to learn interaction features, outperforming traditional models.
The CNN models demonstrated superior inter-target pose prediction and virtual screening performance compared to AutoDock Vina on benchmarks like DUD-E.
Data augmentation and visualization techniques enhance model generalizability and interpretability, revealing critical biochemical interactions aligned with experimental results.

Analyzing CNN-Based Protein-Ligand Scoring Functions

The paper "Protein-Ligand Scoring with Convolutional Neural Networks" by Ragoza et al. presents an in-depth exploration of employing convolutional neural networks (CNNs) in the evaluation of protein-ligand interactions. Traditional methods in structure-based drug design, such as empirical or knowledge-based scoring functions, parameterize physical interactions into predetermined models. However, these methods often face limitations in flexibility and expressiveness. In contrast, machine learning-based scoring functions hold the potential to learn complex interactions and structures directly from labeled data, although at the expense of interpretability and with a risk of overfitting.

Methodological Insights and Optimization

The authors introduce CNN models that process comprehensive 3D representations of protein-ligand interactions. These models are notably independent of pre-extracted features that can limit expressiveness in traditional models. In order to create effective CNN scoring functions, they optimize the network's architecture through a detailed examination of various parameters such as atom types, occupancy types, grid resolution, network depth, layer width, and pooling mechanisms. A significant part of their methodology is the transformation of ligand-protein structural data into 3D grids, which provides a rich input format that both represents atomic types and maintains spatial resolution. Another important optimization step involves enhancing the models to prevent overfitting, through data augmentation techniques like random rotation and translation.

Results and Performance Evaluation

The CNN models are tested across different tasks: pose prediction, virtual screening, and affinity prediction. CNN-based scoring functions outperform AutoDock Vina in inter-target pose prediction, providing superior ROC AUC measurements. In virtual screening, CNN models also show enhanced performance on the DUD-E benchmark, indicating efficacy in distinguishing active compounds from inactives. However, it was noted that intra-target pose ranking, a task closer to typical drug-design scenarios, did not exhibit as strong performance as inter-target ranking.

For practical applications, the model combining CSAR-based pose prediction and DUD-E virtual screening training data increased the general applicability across both pose prediction and virtual screening tasks, illustrating successful multi-task learning in deep networks despite occasionally lesser per-target performance compared to models trained on isolated datasets. The research includes insights into the model's behavior with independent test sets, such as the challenging MUV dataset, confirming the models' robustness.

Visualization and Interpretability

Utilizing visualization algorithms based on masking techniques, the authors provide modest insights into model interpretability. By evaluating protein-ligand interactions and their influence on binding predictions, they bridge some of the interpretability gaps inherent to neural networks. Interestingly, critical residues determined by the model align with experimental findings, suggesting that the model captures relevant biochemical interactions.

Implications and Future Directions

The paper exemplifies CNN's potential in modeling sophisticated biochemical interactions beyond the reach of traditional scoring functions. While its approach is promising, several areas remain for future exploration: enhancing intra-target ranking, incorporating affinity data in training, and addressing overfitting risks with more rigorous validation and regularization strategies. Moreover, integrating different types of biochemical data might yield a multi-objective CNN that effectively navigates the intricate landscape of structure-based drug discovery.

The paper demonstrates that CNN scoring functions could redefine protein-ligand interaction assessment in computational drug discovery, offering adaptive and potentially more accurate insights. This advancement may inspire further developments in applying deep learning architectures to a range of complex biochemical problems, potentially impacting AI applications in biological and pharmaceutical domains on a broader scale.

PDF Markdown