- The paper demonstrates that bidirectional LSTM networks capture long-range dependencies, significantly improving protein secondary structure prediction.
- It achieves 67.4% accuracy on the CB513 dataset, outperforming models like GSNs, CNFs, and the SSpro8 system.
- The novel integration of feed-forward layers between the bidirectional LSTM layers offers practical insights for advancing bioinformatics applications in drug discovery and protein engineering.
Protein Secondary Structure Prediction with Long Short Term Memory Networks
The paper discusses the application of Long Short-Term Memory (LSTM) networks, a variant of Recurrent Neural Networks (RNNs), to predicting protein secondary structure from amino acid sequences. This task is integral to bioinformatics because it provides insights into protein folding and function, which are essential for understanding biochemical processes and for drug design.
Background and Methodology
Traditional approaches to protein secondary structure prediction have relied on feed-forward neural networks and Support Vector Machines (SVMs) applied over sliding windows. These models struggle to capture dependencies between non-adjacent amino acids, because a fixed-size window limits the context available at each position. RNNs naturally accommodate sequential data, but conventional RNNs suffer from vanishing gradients, which hinder their ability to learn long-term dependencies. LSTM cells are crucial here: their gated, additive cell state mitigates the vanishing gradient problem and allows dependencies spanning hundreds of time steps to be learned.
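To make the gating mechanism concrete, here is a minimal NumPy sketch of a single LSTM step (illustrative only, not the paper's implementation; the dimensions and gate ordering are assumptions). The key detail is the additive cell-state update `c = f * c_prev + i * g`, which lets gradients flow across many steps instead of being repeatedly squashed.

```python
import numpy as np

def lstm_step(x, h_prev, c_prev, W, U, b):
    """One LSTM step. W: (4H, D), U: (4H, H), b: (4H,).
    Gates stacked in the (assumed) order: input, forget, output, candidate."""
    H = h_prev.shape[0]
    z = W @ x + U @ h_prev + b
    i = 1.0 / (1.0 + np.exp(-z[0:H]))        # input gate
    f = 1.0 / (1.0 + np.exp(-z[H:2*H]))      # forget gate
    o = 1.0 / (1.0 + np.exp(-z[2*H:3*H]))    # output gate
    g = np.tanh(z[3*H:4*H])                  # candidate cell update
    c = f * c_prev + i * g                   # additive update: gradients can persist
    h = o * np.tanh(c)                       # gated hidden output
    return h, c

# Run over a random sequence; D = 21 mimics a one-hot amino acid encoding.
rng = np.random.default_rng(0)
D, H = 21, 8
W = rng.standard_normal((4 * H, D)) * 0.1
U = rng.standard_normal((4 * H, H)) * 0.1
b = np.zeros(4 * H)
h, c = np.zeros(H), np.zeros(H)
for t in range(50):
    h, c = lstm_step(rng.standard_normal(D), h, c, W, U, b)
```

Because the forget gate scales the previous cell state multiplicatively rather than passing it through a saturating nonlinearity at every step, the cell can carry information across long stretches of a protein sequence.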
The featured paper enhances predictive capability by employing bidirectional LSTM networks, which process the sequence in both forward and backward directions so that each position's prediction can draw on its full sequence context. Moreover, the paper introduces an architecture that inserts feed-forward layers between the recurrent layers: the forward and backward hidden states are concatenated at each position and passed through additional feed-forward transformations before the next recurrent layer.
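The bidirectional-plus-feed-forward pattern can be sketched as follows. This is a simplified illustration, not the paper's model: for brevity it uses a plain tanh recurrence in place of full LSTM cells, and all dimensions, parameter names, and the single feed-forward layer are assumptions.

```python
import numpy as np

def rnn_pass(X, Wx, Wh, reverse=False):
    """Simple tanh recurrence over X (T, D); returns hidden states (T, H)."""
    T = X.shape[0]
    H = Wh.shape[0]
    h, out = np.zeros(H), np.zeros((T, H))
    steps = range(T - 1, -1, -1) if reverse else range(T)
    for t in steps:
        h = np.tanh(Wx @ X[t] + Wh @ h)
        out[t] = h
    return out

def bidirectional_block(X, p):
    """Forward and backward passes, concatenated per position, followed by a
    position-wise feed-forward layer and an 8-class softmax."""
    fwd = rnn_pass(X, p["Wx_f"], p["Wh_f"])
    bwd = rnn_pass(X, p["Wx_b"], p["Wh_b"], reverse=True)
    h = np.concatenate([fwd, bwd], axis=1)                    # (T, 2H)
    z = np.maximum(0.0, h @ p["W_ff"] + p["b_ff"])            # feed-forward layer
    logits = z @ p["W_out"] + p["b_out"]                      # (T, 8): one score per class
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

# Tiny usage example with random weights (shapes are illustrative).
T, D, H, F = 12, 21, 16, 32
rng = np.random.default_rng(1)
params = {
    "Wx_f": rng.standard_normal((H, D)) * 0.1, "Wh_f": rng.standard_normal((H, H)) * 0.1,
    "Wx_b": rng.standard_normal((H, D)) * 0.1, "Wh_b": rng.standard_normal((H, H)) * 0.1,
    "W_ff": rng.standard_normal((2 * H, F)) * 0.1, "b_ff": np.zeros(F),
    "W_out": rng.standard_normal((F, 8)) * 0.1, "b_out": np.zeros(8),
}
probs = bidirectional_block(rng.standard_normal((T, D)), params)
```

The design point is that the backward pass gives each residue access to downstream context, which a sliding-window model of fixed width cannot provide.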
Results and Significance
Evaluated on the CB513 dataset, the bidirectional LSTM models achieved an accuracy of 67.4% on the challenging 8-class secondary structure classification task. This surpasses prior benchmarks, including Generative Stochastic Networks (GSNs) and Conditional Neural Fields (CNFs), and clearly outperforms the SSpro8 system, which relied on bidirectional RNNs (Q8 accuracy: 0.511).
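Q8 accuracy itself is simply the fraction of residues whose predicted class (out of the eight DSSP states H, G, I, E, B, T, S, C) matches the true label. A minimal sketch, with hypothetical prediction and label strings:

```python
import numpy as np

def q8_accuracy(pred, true):
    """Per-residue 8-class accuracy: fraction of positions predicted correctly."""
    pred, true = np.asarray(pred), np.asarray(true)
    return float((pred == true).mean())

# Hypothetical per-residue DSSP labels for an 8-residue fragment.
pred = list("HHHECCTT")
true = list("HHHECCCS")
acc = q8_accuracy(pred, true)  # 6 of 8 positions match -> 0.75
```

In practice this is computed over every residue in the test set, so long chains contribute more positions than short ones.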
Theoretical and Practical Implications
The successful application of LSTMs in this domain marks a significant advancement in computational biology. The ability of LSTM networks to model long-range dependencies provides a robust tool for capturing complex relationships inherent in biological sequences. Practically, this improvement can enhance model applications in drug discovery, enzyme design, and synthetic biology by providing more accurate structural predictions.
Future Prospects
The paper opens avenues for further exploration into more sophisticated architectures involving RNNs and feed-forward networks. There is potential in experimenting with different configurations of LSTM units and integrating more comprehensive evolutionary information as inputs to these networks. Moreover, continuing advancements in GPU technology and deep learning frameworks like Theano and Lasagne are likely to facilitate even more complex models, further pushing the boundaries of protein structural prediction.
In conclusion, the research presents a notable step forward in using neural networks for bioinformatics tasks, emphasizing the strengths of LSTM networks in sequence data modeling and setting a new bar for accuracy in the domain of protein secondary structure prediction.