2000 character limit reached
Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences With Attention (2108.08077v1)
Published 18 Aug 2021 in q-bio.QM and cs.LG
Abstract: Current methods for viral discovery target evolutionarily conserved proteins that accurately identify virus families but remain unable to distinguish the zoonotic potential of newly discovered viruses. Here, we apply an attention-enhanced long-short-term memory (LSTM) deep neural net classifier to a highly conserved viral protein target to predict zoonotic potential across betacoronaviruses. The classifier performs with a 94% accuracy. Analysis and visualization of attention at the sequence and structure-level features indicate possible association between important protein-protein interactions governing viral replication in zoonotic betacoronaviruses and zoonotic transmission.
- Kahini Wadhawan (11 papers)
- Payel Das (104 papers)
- Barbara A. Han (3 papers)
- Ilya R. Fischhoff (1 paper)
- Adrian C. Castellanos (1 paper)
- Arvind Varsani (1 paper)
- Kush R. Varshney (121 papers)