Effective Protein-Protein Interaction Exploration with PPIretrieval (2402.03675v1)
Abstract: Protein-protein interactions (PPIs) are crucial in regulating numerous cellular functions, including signal transduction, transportation, and immune defense. As the accuracy of multi-chain protein complex structure prediction improves, the challenge has shifted towards effectively navigating the vast complex universe to identify potential PPIs. Herein, we propose PPIretrieval, the first deep learning-based model for protein-protein interaction exploration, which leverages existing PPI data to effectively search for potential PPIs in an embedding space, capturing rich geometric and chemical information of protein surfaces. When provided with an unseen query protein with its associated binding site, PPIretrieval effectively identifies a potential binding partner along with its corresponding binding site in an embedding space, facilitating the formation of protein-protein complexes.
- Protein-protein interaction (ppi) network: recent advances in drug discovery. Current drug metabolism, 18(1):5–10, 2017.
- Dockq: a quality measure for protein-protein docking models. PloS one, 11(8):e0161879, 2016.
- Efficient curvature estimation for oriented point clouds. stat, 1050:26, 2019.
- Cuturi, M. Sinkhorn distances: Lightspeed computation of optimal transport. Advances in neural information processing systems, 26, 2013.
- Protein–protein interactions essentials: key concepts to building and analyzing interactome networks. PLoS computational biology, 6(6):e1000807, 2010.
- Fry, D. C. Protein–protein interactions as targets for small molecule drug discovery. Peptide Science: Original Research on Biomolecules, 84(6):535–552, 2006.
- Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning. Nature Methods, 17(2):184–192, 2020.
- Hierarchical graph learning for protein–protein interaction. Nature Communications, 14(1):1093, 2023.
- Structure-based protein function prediction using graph convolutional networks. Nature communications, 12(1):3168, 2021.
- High-order pooling for graph neural networks with tensor decomposition. Advances in Neural Information Processing Systems, 35:6021–6033, 2022.
- Mudiff: Unified diffusion for complete molecule generation. arXiv preprint arXiv:2304.14621, 2023.
- Structure-based drug design with geometric deep learning. Current Opinion in Structural Biology, 79:102548, 2023.
- Robust 3d shape correspondence in the spectral domain. In IEEE International Conference on Shape Modeling and Applications 2006 (SMI’06), pp. 19–19. IEEE, 2006.
- Lévy, B. Laplace-beltrami eigenfunctions towards an algorithm that” understands” geometry. In IEEE International Conference on Shape Modeling and Applications 2006 (SMI’06), pp. 13–13. IEEE, 2006.
- Jigsaw: Learning to assemble multiple fractured objects. arXiv preprint arXiv:2305.17975, 2023.
- Complete the missing half: Augmenting aggregation filtering with diversification for graph convolutional networks. arXiv preprint arXiv:2008.08844, 2020.
- Is heterophily a real nightmare for graph neural networks to do node classification? arXiv preprint arXiv:2109.05641, 2021.
- Revisiting heterophily for graph neural networks. Advances in neural information processing systems, 35:1362–1375, 2022.
- Dips-plus: The enhanced database of interacting protein structures for interface prediction. Scientific Data, 10(1):509, 2023.
- A lock-and-key model for protein–protein interactions. Bioinformatics, 22(16):2012–2019, 2006.
- Deeprank-gnn: a graph neural network framework to learn patterns in protein–protein interfaces. Bioinformatics, 39(1):btac759, 2023.
- E (n) equivariant graph neural networks. In International conference on machine learning, pp. 9323–9332. PMLR, 2021.
- Small molecules, big targets: drug discovery faces the protein–protein interaction challenge. Nature Reviews Drug Discovery, 15(8):533–550, 2016.
- A concise and provably informative multi-scale signature based on heat diffusion. In Computer graphics forum, number 5, pp. 1383–1392. Wiley Online Library, 2009.
- Sequence-based prediction of protein protein interaction using a deep-learning algorithm. BMC bioinformatics, 18:1–8, 2017.
- Fast end-to-end learning on protein surfaces. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15272–15281, 2021.
- Scannet: an interpretable geometric deep learning model for structure-based protein binding site prediction. Nature Methods, 19(6):730–739, 2022.
- Fast and accurate protein structure search with foldseek. Nature Biotechnology, pp. 1–4, 2023.
- Learning harmonic molecular representations on riemannian manifold. arXiv preprint arXiv:2303.15520, 2023.
- Whitford, D. Proteins: structure and function. John Wiley & Sons, 2013.
- Pointcontrast: Unsupervised pre-training for 3d point cloud understanding. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part III 16, pp. 574–591. Springer, 2020.
- Tm-align: a protein structure alignment algorithm based on the tm-score. Nucleic acids research, 33(7):2302–2309, 2005.
- Protein representation learning by geometric structure pretraining. arXiv preprint arXiv:2203.06125, 2022.
- Chenqing Hua (18 papers)
- Connor Coley (6 papers)
- Guy Wolf (79 papers)
- Doina Precup (206 papers)
- Shuangjia Zheng (21 papers)