Evaluating Zero-Shot Scoring for In Vitro Antibody Binding Prediction with Experimental Validation (2312.05273v1)
Abstract: The success of therapeutic antibodies relies on their ability to selectively bind antigens. AI-based antibody design protocols have shown promise in generating epitope-specific designs. Many of these protocols use an inverse folding step to generate diverse sequences given a backbone structure. Due to prohibitive screening costs, it is key to identify candidate sequences likely to bind in vitro. Here, we compare the efficacy of 8 common scoring paradigms based on open-source models to classify antibody designs as binders or non-binders. We evaluate these approaches on a novel surface plasmon resonance (SPR) dataset, spanning 5 antigens. Our results show that existing methods struggle to detect binders, and performance is highly variable across antigens. We find that metrics computed on flexibly docked antibody-antigen complexes are more robust, and ensembles scores are more consistent than individual metrics. We provide experimental insight to analyze current scoring techniques, highlighting that the development of robust, zero-shot filters is an important research gap.
- Origins of specificity and affinity in antibody–protein interactions. Proceedings of the National Academy of Sciences, 111(26):E2656–E2665, 2014.
- Learning inverse folding from millions of predicted structures. pages 8946–8970, 2022.
- Robust deep learning–based protein sequence design using ProteinMPNN. Science, 378(6615):49–56, October 2022. doi: 10.1126/science.add2187. URL https://doi.org/10.1126/science.add2187.
- Computational scoring and experimental evaluation of enzymes generated by neural networks. bioRxiv, pages 2023–03, 2023.
- Protein structure generation via folding diffusion. 2022.
- Unlocking de novo antibody design with generative artificial intelligence. bioRxiv, pages 2023–01, 2023.
- Improving de novo protein binder design with deep learning. Nature Communications, 14(1):2625, 2023.
- Deep learning enables therapeutic antibody optimization in mammalian cells by deciphering high-dimensional protein sequence space. BioRxiv, page 617860, 2019.
- Generative diffusion models for antibody design, docking, and optimization. bioRxiv, 2023. doi: 10.1101/2023.09.25.559190.
- Efficient evolution of human antibodies from general protein language models. Nature Biotechnology, 2023.
- Language models of protein sequences at the scale of evolution enable accurate structure prediction. BioRxiv, 2022:500902, 2022.
- Immunebuilder: Deep-learning models for predicting the structures of immune proteins. Communications Biology, 6(1):575, 2023.
- Wolfgang Kabsch. A solution for the best rotation to relate two sets of vectors. Acta Crystallographica Section A: Crystal Physics, Diffraction, Theoretical and General Crystallography, 32(5):922–923, 1976.
- End-to-end full-atom antibody design. ICML Conference Poster, 2023.
- Sabdab: the structural antibody database. Nucleic acids research, 42(D1):D1140–D1146, 2014.
- Openmm 7: Rapid development of high performance algorithms for molecular dynamics. PLoS computational biology, 13(7):e1005659, 2017.