CRISPR: Ensemble Model (2403.03018v1)
Abstract: Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a gene editing technology that has revolutionized the fields of biology and medicine. However, one of the challenges of using CRISPR is predicting the on-target efficacy and off-target sensitivity of single-guide RNAs (sgRNAs). This is because most existing methods are trained on separate datasets with different genes and cells, which limits their generalizability. In this paper, we propose a novel ensemble learning method for sgRNA design that is accurate and generalizable. Our method combines the predictions of multiple machine learning models to produce a single, more robust prediction. This approach allows us to learn from a wider range of data, which improves the generalizability of our model. We evaluated our method on a benchmark dataset of sgRNA designs and found that it outperformed existing methods in terms of both accuracy and generalizability. Our results suggest that our method can be used to design sgRNAs with high sensitivity and specificity, even for new genes or cells. This could have important implications for the clinical use of CRISPR, as it would allow researchers to design more effective and safer treatments for a variety of diseases.
- Deepcrispr: optimized crispr guide rna design by deep learning. Genome biology, 19:1–18, 2018.
- Cas-offinder: a fast and versatile algorithm that searches for potential off-target sites of cas9 rna-guided endonucleases. Bioinformatics, 30(10):1473–1475, 2014.
- Crispor: intuitive guide selection for crispr/cas9 genome editing experiments and screens. Nucleic acids research, 46(W1):W242–W245, 2018.
- Chopchop v3: expanding the crispr web toolbox beyond genome editing. Nucleic acids research, 47(W1):W171–W174, 2019.
- Cctop: an intuitive, flexible and reliable crispr/cas9 target prediction tool. PloS one, 10(4):e0124633, 2015.
- “off-spotter”: very fast and exhaustive enumeration of genomic lookalikes for designing crispr/cas guide rnas. Biology direct, 10:1–10, 2015.
- Optimized sgrna design to maximize activity and minimize off-target effects of crispr-cas9. Nature biotechnology, 34(2):184–191, 2016.
- Sequence determinants of improved crispr sgrna design. Genome research, 25(8):1147–1157, 2015.
- Crisprscan: designing highly efficient sgrnas for crispr-cas9 targeting in vivo. Nature methods, 12(10):982–988, 2015.
- David H Wolpert. Stacked generalization. Neural networks, 5(2):241–259, 1992.
- Optimized crispr guide rna design for two high-fidelity cas9 variants by deep learning. Nature communications, 10(1):4284, 2019.
- Prediction of the sequence-specific cleavage activity of cas9 variants. Nature Biotechnology, 38(11):1328–1336, 2020.