Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information (2402.10551v1)
Abstract: Cancer remains a global challenge due to its growing clinical and economic burden. Its uniquely personal manifestation, which makes treatment difficult, has fuelled the quest for personalized treatment strategies. Thus, genomic profiling is increasingly becoming part of clinical diagnostic panels. Effective use of such panels requires accurate drug response prediction (DRP) models, which are challenging to build due to limited labelled patient data. Previous methods to address this problem have used various forms of transfer learning. However, they do not explicitly model the variable length sequential structure of the list of mutations in such diagnostic panels. Further, they do not utilize auxiliary information (like patient survival) for model training. We address these limitations through a novel transformer based method, which surpasses the performance of state-of-the-art DRP models on benchmark data. We also present the design of a treatment recommendation system (TRS), which is currently deployed at the National University Hospital, Singapore and is being evaluated in a clinical trial.
- The consequence of oncomorphic TP53 mutations in ovarian cancer. International journal of molecular sciences 14, 9 (2013), 19257–19275.
- Progression-free survival is a surrogate for survival in advanced colorectal cancer. Journal of clinical oncology 25, 33 (2007), 5218–5224.
- The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer discovery 2, 5 (2012), 401–404.
- The GENIE BPC NSCLC Cohort: A Real-World Repository Integrating Standardized Clinical and Genomic Data for 1,846 Patients with Non–Small Cell Lung Cancer. Clinical Cancer Research 29, 17 (2023), 3418–3428.
- ClinicalTrials.gov. 2023. Neural Network-based Treatment Decision Support Tool in Patients With Refractory Solid Organ Malignancies (DRUID). Retrieved February 7, 2024 from https://clinicaltrials.gov/study/NCT05719428
- Artificial intelligence in cancer research and precision medicine: Applications, limitations and priorities to drive transformation in the delivery of equitable and unbiased care. Cancer Treatment Reviews 112 (2023), 102498.
- David R Cox. 1972. Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological) 34, 2 (1972), 187–202.
- Explainable AI (XAI): Core ideas, techniques, and solutions. Comput. Surveys 55, 9 (2023), 1–33.
- Survival analysis with PyTorch. Retrieved February 8, 2024 from https://github.com/havakv/pycox
- Stephane Fotso. 2018. Deep neural networks for survival analysis based on a multi-task framework. arXiv preprint arXiv:1801.05512 (2018).
- Michael F Gensheimer and Balasubramanian Narasimhan. 2019. A scalable discrete-time survival model for neural networks. PeerJ 7 (2019), e6257.
- Next-generation characterization of the cancer cell line encyclopedia. Nature 569, 7757 (2019), 503–508.
- A context-aware deconfounding autoencoder for robust prediction of personalized clinical drug response from cell-line compound screening. Nature Machine Intelligence 4, 10 (2022), 879–892.
- A multi-task domain-adapted model to predict chemotherapy response from mutations in recurrently altered cancer genes. medRxiv (2023), 2023–11.
- Deep generative neural network for accurate drug response imputation. Nature communications 12, 1 (2021), 1740.
- Edward L Kaplan and Paul Meier. 1958. Nonparametric estimation from incomplete observations. Journal of the American statistical association 53, 282 (1958), 457–481.
- DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC medical research methodology 18, 1 (2018), 1–12.
- The possibility of clinical sequencing in the management of cancer. Japanese Journal of Clinical Oncology 46, 5 (2016), 399–406.
- Håvard Kvamme and Ørnulf Borgan. 2019. Continuous and discrete-time survival prediction with neural networks. arXiv preprint arXiv:1910.06724 (2019).
- ClinVar: improving access to variant interpretations and supporting evidence. Nucleic acids research 46, D1 (2018), D1062–D1067.
- Deephit: A deep learning approach to survival analysis with competing risks. In Proceedings of the AAAI conference on artificial intelligence, Vol. 32.
- Seungyeoun Lee and Heeju Lim. 2019. Review of statistical methods for survival analysis using genomic data. Genomics & informatics 17, 4 (2019).
- A protein-centric approach for exome variant aggregation enables sensitive association analysis with clinical outcomes. Human mutation 41, 5 (2020), 934–945.
- Molecular tumor boards in clinical practice. Trends in Cancer 6, 9 (2020), 738–744.
- Few-shot learning creates predictive models of drug response that translate from high-throughput screens to individual patients. Nature Cancer 2, 2 (2021), 233–244.
- Clinical and analytical validation of FoundationOne® CDx, a comprehensive genomic profiling assay for solid tumors. PLoS One 17, 3 (2022), e0264138.
- Collaborative strategies for deploying AI-based physician decision support systems: challenges and deployment approaches. npj Digital Medicine 6, 1 (2023), 137.
- Harry L Morgan. 1965. The generation of a unique machine description for chemical structures-a technique developed at chemical abstracts service. Journal of chemical documentation 5, 2 (1965), 107–113.
- PRECISE: a domain adaptation approach to transfer predictors of drug response from pre-clinical models to tumors. Bioinformatics 35, 14 (2019), i510–i519.
- TUGDA: task uncertainty guided domain adaptation for robust generalization of cancer drug response prediction from in vitro to in vivo settings. Bioinformatics 37, Supplement_1 (2021), i76–i83.
- PanDrugs: a novel method to prioritize anticancer drug treatments according to individual genomic data. Genome medicine 10 (2018), 1–11.
- Metastatic non-small cell lung cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Annals of Oncology 29 (2018), iv192–iv237.
- Single-cell transcriptomics of NRAS-mutated melanoma transitioning to drug resistance reveals P2RX7 as an indicator of early drug response. Cell Reports 42, 7 (2023).
- On the use of Harrell’s C for clinical risk prediction via random survival forests. Expert Systems with Applications 63 (2016), 450–459.
- Out-of-distribution generalization from labelled and unlabelled gene expression data for drug response prediction. Nature Machine Intelligence 3, 11 (2021), 962–972.
- AITL: adversarial inductive transfer learning with input and output space adaptation for pharmacogenomics. Bioinformatics 36, Supplement_1 (2020), i380–i388.
- New guidelines to evaluate the response to treatment in solid tumors. Journal of the National Cancer Institute 92, 3 (2000), 205–216.
- Molecular tumour boards—current and future considerations for precision oncology. Nature Reviews Clinical Oncology 20, 12 (2023), 843–863.
- Advanced colorectal cancer: ESMO Clinical Practice Guidelines for treatment. Annals of oncology 21 (2010), v93–v97.
- Attention is all you need. Advances in neural information processing systems 30 (2017).
- The coming decade in precision oncology: six riddles. Nature Reviews Cancer 23, 1 (2023), 43–54.
- Evaluation and Comparison of Real-World Databases for Conducting Research in Patients With Colorectal Cancer. JCO Clinical Cancer Informatics 7 (2023), e2200184.
- ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic acids research 38, 16 (2010), e164–e164.
- Machine learning for survival analysis: A survey. ACM Computing Surveys (CSUR) 51, 6 (2019), 1–36.
- Evaluation of the trusight oncology 500 assay for routine clinical testing of tumor mutational burden and clinical utility for predicting response to pembrolizumab. The Journal of Molecular Diagnostics 24, 6 (2022), 600–608.
- The cancer genome atlas pan-cancer analysis project. Nature genetics 45, 10 (2013), 1113–1120.
- Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic acids research 41, D1 (2012), D955–D961.
- Learning patient-specific cancer survival distributions as a sequence of dependent regressors. Advances in neural information processing systems 24 (2011).
- Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nature medicine 23, 6 (2017), 703–713.