CPLLM: Clinical Prediction with Large Language Models (2309.11295v2)

Published 20 Sep 2023 in cs.CL, cs.AI, and cs.LG

Abstract: We present Clinical Prediction with LLMs (CPLLM), a method that involves fine-tuning a pre-trained LLM for clinical disease and readmission prediction. We utilized quantization and fine-tuned the LLM using prompts. For diagnosis prediction, we predict whether patients will be diagnosed with a target disease during their next visit or in the subsequent diagnosis, leveraging their historical diagnosis records. We compared our results to various baselines, including RETAIN and Med-BERT, the current state-of-the-art model for disease prediction using temporal structured EHR data. In addition, we evaluated CPLLM for patient hospital readmission prediction and compared our method's performance with benchmark baselines. Our experiments have shown that our proposed method, CPLLM, surpasses all the tested models in terms of PR-AUC and ROC-AUC metrics, showing state-of-the-art results for diagnosis prediction and patient hospital readmission prediction. Such a method can be easily implemented and integrated into the clinical process to help care providers estimate the next steps of patients.

References (56)
  1. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. URL https://www.tensorflow.org/. Software available from tensorflow.org.
  2. Large language models are few-shot clinical information extractors. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp.  1998–2022, 2022.
  3. Falcon-40B: an open large language model with state-of-the-art performance. Findings of the Association for Computational Linguistics: ACL, 2023:10755–10773, 2023.
  4. Leo Breiman. Random forests. Machine learning, 45:5–32, 2001.
  5. Boosting transformers and language models for clinical prediction in immunotherapy. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), pp.  332–340, Toronto, Canada, July 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.acl-industry.32. URL https://aclanthology.org/2023.acl-industry.32.
  6. RETAIN: An interpretable predictive model for healthcare using reverse time attention mechanism. Advances in neural information processing systems, 29, 2016.
  7. The relationship between precision-recall and ROC curves. In Proceedings of the 23rd international conference on Machine learning, pp. 233–240, 2006.
  8. QLoRA: Efficient finetuning of quantized LLMs. arXiv preprint arXiv:2305.14314, 2023.
  9. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
  10. Anne Elixhauser. Clinical Classifications Software (CCS) 2009. http://www.hcup-us.ahrq.gov/toolssoftware/ccs/ccs.jsp, 2009.
  11. Clinical Classifications Software (CCS). US Agency for Healthcare Research and Quality, 2014.
  12. GPT-3: Its nature, scope, limits, and consequences. Minds and Machines, 30:681–694, 2020.
  13. A survey on text classification algorithms: From text to predictions. Information, 13(2):83, 2022.
  14. Patient event sequences for predicting hospitalization length of stay. In International Conference on Artificial Intelligence in Medicine, pp.  51–56. Springer, 2023.
  15. DeBERTa: Decoding-enhanced BERT with disentangled attention. arXiv preprint arXiv:2006.03654, 2020.
  16. Applied logistic regression, volume 398. John Wiley & Sons, 2013.
  17. Parameter-efficient transfer learning for NLP. In International Conference on Machine Learning, pp. 2790–2799. PMLR, 2019.
  18. Health system-scale language models are all-purpose prediction engines. Nature, pp.  1–6, 2023.
  19. MIMIC-IV. PhysioNet. Available online at: https://physionet.org/content/mimiciv/1.0/ (accessed August 23, 2021), 2020.
  20. Clinical prediction rules: a review and suggested modifications of methodological standards. JAMA, 277(6):488–494, 1997.
  21. BEHRT: transformer for electronic health records. Scientific reports, 10(1):7155, 2020.
  22. Hi-BEHRT: Hierarchical transformer-based model for accurate prediction of clinical events using multimodal longitudinal electronic health records. IEEE journal of biomedical and health informatics, 27(2):1106–1117, 2022a.
  23. Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences. arXiv preprint arXiv:2201.11838, 2022b.
  24. RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692, 2019.
  25. ClinicalT5: A generative language model for clinical text. In Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 5436–5443, 2022.
  26. ConCare: Personalized clinical feature embedding via capturing the healthcare context. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pp. 833–840, 2020.
  27. Ready or not! Here comes ICD-10. Journal of neurointerventional surgery, 5(1):86–91, 2013.
  28. Health data analytics using scalable logistic regression with stochastic gradient descent. International Journal of Advanced Intelligence Paradigms, 10(1-2):118–132, 2018.
  29. Bidirectional representation learning from transformers using multimodal electronic health record data to predict depression. IEEE Journal of Biomedical and Health Informatics, 25(8):3121–3129, 2021.
  30. Augmented language models: a survey. ArXiv, 2023.
  31. Anatomical Therapeutic Chemical classification system (ATC). Dictionary of Pharmaceutical Medicine, pp. 8–8, 2009.
  32. Deepr: a convolutional net for medical records. IEEE journal of biomedical and health informatics, 21(1):22–30, 2016.
  33. OpenAI. GPT-4 technical report. arXiv preprint arXiv:2303.08774, 2023.
  34. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011.
  35. The eICU Collaborative Research Database, a freely available multi-center database for critical care research. Scientific data, 5(1):1–13, 2018.
  36. Lutz Prechelt. Early stopping-but when? In Neural Networks: Tricks of the trade, pp.  55–69. Springer, 2002.
  37. Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction. NPJ digital medicine, 4(1):86, 2021.
  38. BLOOM: A 176B-parameter open-access multilingual language model. arXiv preprint arXiv:2211.05100, 2022.
  39. Federated learning of medical concepts embedding using BEHRT. arXiv preprint arXiv:2305.13052, 2023.
  40. Large language models encode clinical knowledge. Nature, pp.  1–9, 2023.
  41. HealthPrompt: A zero-shot learning paradigm for clinical natural language processing. In AMIA Annual Symposium Proceedings, volume 2022, pp. 972. American Medical Informatics Association, 2022.
  42. Language models are an effective representation learning technique for electronic health record data. Journal of biomedical informatics, 113:103637, 2021.
  43. Text classification via large language models. arXiv preprint arXiv:2305.08377, 2023.
  44. Large language models in medicine. Nature medicine, pp.  1–11, 2023.
  45. LLaMA: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971, 2023a.
  46. Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288, 2023b.
  47. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  48. BioMedLM: a domain-specific large language model for biomedical text. MosaicML. Accessed Dec. 23, 2022.
  49. Clinical prediction rules: applications and methodological standards. New England Journal of Medicine, 313(13):793–799, 1985.
  50. HuggingFace's Transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771, 2019.
  51. PyHealth: A deep learning toolkit for healthcare predictive modeling. In Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2023, 2023a. URL https://github.com/sunlabuiuc/PyHealth.
  52. Harnessing the power of LLMs in practice: A survey on ChatGPT and beyond. arXiv preprint arXiv:2304.13712, 2023b.
  53. A large language model for electronic health records. NPJ Digital Medicine, 5(1):194, 2022.
  54. Almanac—retrieval-augmented language models for clinical medicine. NEJM AI, 1(2):AIoa2300068, 2024.
  55. GRASP: generic framework for health status representation learning based on incorporating knowledge from similar patients. In Proceedings of the AAAI conference on artificial intelligence, volume 35, pp. 715–723, 2021.
  56. A survey of large language models. arXiv preprint arXiv:2303.18223, 2023.
Authors (2)
  1. Ofir Ben Shoham (4 papers)
  2. Nadav Rappoport (8 papers)
Citations (20)

Summary

Insights into CPLLM: Clinical Prediction with LLMs

The paper "CPLLM: Clinical Prediction with LLMs" addresses the integration of LLMs into clinical prediction tasks, focusing on disease and patient hospital readmission prediction. The authors propose a novel approach that fine-tunes pre-trained LLMs, using quantization and prompt-based training, to model sequential electronic health record (EHR) data.

Methodological Advancements

At the core of this research is the CPLLM framework, which leverages LLMs to predict clinical events by encoding patient histories as text sequences. The method applies to multiple tasks, including predicting a patient's next diagnosis and estimating the likelihood of hospital readmission within a set timeframe. Notably, fine-tuning these models requires no clinical-domain pre-training tasks, which distinguishes CPLLM from existing models such as Med-BERT.
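
To make the formulation concrete, here is a minimal sketch of how a patient's diagnosis history might be serialized into a prediction prompt. The template wording, function name, and example diagnoses are illustrative assumptions, not the paper's exact prompt.

```python
# Minimal sketch: serializing a diagnosis history into a text prompt for
# binary next-diagnosis prediction. Template wording and field names are
# illustrative assumptions, not the exact prompt used by CPLLM.

def build_prompt(diagnosis_history: list[str], target_disease: str) -> str:
    history_text = ", ".join(diagnosis_history)
    return (
        f"The patient's previous diagnoses, in order, were: {history_text}. "
        f"Will the patient be diagnosed with {target_disease} at the next "
        f"visit? Answer yes or no:"
    )

prompt = build_prompt(
    ["essential hypertension", "type 2 diabetes", "chronic kidney disease"],
    "acute and unspecified renal failure",
)
print(prompt)
```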

The researchers used two LLMs, Llama2 and BioMedLM, adapting them with QLoRA, a parameter-efficient fine-tuning technique that trains low-rank adapters on top of a quantized base model. Notably, the approach does not require auxiliary inputs such as length of stay (LOS) or explicit visit ordering, which are often difficult to obtain and are integral to models like Med-BERT.
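
The fine-tuning setup can be approximated with standard open-source tooling. The sketch below shows a QLoRA-style configuration using the Hugging Face transformers, peft, and bitsandbytes libraries; the checkpoint name, adapter hyperparameters, and target modules are plausible defaults rather than the paper's reported settings.

```python
# Sketch of QLoRA-style fine-tuning of Llama 2 for binary clinical
# classification. Checkpoint name, r, alpha, and target modules are
# assumed defaults, not the paper's reported configuration.
import torch
from transformers import AutoModelForSequenceClassification, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize base weights to 4 bits
    bnb_4bit_quant_type="nf4",              # NormalFloat4, as in QLoRA
    bnb_4bit_compute_dtype=torch.bfloat16,  # do matmuls in bfloat16
)

model = AutoModelForSequenceClassification.from_pretrained(
    "meta-llama/Llama-2-13b-hf",            # gated checkpoint; assumed example
    num_labels=2,                           # outcome occurs / does not occur
    quantization_config=bnb_config,
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],    # adapters on attention projections
    task_type="SEQ_CLS",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()          # adapters are a tiny fraction
```

From here, training proceeds as ordinary supervised fine-tuning over the serialized patient prompts, with only the adapter weights updated.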

Data and Evaluation

The performance of CPLLM was benchmarked against state-of-the-art baselines on two well-known datasets: MIMIC-IV, drawn from a single medical center, and eICU-CRD, a multi-center critical-care database, together covering a wide range of ICD-9 and ICD-10 coded conditions. The evaluation metrics were PR-AUC and ROC-AUC, and CPLLM consistently surpassed the other models across tasks.
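
Both reported metrics are available in scikit-learn, which the paper's references include; PR-AUC is computed below as average precision, a standard estimator of the area under the precision-recall curve. The labels and scores are toy data for illustration only.

```python
# Computing the two reported metrics with scikit-learn on toy data.
from sklearn.metrics import average_precision_score, roc_auc_score

y_true = [0, 1, 0, 1, 1, 0]               # ground-truth outcomes
y_score = [0.2, 0.7, 0.4, 0.9, 0.6, 0.1]  # predicted probability of outcome

pr_auc = average_precision_score(y_true, y_score)  # area under PR curve
roc_auc = roc_auc_score(y_true, y_score)           # area under ROC curve
print(f"PR-AUC: {pr_auc:.3f}, ROC-AUC: {roc_auc:.3f}")
```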

For instance, in predicting acute and unspecified renal failure, CPLLM achieved a PR-AUC of 45.442%, markedly higher than the baseline models. Similarly, for hospital readmission prediction, it outperformed the competing models by a notable margin on both datasets.

Implications and Future Directions

CPLLM presents a versatile framework that can be integrated into existing healthcare systems to provide predictive insights that could improve patient management strategies. Its ability to handle long input sequences, with context lengths far exceeding those of BERT-based models, means it can process long EHR histories without the heavy preprocessing that other models require.
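
As a rough illustration of that difference (BERT-style encoders are typically capped at 512 tokens, while Llama 2 accepts 4,096), one can tokenize a long synthetic history and check which context window it fits; the checkpoint name is an assumed example.

```python
# Checking how much coded history fits in a context window. 512 and 4096
# are the standard maximum sequence lengths of BERT and Llama 2; the
# checkpoint is a gated repo and an assumed example.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-13b-hf")
history = "; ".join(["essential hypertension"] * 300)  # long synthetic record
n_tokens = len(tokenizer(history)["input_ids"])
print(f"{n_tokens} tokens | fits BERT (512): {n_tokens <= 512} | "
      f"fits Llama 2 (4096): {n_tokens <= 4096}")
```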

The practical implications include its adaptability to scenarios where detailed LOS data is unavailable, and to settings where rapid deployment matters more than a lengthy domain-specific pre-training stage.

Despite its strengths, CPLLM does require substantial computing resources for fine-tuning LLMs, which may present a barrier in resource-constrained environments. Additionally, the customization of prompts for different tasks raises questions about the general applicability of predefined prompts across diverse datasets.

Conclusion

In summary, "CPLLM: Clinical Prediction with LLMs" showcases an innovative application of LLMs in medicine, offering a method that is robust and flexible and that performs well against traditional clinical prediction models. Future work would benefit from integrating domain-specific retrieval augmentation to capitalize on the latest advances in LLMs and further improve prediction quality. This research sets a new benchmark for LLMs in clinical applications, marking a significant step beyond conventional natural language processing tasks.
