ClinicalMamba: A Generative Clinical Language Model on Longitudinal Clinical Notes (2403.05795v1)
Abstract: The advancement of NLP systems in healthcare hinges on the ability of LLMs to interpret the intricate information contained within clinical notes. This process often requires integrating information from various time points in a patient's medical history. However, most earlier clinical LLMs were pretrained with a context length limited to roughly one clinical document. In this study, we introduce ClinicalMamba, a specialized version of the Mamba LLM, pretrained on a large corpus of longitudinal clinical notes to address the unique linguistic characteristics and information-processing needs of the medical domain. ClinicalMamba, released at 130 million and 2.8 billion parameters, demonstrates superior performance in modeling clinical language across extended text lengths compared to Mamba and clinical Llama. With few-shot learning, ClinicalMamba achieves notable speed and accuracy, outperforming existing clinical LLMs and general-domain large models such as GPT-4 on longitudinal clinical note information extraction tasks.
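To make the few-shot setup concrete, below is a minimal sketch of prompting a Mamba-style causal LM for clinical information extraction with Hugging Face `transformers`. The checkpoint name (a public general-domain Mamba model standing in for ClinicalMamba), the toy notes, and the extraction target (medication mentions) are illustrative assumptions, not the authors' released model or their evaluation tasks.

```python
# Minimal sketch: few-shot information extraction with a Mamba-style causal LM.
# Assumptions: the general-domain "state-spaces/mamba-130m-hf" checkpoint stands
# in for a clinical pretrained model, and the notes/task below are toy examples.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "state-spaces/mamba-130m-hf"  # swap in a clinical checkpoint if available
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Few-shot prompt: two labeled demonstrations followed by the query note.
prompt = (
    "Note: Patient admitted with chest pain; started on aspirin.\n"
    "Medications: aspirin\n\n"
    "Note: Longstanding diabetes managed with metformin and insulin.\n"
    "Medications: metformin, insulin\n\n"
    "Note: Hypertension controlled on lisinopril.\n"
    "Medications:"
)
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=16)

# Decode only the continuation, i.e., the model's extracted answer.
completion = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(completion.strip())
```

Because Mamba's selective state-space layers scale linearly with sequence length, the same prompting pattern extends naturally to the long, multi-note histories the paper targets, whereas quadratic-attention clinical models are typically capped near one document of context.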
Authors: Zhichao Yang, Avijit Mitra, Sunjae Kwon, Hong Yu