LLMs in Healthcare: A Comprehensive Overview
The paper "A Survey of LLMs for Healthcare: from Data, Technology, and Applications to Accountability and Ethics" provides a detailed examination of the development, application, and implications of LLMs in the healthcare domain. It thoroughly explores the shift from Pretrained LLMs (PLMs) to LLMs, contrasting their distinctive characteristics and emphasizing the growing applicability of LLMs. This survey is significant as it captures the nuances of integrating advanced AI technologies into healthcare, focusing on the technical, ethical, and practical dimensions.
The paper first delineates the transition from PLMs to LLMs, emphasizing a substantial paradigm shift from discriminative to generative AI, alongside a shift from model-centered to data-centered approaches. The authors argue that while PLMs served foundational tasks such as Named Entity Recognition (NER), Relation Extraction (RE), and Text Classification (TC), LLMs show potential for more advanced applications, such as question answering and dialogue systems that emulate clinical interactions. They propose that LLMs improve efficiency and effectiveness through emergent capabilities such as contextual reasoning and instruction following.
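To make the contrast concrete, the sketch below pairs a PLM-style extraction pipeline with an LLM-style instruction prompt. It is a minimal illustration assuming the Hugging Face `transformers` library; both model identifiers are placeholders rather than models named in the survey.

```python
from transformers import pipeline

# PLM-style usage: a fine-tuned encoder with a task-specific head performs
# token-level extraction (NER). The checkpoint name is a placeholder for any
# biomedical NER model.
ner = pipeline("token-classification",
               model="placeholder/biomedical-ner",
               aggregation_strategy="simple")
entities = ner("The patient was started on metformin for type 2 diabetes.")

# LLM-style usage: the same note is handled generatively through an
# instruction-following prompt, with no task-specific head at all.
chat = pipeline("text-generation", model="placeholder/instruction-tuned-llm")
prompt = (
    "Read the clinical note, list the medications and diagnoses, and state "
    "whether the prescription matches the diagnosis.\n"
    "Note: The patient was started on metformin for type 2 diabetes."
)
answer = chat(prompt, max_new_tokens=128)[0]["generated_text"]

print(entities)
print(answer)
```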
Healthcare-specific adaptations of LLMs are highlighted, including models that incorporate medical knowledge bases or are further trained on healthcare text corpora. The paper compares implementations such as Med-PaLM and GatorTron, evaluated on healthcare-specific benchmarks such as USMLE-style examinations and PubMedQA. A significant finding is that these models show increased proficiency on medical comprehension tasks, narrowing the gap to human-level performance.
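As an illustration of how such benchmark comparisons are typically scored, here is a minimal accuracy loop in the style of PubMedQA's yes/no/maybe format. The `ask_model` stub and the sample records are placeholders introduced for this sketch, not part of the survey or the official dataset release.

```python
# Minimal accuracy scoring in the style of a PubMedQA yes/no/maybe benchmark.
# `ask_model` stands in for whatever inference call wraps the model under
# evaluation; the sample records below are illustrative, not official data.

def ask_model(question: str, context: str) -> str:
    """Stand-in model: always answers 'yes' so the loop runs end to end."""
    return "yes"

def evaluate(records: list[dict]) -> float:
    correct = 0
    for rec in records:
        pred = ask_model(rec["question"], rec["context"]).strip().lower()
        correct += int(pred == rec["label"])
    return correct / len(records)

sample = [
    {"question": "Does metformin lower HbA1c in type 2 diabetes?",
     "context": "Pooled trial data report consistent HbA1c reductions.",
     "label": "yes"},
    {"question": "Is drug X recommended as first-line therapy?",
     "context": "Guidelines list drug X only as a second-line option.",
     "label": "no"},
]

print(f"accuracy: {evaluate(sample):.2%}")  # 50.00% with the stand-in model
```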
The authors investigate ethical considerations at length, notably fairness, accountability, transparency, and ethics in deploying LLMs in healthcare. They argue for addressing biases that may be embedded in training data and propose transparency measures as a means of strengthening model accountability. This aligns with global discussions on the responsible use of AI, concerns that become more acute when sensitive health information is involved.
Robustness and adaptability are further concerns the paper addresses. It underscores the need for models to remain reliable under diverse and unforeseen inputs, a critical requirement when health outcomes are at stake. Moreover, the tendency of these models to hallucinate or confabulate information mandates stringent evaluation to ensure factual accuracy and reliability.
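Stringent evaluation can start with very simple grounding checks. The sketch below is a deliberately crude example added here for illustration, not a method from the survey: it flags numeric claims in a generated answer that never appear in the source passage, a pattern that catches some fabricated dosages and statistics.

```python
import re

def flag_unsupported_numbers(answer: str, source: str) -> list[str]:
    """Return numeric values that appear in the answer but not in the source.

    A crude consistency check: production pipelines rely on entailment models
    or expert review, but even simple grounding tests surface some confabulated
    figures such as invented dosages, dates, or sample sizes.
    """
    answer_numbers = set(re.findall(r"\d+(?:\.\d+)?", answer))
    source_numbers = set(re.findall(r"\d+(?:\.\d+)?", source))
    return sorted(answer_numbers - source_numbers)

source = "The trial enrolled 412 patients and found a 7.1% absolute risk reduction."
answer = "The trial enrolled 500 patients and found a 7.1% risk reduction."

print(flag_unsupported_numbers(answer, source))  # ['500'] is unsupported by the source
```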
The implications of this research are multifaceted. Practically, it highlights the potential to enhance healthcare delivery through AI-driven efficiency and accuracy. Theoretically, it opens avenues for future AI research to focus on refining the complex decision-making capabilities intrinsic to healthcare environments. For policymakers and stakeholders, the synthesis offered by the paper could guide the formulation of guidelines and standards ensuring technology adoption that aligns with ethical and professional practices in healthcare.
The paper also speculates on future developments, highlighting multimodal LLMs and AI agents as promising directions. Integrating visual data alongside text could significantly transform diagnostic and consultative processes. The authors likewise foresee a critical role for AI agents that autonomously perform complex tasks, streamlining operations within healthcare settings.
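The agent pattern the authors anticipate can be reduced to a short loop: the model repeatedly selects a tool, observes its result, and stops when it judges the task complete. The toy sketch below assumes its own conventions; `call_llm`, the tool registry, and the `tool:`/`final:` reply format are all placeholders rather than anything specified in the survey.

```python
from typing import Callable

# Toy tool registry: real deployments would wrap scheduling systems, EHR
# queries, or other hospital services behind audited interfaces.
TOOLS: dict[str, Callable[[str], str]] = {
    "lookup_record": lambda patient_id: f"Record for {patient_id}: last HbA1c 7.2%",
    "schedule_followup": lambda patient_id: f"Follow-up booked for {patient_id}",
}

def call_llm(history: list[str]) -> str:
    """Stand-in for a chat-completion call. The assumed convention is that the
    model replies either 'tool:<name>:<argument>' or 'final:<answer>'; it is
    hard-coded here so the loop runs without a real model."""
    if len(history) == 1:
        return "tool:lookup_record:patient-42"
    return "final:HbA1c reviewed, no follow-up needed"

def run_agent(task: str, max_steps: int = 5) -> str:
    history = [task]
    for _ in range(max_steps):
        decision = call_llm(history)
        if decision.startswith("final:"):
            return decision.removeprefix("final:")
        _, name, argument = decision.split(":", 2)
        history.append(TOOLS[name](argument))  # run the chosen tool, feed the result back
    return "stopped: step limit reached"

print(run_agent("Check patient-42's latest HbA1c and summarize."))
```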
In summary, the paper offers a meticulous exploration of LLMs in healthcare, articulating both the technological advances and the challenges that accompany them. By spanning data, technology, and applications through to accountability and ethics, it lays a grounded foundation for the responsible use of AI in healthcare. The insights gathered underscore the transformative potential of LLMs while cautioning that the complexities of these powerful tools must be navigated responsibly.