Evaluation of AI Chatbots for Patient-Specific EHR Questions
Abstract: This paper investigates the use of artificial intelligence chatbots for patient-specific question answering (QA) from clinical notes using several LLM based systems: ChatGPT (versions 3.5 and 4), Google Bard, and Claude. We evaluate the accuracy, relevance, comprehensiveness, and coherence of the answers generated by each model using a 5-point Likert scale on a set of patient-specific questions.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.