Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ArgMed-Agents: Explainable Clinical Decision Reasoning with LLM Disscusion via Argumentation Schemes (2403.06294v2)

Published 10 Mar 2024 in cs.AI, cs.MA, and cs.SC

Abstract: There are two main barriers to using LLMs in clinical reasoning. Firstly, while LLMs exhibit significant promise in NLP tasks, their performance in complex reasoning and planning falls short of expectations. Secondly, LLMs use uninterpretable methods to make clinical decisions that are fundamentally different from the clinician's cognitive processes. This leads to user distrust. In this paper, we present a multi-agent framework called ArgMed-Agents, which aims to enable LLM-based agents to make explainable clinical decision reasoning through interaction. ArgMed-Agents performs self-argumentation iterations via Argumentation Scheme for Clinical Discussion (a reasoning mechanism for modeling cognitive processes in clinical reasoning), and then constructs the argumentation process as a directed graph representing conflicting relationships. Ultimately, use symbolic solver to identify a series of rational and coherent arguments to support decision. We construct a formal model of ArgMed-Agents and present conjectures for theoretical guarantees. ArgMed-Agents enables LLMs to mimic the process of clinical argumentative reasoning by generating explanations of reasoning in a self-directed manner. The setup experiments show that ArgMed-Agents not only improves accuracy in complex clinical decision reasoning problems compared to other prompt methods, but more importantly, it provides users with decision explanations that increase their confidence.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (44)
  1. Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum. JAMA Internal Medicine, 183(6):589–596, 06 2023.
  2. Disc-medllm: Bridging general large language models and real-world medical consultation, 2023.
  3. Computational argumentation-based chatbots: a survey, 2024.
  4. Exploring the potential of large language models in computational argumentation, 2023.
  5. Selection-inference: Exploiting large language models for interpretable logical reasoning, 2022.
  6. Argumentation for explainable reasoning with conflicting medical recommendations. CEUR Workshop Proceedings, 2237, January 2018. 2018 Joint Reasoning with Ambiguous and Conflicting Evidence and Recommendations in Medicine and the 3rd International Workshop on Ontology Modularity, Contextuality, and Evolution, MedRACER + WOMoCoE 2018 ; Conference date: 29-10-2018.
  7. I wish to have an argument: Argumentative reasoning in large language models, 2023.
  8. Computational argumentation and cognition, 2021.
  9. Argumentation: A calculus for human-centric ai. Frontiers in Artificial Intelligence, 5, 2022.
  10. Phan Minh Dung. On the acceptability of arguments and its fundamental role in nonmonotonic reasoning, logic programming and n-person games. Artificial Intelligence, 77(2):321–357, 1995.
  11. Determinants of llm-assisted decision-making, 2024.
  12. On computing explanations in argumentation. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI’15, page 1496–1492. AAAI Press, 2015.
  13. Strategic reasoning with language models, 2023.
  14. An interaction model for merging multi-agent argumentation in shared clinical decision making. In 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pages 4304–4311, 2023.
  15. ISO, IEC: AWI TS 29119-11: Software and systems engineering - software testing - part 11: Testing of AI systems. Technical report, 2020.
  16. Pubmedqa: A dataset for biomedical research question answering, 2019.
  17. What disease does this patient have? a large-scale open domain question answering dataset from medical exams, 2020.
  18. Genegpt: Augmenting large language models with domain tools for improved access to biomedical information, 2023.
  19. Maieutic prompting: Logically consistent reasoning with recursive explanations, 2022.
  20. Resource description framework (rdf): Concepts and abstract syntax. W3C Recommendation, 2004.
  21. Llava-med: Training a large language-and-vision assistant for biomedicine in one day, 2023.
  22. Comparison of history of present illness summaries generated by a chatbot and senior internal medicine residents. JAMA Intern Med, 183(9):1026–1027, Sep 2023.
  23. Can generalist foundation models outcompete special-purpose tuning? case study in medicine, 2023.
  24. Argumentation with goals for clinical decision support in multimorbidity. In Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018. 2031–2033.
  25. OpenAI. Gpt-4 technical report, 2023.
  26. Med-halt: Medical domain hallucination test for large language models, 2023.
  27. Logic-lm: Empowering large language models with symbolic solvers for faithful logical reasoning, 2023.
  28. Unifying large language models and knowledge graphs: A roadmap. IEEE Transactions on Knowledge and Data Engineering, page 1–20, 2024.
  29. Analysis of clinical discussions based on argumentation schemes. Procedia Computer Science, 64:282–289, 2015. Conference on ENTERprise Information Systems/International Conference on Project MANagement/Conference on Health and Social Care Information Systems and Technologies, CENTERIS/ProjMAN / HCist 2015 October 7-9, 2015.
  30. Argumentation schemes for clinical decision support. Argument and Computation, 12(3):329–355, November 2021.
  31. Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine. npj Digital Medicine, 7:20, 2024.
  32. Ehragent: Code empowers large language models for few-shot complex tabular reasoning on electronic health records, 2024.
  33. Large language models encode clinical knowledge, 2022.
  34. Towards expert-level medical question answering with large language models, 2023.
  35. Medagents: Large language models as collaborators for zero-shot medical reasoning, 2024.
  36. Argumentation Schemes. Cambridge University Press, New York, 2008.
  37. Douglas Walton. Argumentation Schemes for Presumptive Reasoning. Routledge, 1st edition, 1996.
  38. A survey on large language model based autonomous agents, 2023.
  39. Chain-of-thought prompting elicits reasoning in large language models, 2023.
  40. Autogen: Enabling next-gen llm applications via multi-agent conversation, 2023.
  41. Travelplanner: A benchmark for real-world planning with language agents, 2024.
  42. LogicNMR: Probing the non-monotonic reasoning ability of pre-trained language models. In Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang, editors, Findings of the Association for Computational Linguistics: EMNLP 2022, pages 3616–3626, Abu Dhabi, United Arab Emirates, December 2022. Association for Computational Linguistics.
  43. Explainable and Argumentation-based Decision Making with Qualitative Preferences for Diagnostics and Prognostics of Alzheimer’s Disease. In Proceedings of the 17th International Conference on Principles of Knowledge Representation and Reasoning, pages 816–826, 9 2020.
  44. Cumulative reasoning with large language models, 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Shengxin Hong (3 papers)
  2. Liang Xiao (80 papers)
  3. Xin Zhang (904 papers)
  4. Jianxia Chen (1 paper)
Citations (1)