Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments (2405.11178v1)
Abstract: The shortage of clinical workforce presents significant challenges in mental healthcare, limiting access to formal diagnostics and services. We aim to tackle this shortage by integrating a customized LLM into the workflow, thus promoting equity in mental healthcare for the general population. Although LLMs have showcased their capability in clinical decision-making, their adaptation to severe conditions like Post-traumatic Stress Disorder (PTSD) remains largely unexplored. Therefore, we collect 411 clinician-administered diagnostic interviews and devise a novel approach to obtain high-quality data. Moreover, we build a comprehensive framework to automate PTSD diagnostic assessments based on interview contents by leveraging two state-of-the-art LLMs, GPT-4 and Llama-2, with potential for broader clinical diagnoses. Our results illustrate strong promise for LLMs, tested on our dataset, to aid clinicians in diagnostic validation. To the best of our knowledge, this is the first AI system that fully automates assessments for mental illness based on clinician-administered interviews.
Authors:
- Sichang Tu
- Abigail Powers
- Natalie Merrill
- Negar Fani
- Sierra Carter
- Stephen Doogan
- Jinho D. Choi