Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Semantic Similarity Models for Depression Severity Estimation (2211.07624v2)

Published 14 Nov 2022 in cs.CL

Abstract: Depressive disorders constitute a severe public health issue worldwide. However, public health systems have limited capacity for case detection and diagnosis. In this regard, the widespread use of social media has opened up a way to access public information on a large scale. Computational methods can serve as support tools for rapid screening by exploiting this user-generated social media content. This paper presents an efficient semantic pipeline to study depression severity in individuals based on their social media writings. We select test user sentences for producing semantic rankings over an index of representative training sentences corresponding to depressive symptoms and severity levels. Then, we use the sentences from those results as evidence for predicting users' symptom severity. For that, we explore different aggregation methods to answer one of four Beck Depression Inventory (BDI) options per symptom. We evaluate our methods on two Reddit-based benchmarks, achieving 30\% improvement over state of the art in terms of measuring depression severity.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (47)
  1. Hessam Amini and Leila Kosseim. 2020. Towards Explainability in Using Deep Learning for the Detection of Anorexia in Social Media. Natural Language Processing and Information Systems, 12089:225 – 235.
  2. Preventive strategies for mental health. The Lancet Psychiatry, 5(7):591–604.
  3. UPV-Symanto at eRisk 2021: Mental Health Author Profiling for Early Risk Prediction on the Internet. In Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to - 24th, 2021, volume 2936 of CEUR Workshop Proceedings, pages 908–927. CEUR-WS.org.
  4. Early detection of depression: social network analysis and random forest techniques. Journal of medical Internet research, 21(6):e12554.
  5. Early risk detection of self-harm and depression severity using BERT-based transformers: iLab at CLEF eRisk 2020. Early Risk Prediction on the Internet.
  6. Tianfeng Chai and Roland R Draxler. 2014. Root mean square error (rmse) or mean absolute error (mae). Geoscientific Model Development Discussions, 7(1):1525–1534.
  7. Stevie Chancellor and Munmun De Choudhury. 2020. Methods in predictive techniques for mental health status on social media: a critical review. NPJ digital medicine, 3(1):1–11.
  8. SMHD: a large-scale resource for exploring online language usage for multiple mental health conditions. In Proceedings of the 27th International Conference on Computational Linguistics, pages 1485–1497, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
  9. Quantifying mental health signals in Twitter. In Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pages 51–60, Baltimore, Maryland, USA. Association for Computational Linguistics.
  10. Natural Language Processing of Social Media as Screening for Suicide Risk. Biomedical Informatics Insights, 10:117822261879286.
  11. Early Detection of Mental Health Disorders by Social Media Monitoring. Springer International Publishing.
  12. Social Media as a Measurement Tool of Depression in Populations. In Proceedings of the 5th Annual ACM Web Science Conference, WebSci ’13, page 47–56, New York, NY, USA. Association for Computing Machinery.
  13. A psychometric evaluation of the Beck Depression Inventory–II. Psychological assessment, 10(2):83.
  14. “I didn’t know what was wrong”: How people with undiagnosed depression recognize, name and explain their distress. Journal of general internal medicine, 25(9):954–961.
  15. Methodological gaps in predicting mental health states from social media: Triangulating diagnostic signals. CHI ’19, page 1–16, New York, NY, USA. Association for Computing Machinery.
  16. Characterization of time-variant and time-invariant assessment of suicidality on reddit using c-ssrs. PloS one, 16(5):e0250448.
  17. Do models of mental health based on social media data generalize? In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 3774–3788, Online. Association for Computational Linguistics.
  18. Detection of mental health from Reddit via deep contextualized representations. In Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis, pages 147–156, Online. Association for Computational Linguistics.
  19. Payam Karisani and Eugene Agichtein. 2018. Did You Really Just Have a Heart Attack? Towards Robust Detection of Personal Health Mentions in Social Media. WWW ’18, page 137–146, Republic and Canton of Geneva, CHE. International World Wide Web Conferences Steering Committee.
  20. The PHQ-9: validity of a brief depression severity measure. Journal of general internal medicine, 16(9):606–613.
  21. The use of the Beck Depression Inventory to screen for depression in the general population: a preliminary analysis. Journal of affective disorders, 57(1-3):261–265.
  22. Overview of erisk 2019 early risk prediction on the internet. In International Conference of the Cross-Language Evaluation Forum for European Languages, pages 340–357. Springer.
  23. eRisk 2020: Self-harm and depression challenges. In European Conference on Information Retrieval, pages 557–563. Springer.
  24. Early Mental Health Risk Assessment through Writing Styles, Topics and Neural Models. In CLEF (Working Notes).
  25. Understanding depressive symptoms and psychosocial stressors on twitter: a corpus-based study. Journal of medical Internet research, 19(2):e6895.
  26. Early identification of depression severity levels on reddit using ordinal classification. In Proceedings of the ACM Web Conference 2022, pages 2563–2572.
  27. Improving the generalizability of depression detection by leveraging clinical questionnaires. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022, pages 8446–8459. Association for Computational Linguistics.
  28. Overview of eRisk 2021: Early risk prediction on the internet. In International Conference of the Cross-Language Evaluation Forum for European Languages, pages 324–344. Springer.
  29. Psychological aspects of natural language use: Our words, our selves. Annual review of psychology, 54(1):547–577.
  30. A randomised controlled trial of the effectiveness of a program for early detection and treatment of depression in primary care. Journal of affective disorders, 198:96–101.
  31. Automatic depression score estimation with word embedding models. Artificial Intelligence in Medicine, 132:102380.
  32. Dsm-5 field trials in the united states and canada, part ii: test-retest reliability of selected categorical diagnoses. American journal of psychiatry, 170(1):59–70.
  33. Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.
  34. On the validity of the beck depression inventory. Psychopathology, 31(3):160–168.
  35. A survey of computational methods for online mental state assessment on social media. ACM Trans. Comput. Healthcare, 2(2).
  36. Okapi at TREC-3. Nist Special Publication Sp, 109:109.
  37. Transfer Learning for Automated Responses to the BDI Questionnaire. In Working Notes of CLEF 2021 – Conference and Labs of the Evaluation Forum, volume 2936, pages 1046–1058, Bucharest, Romania.
  38. BioInfo@UAVR at eRisk 2020: on the Use of Psycholinguistics Features and Machine Learning for the Classification and Quantification of Mental Diseases. In Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum, Thessaloniki, Greece, September 22-25, 2020, volume 2696 of CEUR Workshop Proceedings. CEUR-WS.org.
  39. Utilizing Neural Networks and Linguistic Metadata for Early Detection of Depression Indications in Text Sequences. IEEE Transactions on Knowledge and Data Engineering, 32:588–601.
  40. Ana-Sabina Uban and Paolo Rosso. 2020. Deep learning architectures and strategies for early detection of self-harm and depression level prediction. In CEUR Workshop Proceedings, volume 2696, pages 1–12. Sun SITE Central Europe.
  41. Stigma, biomarkers, and algorithmic bias: recommendations for precision behavioral health with artificial intelligence. JAMIA Open, 3(1):9–15.
  42. Shih-Hung Wu and Zhao-Jun Qiu. 2021. A RoBERTa-based model on measuring the severity of the signs of depression. In Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to - 24th, 2021, volume 2936 of CEUR Workshop Proceedings, pages 1071–1080. CEUR-WS.org.
  43. Learning to answer psychological questionnaire for personality detection. In Findings of the Association for Computational Linguistics: EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 16-20 November, 2021, pages 1131–1142. Association for Computational Linguistics.
  44. Depression and self-harm risk assessment in online forums. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2968–2978, Copenhagen, Denmark. Association for Computational Linguistics.
  45. Psychiatric scale guided risky post screening for early detection of depression. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI 2022, Vienna, Austria, 23-29 July 2022, pages 5220–5226. ijcai.org.
  46. Symptom identification for interpretable detection of multiple mental disorders. arXiv preprint arXiv:2205.11308.
  47. Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology. Association for Computational Linguistics, Seattle, USA.
Citations (6)

Summary

We haven't generated a summary for this paper yet.