
Text-based classification of interviews for mental health -- juxtaposing the state of the art (2008.01543v1)

Published 29 Jul 2020 in cs.CL, cs.LG, cs.SD, eess.AS, and stat.ML

Abstract: Currently, the state of the art for classification of psychiatric illness is audio-based. This thesis aims to design and evaluate a state-of-the-art text classification network on this challenge. The hypothesis is that a well-designed text-based approach can compete strongly with the state-of-the-art audio-based approaches. Dutch natural language models are limited by the scarcity of pre-trained monolingual NLP models; as a result, they capture long-range semantic dependencies across sentences poorly. To address this issue, this thesis presents belabBERT, a new Dutch language model extending the RoBERTa[15] architecture. belabBERT is trained on a large Dutch corpus (+32GB) of web-crawled texts. After evaluating the strength of text-based classification, the thesis briefly explores extending the framework to a hybrid text- and audio-based classification. The goal of this hybrid framework is to show the principle of hybridisation with a very basic audio-classification network. The overall goal is to create the foundations for a hybrid psychiatric illness classification by proving that the new text-based classification is already a strong stand-alone solution.
