2000 character limit reached
The Large Language Model GreekLegalRoBERTa (2410.12852v1)
Published 10 Oct 2024 in cs.CL and cs.LG
Abstract: We develop four versions of GreekLegalRoBERTa, which are four LLMs trained on Greek legal and nonlegal text. We show that our models surpass the performance of GreekLegalBERT, Greek- LegalBERT-v2, and GreekBERT in two tasks involving Greek legal documents: named entity recognition and multi-class legal topic classification. We view our work as a contribution to the study of domain-specific NLP tasks in low-resource languages, like Greek, using modern NLP techniques and methodologies.