ViDeBERTa: A powerful pre-trained language model for Vietnamese (2301.10439v2)

Published 25 Jan 2023 in cs.CL and cs.LG

Abstract: This paper presents ViDeBERTa, a new pre-trained monolingual language model for Vietnamese, with three versions - ViDeBERTa_xsmall, ViDeBERTa_base, and ViDeBERTa_large - which are pre-trained on a large-scale corpus of high-quality and diverse Vietnamese texts using the DeBERTa architecture. Although many successful Transformer-based pre-trained language models have been proposed for English, there are still few pre-trained models for Vietnamese, a low-resource language, that achieve strong results on downstream tasks, especially question answering. We fine-tune and evaluate our model on three important downstream natural language tasks: part-of-speech tagging, named-entity recognition, and question answering. The empirical results demonstrate that ViDeBERTa, with far fewer parameters, surpasses the previous state-of-the-art models on multiple Vietnamese-specific natural language understanding tasks. Notably, ViDeBERTa_base, with 86M parameters - only about 23% of the 370M parameters of PhoBERT_large - still achieves the same or better results than the previous state-of-the-art model. Our ViDeBERTa models are available at: https://github.com/HySonLab/ViDeBERTa.
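Since the released checkpoints follow the DeBERTa architecture, they can in principle be loaded with standard Hugging Face Transformers tooling. Below is a minimal sketch of obtaining contextual embeddings for a Vietnamese sentence; the model id is a hypothetical placeholder (check the official repository above for the actual published checkpoints), and the call assumes the weights are hosted on the Hugging Face Hub.

```python
# Minimal sketch: loading a ViDeBERTa checkpoint with Hugging Face Transformers.
# NOTE: the model id below is a hypothetical placeholder, not a verified Hub id;
# see https://github.com/HySonLab/ViDeBERTa for the actual checkpoints.
from transformers import AutoTokenizer, AutoModel

model_id = "HySonLab/videberta-base"  # hypothetical id, for illustration only
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

# Encode a Vietnamese sentence and run it through the encoder.
inputs = tokenizer("Hà Nội là thủ đô của Việt Nam.", return_tensors="pt")
outputs = model(**inputs)

# Contextual token embeddings: shape (batch, sequence_length, hidden_size).
print(outputs.last_hidden_state.shape)
```

For the downstream tasks evaluated in the paper, the same checkpoint would typically be wrapped in a task head instead, e.g. AutoModelForTokenClassification for POS tagging and NER, or AutoModelForQuestionAnswering for extractive QA.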

Authors (5)
  1. Cong Dao Tran (3 papers)
  2. Nhut Huy Pham (1 paper)
  3. Anh Nguyen (157 papers)
  4. Truong Son Hy (28 papers)
  5. Tu Vu (24 papers)
Citations (11)