2000 character limit reached
Hate Speech Detection on Vietnamese Social Media Text using the Bidirectional-LSTM Model (1911.03648v1)
Published 9 Nov 2019 in cs.CL and cs.LG
Abstract: In this paper, we describe our system which participates in the shared task of Hate Speech Detection on Social Networks of VLSP 2019 evaluation campaign. We are provided with the pre-labeled dataset and an unlabeled dataset for social media comments or posts. Our mission is to pre-process and build machine learning models to classify comments/posts. In this report, we use Bidirectional Long Short-Term Memory to build the model that can predict labels for social media text according to Clean, Offensive, Hate. With this system, we achieve comparative results with 71.43% on the public standard test set of VLSP 2019.
- Hang Thi-Thuy Do (2 papers)
- Huy Duc Huynh (2 papers)
- Kiet Van Nguyen (74 papers)
- Ngan Luu-Thuy Nguyen (56 papers)
- Anh Gia-Tuan Nguyen (13 papers)