LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT? (2008.00805v1)

Published 3 Aug 2020 in cs.CL

Abstract: This paper presents the different models submitted by the LT@Helsinki team for the SemEval 2020 Shared Task 12. Our team participated in sub-tasks A and C, titled offensive language identification and offense target identification, respectively. In both cases we used the Bidirectional Encoder Representations from Transformers (BERT) model, pre-trained by Google and fine-tuned by us on the OLID and SOLID datasets. The results show that offensive tweet classification is one of several language-based tasks where BERT can achieve state-of-the-art results.
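The approach the abstract describes, fine-tuning a pre-trained BERT model for offensive-language classification, follows a standard sequence-classification recipe. Below is a minimal sketch of that recipe using the Hugging Face transformers library; the model checkpoint, hyperparameters, and toy examples are illustrative assumptions, not the authors' exact configuration, which fine-tunes on the OLID and SOLID datasets.

```python
# Minimal sketch of fine-tuning BERT for binary offensive-language
# classification (in the spirit of sub-task A). The checkpoint name,
# learning rate, and toy data are assumptions for illustration.
import torch
from torch.optim import AdamW
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # 0 = not offensive, 1 = offensive
)

# Toy stand-ins for OLID-style (tweet, label) pairs.
texts = ["you are wonderful", "you are an idiot"]
labels = torch.tensor([0, 1])

batch = tokenizer(texts, padding=True, truncation=True,
                  max_length=128, return_tensors="pt")

# One fine-tuning step: forward pass with labels yields the loss.
model.train()
optimizer = AdamW(model.parameters(), lr=2e-5)
outputs = model(**batch, labels=labels)
outputs.loss.backward()
optimizer.step()

# Inference: pick the higher-scoring class per tweet.
model.eval()
with torch.no_grad():
    preds = model(**batch).logits.argmax(dim=-1)
print(preds.tolist())
```

In practice one would iterate this step over mini-batches of the full training set for a few epochs; the single forward/backward pass above just shows the shape of the fine-tuning loop.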

Authors (4)
  1. Marc Pàmies (7 papers)
  2. Emily Öhman (6 papers)
  3. Kaisla Kajava (2 papers)
  4. Jörg Tiedemann (41 papers)
Citations (10)