nlpBDpatriots at BLP-2023 Task 1: A Two-Step Classification for Violence Inciting Text Detection in Bangla (2311.15029v1)
Published 25 Nov 2023 in cs.CL
Abstract: In this paper, we discuss the nlpBDpatriots entry to the shared task on Violence Inciting Text Detection (VITD), organized as part of the first workshop on Bangla Language Processing (BLP) co-located with EMNLP. The aim of this task is to identify and classify violent threats that provoke further unlawful violent acts. Our best-performing approach for the task is a two-step classification using back translation and multilinguality, which ranked 6th out of 27 teams with a macro F1 score of 0.74.
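For readers unfamiliar with the metric reported above, macro F1 is the unweighted mean of the per-class F1 scores, so each class counts equally regardless of how many examples it has. The following is a minimal sketch of that computation, not the shared task's official scorer; the function name `macro_f1` and the integer labels in the usage note are hypothetical and chosen only for illustration.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative

    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

As a usage example, `macro_f1([0, 0, 1, 1, 2, 2], [0, 1, 1, 1, 2, 0])` averages the per-class scores 0.5, 0.8, and 2/3. Because the average is unweighted, a system that performs poorly on a rare class is penalized as much as one that performs poorly on a frequent class.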