
nlpBDpatriots at BLP-2023 Task 1: A Two-Step Classification for Violence Inciting Text Detection in Bangla (2311.15029v1)

Published 25 Nov 2023 in cs.CL

Abstract: In this paper, we discuss the nlpBDpatriots entry to the shared task on Violence Inciting Text Detection (VITD), organized as part of the first workshop on Bangla Language Processing (BLP), co-located with EMNLP. The aim of this task is to identify and classify violent threats that provoke further unlawful violent acts. Our best-performing approach for the task is a two-step classification using back translation and multilinguality, which ranked 6th out of 27 teams with a macro F1 score of 0.74.
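The abstract names a two-step classification and a macro F1 score without giving details. The sketch below is only an illustration of what a generic two-step cascade and the macro F1 metric look like; it is not the authors' implementation. The label names ("non-violence", "direct", "passive") and the function names `two_step_predict`, `is_violent`, and `violence_type` are hypothetical, and the sketch assumes the first step separates violent from non-violent text while the second step assigns a violence type.

```python
from typing import Callable, Sequence


def two_step_predict(
    text: str,
    is_violent: Callable[[str], bool],    # hypothetical step-1 classifier
    violence_type: Callable[[str], str],  # hypothetical step-2 classifier
) -> str:
    """Run a two-step cascade: step 2 is consulted only if step 1 fires."""
    if not is_violent(text):
        return "non-violence"  # illustrative label name (assumption)
    return violence_type(text)


def macro_f1(gold: Sequence[str], pred: Sequence[str]) -> float:
    """Unweighted mean of per-class F1 over all labels seen in gold or pred."""
    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
        fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
        fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

Because macro F1 averages per-class scores without weighting by class frequency, a rare class counts as much as a frequent one, which is why it is a common choice for imbalanced classification tasks.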
