BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification (2310.11748v1)
Abstract: The dramatic increase in the use of social media platforms for information sharing has also fueled a steep growth in online abuse. A simple yet effective way of abusing individuals or communities is by creating memes, which often integrate an image with a short piece of text layered on top of it. Such harmful content is in rampant use and is a threat to online safety. Hence, it is necessary to develop efficient models to detect and flag abusive memes. The problem becomes more challenging in a low-resource setting (e.g., Bengali memes, i.e., images with embedded Bengali text) because of the absence of benchmark datasets on which AI models could be trained. In this paper, we bridge this gap by building a Bengali meme dataset. To set up an effective benchmark, we implement several baseline models for classifying abusive memes using this dataset. We observe that multimodal models that use both textual and visual information outperform unimodal models. Our best-performing model achieves a macro F1 score of 70.51. Finally, we perform a qualitative error analysis of the misclassified memes of the best-performing text-based, image-based, and multimodal models.
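For readers unfamiliar with the metric reported above, the following is a minimal sketch of how a macro F1 score is commonly computed: the F1 score is calculated separately for each class and the per-class scores are then averaged with equal weight. The function name `macro_f1` and the toy labels are illustrative assumptions and are not taken from the paper; the paper's figure of 70.51 is presumably this quantity expressed on a 0 to 100 scale.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (hypothetical helper)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class receives a false positive
            fn[truth] += 1   # true class receives a false negative
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy example: 1 = abusive, 0 = non-abusive (made-up labels)
y_true = [1, 1, 0, 0]
y_pred = [1, 0, 0, 0]
print(round(100 * macro_f1(y_true, y_pred), 2))
```

Because every class contributes equally to the average, macro F1 does not let a majority class mask poor performance on a minority class, which is one common reason it is preferred for abuse-detection benchmarks.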
Authors: Mithun Das, Animesh Mukherjee