2000 character limit reached
Overview of the 2023 ICON Shared Task on Gendered Abuse Detection in Indic Languages (2401.03677v1)
Published 8 Jan 2024 in cs.CL and cs.LG
Abstract: This paper reports the findings of the ICON 2023 on Gendered Abuse Detection in Indic Languages. The shared task deals with the detection of gendered abuse in online text. The shared task was conducted as a part of ICON 2023, based on a novel dataset in Hindi, Tamil and the Indian dialect of English. The participants were given three subtasks with the train dataset consisting of approximately 6500 posts sourced from Twitter. For the test set, approximately 1200 posts were provided. The shared task received a total of 9 registrations. The best F-1 scores are 0.616 for subtask 1, 0.572 for subtask 2 and, 0.616 and 0.582 for subtask 3. The paper contains examples of hateful content owing to its topic.
- The uli dataset: An exercise in experience led annotation of ogbv. arXiv preprint arXiv:2311.09086.
- Developing a multilingual annotated corpus of misogyny and aggression. arXiv preprint arXiv:2003.07428.
- Daniel L Byman. 2021. How hateful rhetoric connects to real-world violence.
- Findings of the shared task on offensive language identification in tamil, malayalam, and kannada. In Proceedings of the first workshop on speech and language technologies for Dravidian languages, pages 133–145.
- Improving cyberbullying detection with user context. In Advances in Information Retrieval: 35th European Conference on IR Research, ECIR 2013, Moscow, Russia, March 24-27, 2013. Proceedings 35, pages 693–696. Springer.
- Online hate speech against women: Automatic identification of misogyny and sexism on twitter. Journal of intelligent & fuzzy systems, 36(5):4743–4752.
- Multilingual abusive comment detection at scale for indic languages. Advances in Neural Information Processing Systems, 35:26176–26191.
- Evaluating aggression identification in social media. In Proceedings of the second workshop on trolling, aggression and cyberbullying, pages 1–5.
- Proceedings of the first workshop on trolling, aggression and cyberbullying (trac-2018). In Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018).
- Comma@ icon: Multilingual gender biased and communal language identification task at icon-2021. In Proceedings of the 18th International Conference on Natural Language Processing: Shared Task on Multilingual Gender Biased and Communal Language Identification, pages 1–12.
- Aggression-annotated corpus of hindi-english code-mixed data. arXiv preprint arXiv:1803.09402.
- Accurately detecting trolls in slashdot zoo via decluttering. In 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014), pages 188–195. IEEE.
- Overview of the hasoc track at fire 2020: Hate speech and offensive language identification in tamil, malayalam, hindi, english and german. In Proceedings of the 12th Annual Meeting of the Forum for Information Retrieval Evaluation, pages 29–32.
- Overview of the hasoc track at fire 2019: Hate speech and offensive content identification in indo-european languages. In Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation, pages 14–17.
- Overview of the hasoc subtrack at fire 2021: Hate speech and offensive content identification in english and indo-aryan languages. arXiv preprint arXiv:2112.09301.
- Maya Mirchandani. 2018. Digital hatred, real violence: Majoritarian radicalisation and social media in india. ORF Occasional Paper, 167:1–30.
- Overview of the hasoc subtrack at fire 2021: Hate speech and offensive content identification in english and indo-aryan languages and conversational hate speech. In Proceedings of the 13th Annual Meeting of the Forum for Information Retrieval Evaluation, pages 1–3.
- Luis Gerardo Mojica. 2016. Modeling trolling in social media conversations. arXiv preprint arXiv:1612.05310.
- Hate speech detection in the bengali language: A dataset and its baseline evaluation. In Proceedings of International Joint Conference on Advances in Computational Intelligence: IJCACI 2020, pages 457–468. Springer.
- Anita Saroj and Sukomal Pal. 2020. An indian language social media collection for hate and offensive speech. In Proceedings of the Workshop on Resources and Techniques for User and Author Profiling in Abusive Language, pages 2–8.
- Hate and offensive speech detection in hindi and marathi. arXiv preprint arXiv:2110.12200.
- Understanding abuse: A typology of abusive language detection subtasks. arXiv preprint arXiv:1705.09899.
- Predicting the type and target of offensive posts in social media. arXiv preprint arXiv:1902.09666.
- Semeval-2019 task 6: Identifying and categorizing offensive language in social media (offenseval). arXiv preprint arXiv:1903.08983.