Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges (2401.09244v1)

Published 17 Jan 2024 in cs.CL

Abstract: The growing prevalence and rapid evolution of offensive language in social media amplify the complexities of detection, particularly highlighting the challenges in identifying such content across diverse languages. This survey presents a systematic and comprehensive exploration of Cross-Lingual Transfer Learning (CLTL) techniques in offensive language detection in social media. Our study stands as the first holistic overview to focus exclusively on the cross-lingual scenario in this domain. We analyse 67 relevant papers and categorise these studies across various dimensions, including the characteristics of multilingual datasets used, the cross-lingual resources employed, and the specific CLTL strategies implemented. According to "what to transfer", we also summarise three main CLTL transfer approaches: instance, feature, and parameter transfer. Additionally, we shed light on the current challenges and future research opportunities in this field. Furthermore, we have made our survey resources available online, including two comprehensive tables that provide accessible references to the multilingual datasets and CLTL methods used in the reviewed literature.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (200)
  1. Automatic detection of offensive language for urdu and roman urdu. IEEE Access 8 (2020), 91213–91226.
  2. Areej Al-Hassan and Hmood Al-Dossari. 2019. Detection of hate speech in social networks: a survey on multilingual corpus. In Proc. of COMIT, Vol. 10. 10–5121.
  3. Dataset construction for the detection of anti-social behaviour in online communication in Arabic. Procedia Computer Science 142 (2018), 174–181.
  4. Hate speech detection in the Indonesian language: A dataset and preliminary study. In Proc. of ICACSIS. 233–238.
  5. Building for tomorrow: Assessing the temporal persistence of text classifiers. Information Processing & Management 60, 2 (2023), 103200.
  6. Fatimah Alkomah and Xiaogang Ma. 2022. A literature review of textual hate speech detection methods and datasets. Information 13, 6 (2022), 273.
  7. Deep learning models for multilingual hate speech detection. arXiv preprint arXiv:2004.06465 (2020).
  8. A Deep Dive into Multilingual Hate Speech Classification. In Proc. of ECML PKDD, Yuxiao Dong, Georgiana Ifrim, Dunja Mladenić, Craig Saunders, and Sofie Van Hoecke (Eds.). Springer, Cham, 423–439.
  9. UrduThreat@FIRE2021: Shared Track on Abusive Threat Identification in Urdu. In Proc. of FIRE (Virtual Event, India). ACM, New York, NY, USA, 9–11.
  10. Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation. In Proc. of ACM SIGIR (Paris, France). ACM, New York, NY, USA, 45–54.
  11. Cross-lingual hate speech detection based on multilingual domain-specific word embeddings. arXiv preprint arXiv:2104.14728 (2021).
  12. Hate speech detection is not as easy as you may think: A closer look at model validation (extended version). Information Systems 105 (2022), 101584.
  13. Resources for Multilingual Hate Speech Detection. In Proc. of the WOAH Workshop. ACL, Seattle, Washington (Hybrid), 122–130.
  14. Masaki Arata. 2019. Study on change of detection accuracy over time in cyberbullying detection. Ph. D. Dissertation. Master’s thesis, Kitami Institute of Technology, Department of Computer Science.
  15. Mikel Artetxe and Holger Schwenk. 2019. Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond. Transactions of the ACL 7 (2019), 597–610.
  16. Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection. IEEE Trans. on Computational Social Systems (2023), 1–10.
  17. Machine learning techniques for hate speech classification of twitter data: State-of-the-art, future challenges and research directions. Computer Science Review 38 (2020), 100311.
  18. Code-Switching Patterns Can Be an Effective Route to Improve Performance of Downstream NLP Applications: A Case Study of Humour, Sarcasm and Hate Speech Detection. In Proc. of ACL. ACL, Online, 1018–1023.
  19. SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter. In Proc. of the SemEval Workshop. ACL, Minneapolis, Minnesota, USA, 54–63.
  20. Developing a multilingual annotated corpus of misogyny and aggression. arXiv preprint arXiv:2003.07428 (2020).
  21. Cross-Lingual Transfer Learning for Hate Speech Detection. In Proc. of the LT-EDI Workshop. ACL, Kyiv, 15–25.
  22. Addressing the Challenges of Cross-Lingual Hate Speech Detection. arXiv preprint arXiv:2201.05922 (2022).
  23. Label modification and bootstrapping for zero-shot cross-lingual hate speech detection. Language Resources and Evaluation (2023), 1–32.
  24. A Dataset of Hindi-English Code-Mixed Social Media Text for Hate Speech Detection. In Proc. of PEOPLES Workshop. ACL, New Orleans, Louisiana, USA, 36–41.
  25. Overview of the evalita 2018 hate speech detection task. In Proc. of CEUR Workshop, Vol. 2263. CEUR, 1–9.
  26. Detecting Hateful and Offensive Speech in Arabic Social Media Using Transfer Learning. Applied Sciences 12, 24 (2022).
  27. Uwe Bretschneider and Ralf Peters. 2017. Detecting offensive statements towards foreigners in social media. (2017).
  28. Language models are few-shot learners. Proc. of NeurIPS 33 (2020), 1877–1901.
  29. Ben Burtenshaw and Mike Kestemont. 2021. A Dutch Dataset for Cross-lingual Multilabel Toxicity Detection. In Proc. of the BUCC Workshop. INCOMA Ltd., Online (Virtual Mode), 75–79.
  30. Dravidiancodemix: Sentiment analysis and offensive language identification dataset for dravidian languages in code-mixed text. Language Resources and Evaluation 56, 3 (2022), 765–806.
  31. Towards countering hate speech against journalists on social media. Online Social Networks and Media 17 (2020), 100071.
  32. AdaPrompt: Adaptive Model Training for Prompt-based NLP. In Findings of EMNLP. ACL, Abu Dhabi, United Arab Emirates, 6057–6068.
  33. Anusha Chhabra and Dinesh Kumar Vishwakarma. 2023. A literature survey on multimodal and multilingual automatic hate speech identification. Multimedia Systems (2023), 1–28.
  34. Multilingual and Multitarget Hate Speech Detection in Tweets. In Proc. of TALN-RECITAL. ATALA, Toulouse, France, 351–360.
  35. CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech. In Proc. of ACL. ACL, Florence, Italy, 2819–2829.
  36. Çağrı Çöltekin. 2020. A corpus of Turkish offensive language on social media. In Proc. of LREC. 6174–6184.
  37. Unsupervised Cross-lingual Representation Learning at Scale. In Proc. of ACL. ACL, Online, 8440–8451.
  38. Hybrid Emoji-Based Masked Language Models for Zero-Shot Abusive Language Detection. In Findings of EMNLP. ACL, Online, 943–949.
  39. Data Bootstrapping Approaches to Improve Low Resource Abusive Language Detection for Indic Languages. In Proc. of ACM HT (Barcelona, Spain). ACM, New York, NY, USA, 32–42.
  40. Automated Hate Speech Detection and the Problem of Offensive Language. In Proc. of ICWSM (Montreal, Canada). 512–515.
  41. Hate Speech Dataset from a White Supremacy Forum. In Proc. of the ALW Workshop. ACL, Brussels, Belgium, 11–20.
  42. Gretel Liz De la Peña Sarracén and Paolo Rosso. 2022. Unsupervised Embeddings with Graph Auto-Encoders for Multi-domain and Multilingual Hate Speech Detection. In Proc. of LREC. ELRA, Marseille, France, 2196–2204.
  43. Rogers de Pelle and Viviane Moreira. 2017. Offensive Comments in the Brazilian Web: a dataset and baseline results. In Anais do VI Brazilian Workshop on Social Network Analysis and Mining (São Paulo). SBC, Porto Alegre, RS, Brasil.
  44. Wubetu Barud Demilie and Ayodeji Olalekan Salau. 2022. Detection of fake news and hate speech for Ethiopian languages: a systematic review of the approaches. Journal of Big Data 9, 1 (2022), 66.
  45. COLD: A Benchmark for Chinese Offensive Language Detection. In Proc. of EMNLP. ACL, Abu Dhabi, United Arab Emirates, 11580–11599.
  46. Highly generalizable models for multilingual hate speech detection. arXiv preprint arXiv:2201.11294 (2022).
  47. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proc. of NAACL-HLT. ACL, Minneapolis, Minnesota, 4171–4186.
  48. Exploring linguistic features for extremist texts detection (on the material of Russian-speaking illegal texts). In Proc. of IEEE ISI. 188–190.
  49. Suman Dowlagar and Radhika Mamidi. 2021. A survey of recent neural network models on code-mixed indian hate speech data. In Proc. of FIRE. 67–74.
  50. Matthew S. Dryer and Martin Haspelmath (Eds.). 2013. WALS Online (v2020.3). Zenodo. https://doi.org/10.5281/zenodo.7385533
  51. Machine Learning and Affect Analysis against Cyber-Bullying. In Proc. of the Linguistic and Cognitive Approaches to Dialog Agents Symposium.
  52. Hate Speech Detection on Multilingual Twitter Using Convolutional Neural Networks. Revue d’Intelligence Artificielle 34, 1 (2020), 81–88.
  53. Peer to peer hate: Hate speech instigators and their targets. In Proc. of ICWSM, Vol. 12.
  54. Transfer language selection for zero-shot cross-lingual abusive language detection. Information Processing & Management 59, 4 (2022), 102981.
  55. Language-agnostic BERT Sentence Embedding. In Proc. of ACL. ACL, Dublin, Ireland, 878–891.
  56. Jointly Learning to Embed and Predict with Multiple Languages. In Proc. of ACL). ACL, Berlin, Germany, 2019–2028.
  57. Overview of the Evalita 2018 Task on Automatic Misogyny Identification (AMI). In EVALITA@CLiC-it.
  58. Overview of the Task on Automatic Misogyny Identification at IberEval 2018. In IberEval@SEPLN.
  59. Model-agnostic meta-learning for fast adaptation of deep networks. In Proc. of ICML. PMLR, 1126–1135.
  60. Using Cross Lingual Learning for Detecting Hate Speech in Portuguese. In Database and Expert Systems Applications, Christine Strauss, Gabriele Kotsis, A. Min Tjoa, and Ismail Khalil (Eds.). Springer, Cham, 170–175.
  61. Time of your hate: The challenge of time in hate speech detection on social media. Applied Sciences 10, 12 (2020), 4180.
  62. Rights for Peace. [n. d.]. What is Hate Speech? https://www.rightsforpeace.org/hate-speech. Accessed: 2023-10-24.
  63. Paula Fortuna. 2017. Automatic detection of hate speech in text: an overview of the topic and dataset annotation with hierarchical classes.
  64. Paula Fortuna and Sérgio Nunes. 2018. A Survey on Automatic Detection of Hate Speech in Text. 51, 4, Article 85 (jul 2018), 30 pages.
  65. A Hierarchically-Labeled Portuguese Hate Speech Dataset. In Proc. of the ALW Workshop. ACL, Florence, Italy, 94–104.
  66. Large scale crowdsourcing and characterization of twitter abusive behavior. In Proc. of ICWSM, Vol. 12.
  67. Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of Marathi. In Proc. of RANLP. INCOMA Ltd., Held Online, 437–443.
  68. Lei Gao and Ruihong Huang. 2017. Detecting online hate speech using context aware models. arXiv preprint arXiv:1710.07395 (2017).
  69. Handling Bias in Toxic Speech Detection: A Survey. 55, 13s, Article 264 (jul 2023), 32 pages.
  70. XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages. In Proc. of COLING. ICCL, Barcelona, Spain (Online), 6350–6365.
  71. A Large Labeled Corpus for Online Harassment Research. In Proc. of ACM WebSci (Troy, New York, USA). ACM, New York, NY, USA, 229–233.
  72. Learning Word Vectors for 157 Languages. In Proc. of LREC.
  73. Multilingual Abusive Comment Detection at Scale for Indic Languages. Proc. of NeurIPS 35 (2022), 26176–26191.
  74. Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces. In Proc. of the WOAH Workshop. ACL, Online, 6–16.
  75. Lawrence Han and Hao Tang. 2022. Designing of Prompts for Hate Speech Recognition with In-Context Learning. In Proc. of CSCI. IEEE, 319–320.
  76. Benchmarking multi-task learning for sentiment analysis and offensive language identification in under-resourced dravidian languages. arXiv preprint arXiv:2108.03867 (2021).
  77. KanCMD: Kannada CodeMixed Dataset for Sentiment Analysis and Offensive Language Detection. In Proc. of the PEOPLES Workshop. ACL, Barcelona, Spain (Online), 54–63.
  78. Offensive language identification in low-resourced code-mixed dravidian languages using pseudo-labeling. arXiv preprint arXiv:2108.12177 (2021).
  79. ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection. In Proc. of ACL. ACL, Dublin, Ireland, 3309–3326.
  80. Fatemah Husain and Ozlem Uzuner. 2021. A survey of offensive language detection for the Arabic language. ACM TALLIP 20, 1 (2021), 1–44.
  81. Fatemah Husain and Ozlem Uzuner. 2022. Transfer Learning Across Arabic Dialects for Offensive Language Detection. In Proc. of IALP. 196–205.
  82. Muhammad Okky Ibrohim and Indra Budi. 2019a. Translated vs non-translated method for multilingual hate speech identification in Twitter. International Journal on Advanced Science, Engineering and Information Technology 9, 4 (1 Jan. 2019), 1116–1123.
  83. Muhammad Okky Ibrohim and Indra Budi. 2019b. Multi-label Hate Speech and Abusive Language Detection in Indonesian Twitter. In Proc. of the ALW Workshop. ACL, Florence, Italy, 46–57.
  84. Md Saroar Jahan and Mourad Oussalah. 2023. A systematic review of Hate Speech automatic detection using Natural Language Processing. Neurocomputing (2023), 126232.
  85. KOLD: Korean Offensive Language Dataset. In Proc. of EMNLP. ACL, Abu Dhabi, United Arab Emirates, 10818–10833.
  86. SWSR: A Chinese dataset and lexicon for online sexism detection. Online Social Networks and Media 27 (2022), 100182.
  87. Aiqi Jiang and Arkaitz Zubiaga. 2021. Cross-Lingual Capsule Network for Hate Speech Detection in Social Media. In Proc. of ACM HT (Virtual Event, USA). ACM, New York, NY, USA, 217–223.
  88. Aiqi Jiang and Arkaitz Zubiaga. 2023. SexWEs: Domain-Aware Word Embeddings via Cross-Lingual Semantic Specialisation for Chinese Sexism Detection in Social Media. Proc. of ICWSM 17, 1 (Jun. 2023), 447–458.
  89. Is bert really robust? a strong baseline for natural language attack on text classification and entailment. In Proc. of AAAI, Vol. 34. 8018–8025.
  90. Mind Your Language: Abuse and Offense Detection for Code-Switched Languages. Proc. of AAAI 33, 01 (Jul. 2019), 9951–9952.
  91. Contextualizing hate speech classifiers with post-hoc explanation. arXiv preprint arXiv:2005.02439 (2020).
  92. Hate Speech Detection in Roman Urdu. ACM Transactions on Asian and Low-Resource Language Information Processing 20, 1, Article 9 (Mar. 2021).
  93. Confronting abusive language online: A survey from the ethical and human rights perspective. Artificial Intelligence Research 71 (2021), 431–478.
  94. Siamese neural networks for one-shot image recognition. In ICML deep learning workshop, Vol. 2. Lille.
  95. Akshi Kumar and Nitin Sachdeva. 2022. Multi-input integrative learning using deep neural networks and transfer learning for cyberbullying detection in real-time code-mix data. Multimedia systems 28, 6 (2022), 2027–2041.
  96. Evaluating aggression identification in social media. In Proc. of the TRAC Workshop. 1–5.
  97. Aggression-annotated Corpus of Hindi-English Code-mixed Data. In Proc. of LREC. ELRA, Miyazaki, Japan.
  98. Haspeede3 at evalita 2023: Overview of the political and religious hate speech detection task. In EVALITA.
  99. Unsupervised machine translation using monolingual corpora only. In Proc. of ICLR.
  100. Word translation without parallel data. In Proc. of ICLR.
  101. Exploring the Data Efficiency of Cross-Lingual Post-Training in Pretrained Language Models. Applied Sciences 11, 5 (2021).
  102. Dong-Hyun Lee et al. 2013. Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks. In ICML Workshop on challenges in representation learning, Vol. 3. Atlanta, 896.
  103. Multilingual Denoising Pre-training for Neural Machine Translation. Transactions of the ACL 8 (2020), 726–742.
  104. The FRENK datasets of socially unacceptable discourse in Slovene and English. In Proc. of TSD. Springer, 103–114.
  105. Scott M Lundberg and Su-In Lee. 2017. A Unified Approach to Interpreting Model Predictions. In Proc. of NeurIPS 30, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Inc., 4765–4774.
  106. Overview of the HASOC Track at FIRE 2020: Hate Speech and Offensive Language Identification in Tamil, Malayalam, Hindi, English and German. In Proc. of FIRE (Hyderabad, India). ACM, New York, NY, USA, 29–32.
  107. Overview of the HASOC Track at FIRE 2019: Hate Speech and Offensive Content Identification in Indo-European Languages. In Proc. of FIRE (Kolkata, India). ACM, New York, NY, USA, 14–17.
  108. Exploring Stylometric and Emotion-Based Features for Multilingual Cross-Domain Hate Speech Detection. In Proc. of the WASSA Workshop. ACL, Online, 149–159.
  109. Hatexplain: A benchmark dataset for explainable hate speech detection. In Proc. of AAAI, Vol. 35. 14867–14875.
  110. Did you offend me? Classification of Offensive Tweets in Hinglish Language. In Proc. of the ALW Workshop. ACL, Brussels, Belgium, 138–148.
  111. Detecting Offensive Tweets in Hindi-English Code-Switched Language. In Proc. of the SocialNLP Workshop. ACL, Melbourne, Australia, 18–26.
  112. Tackling online abuse: A survey of automated abuse detection methods. arXiv preprint arXiv:1908.06024 (2019).
  113. Overview of the hasoc subtrack at fire 2021: Hate speech and offensive content identification in english and indo-aryan languages and conversational hate speech. In Proce. of FIRE. 1–3.
  114. Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models. In Findings of AACL-IJCNLP. ACL, Online only, 347–363.
  115. Cross-Lingual Few-Shot Hate Speech and Offensive Language Detection Using Meta Learning. IEEE Access 10 (2022), 14880–14896.
  116. Abusive Language Detection on Arabic Social Media. In Proc. of the ALW Workshop. ACL, Vancouver, BC, Canada, 52–56.
  117. Overview of OSACT4 Arabic Offensive Language Detection Shared Task. In Proc. of the OSACT Workshop. ELRA, Marseille, France, 48–52.
  118. Arabic Offensive Language on Twitter: Analysis and Experiments. In Proc. of the WANLP Workshop. ACL, Kyiv, Ukraine (Virtual), 126–135.
  119. L-hsab: A levantine twitter dataset for hate speech and abusive language. In Proc. of the ALW Workshop. 111–118.
  120. Arianna Muti and Alberto Barrón-Cedeño. 2022. A Checkpoint on Multilingual Misogyny Identification. In Proc. of ACL SRW. ACL, Dublin, Ireland, 454–460.
  121. Debora Nozza. 2021. Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection. In Proc. of ACL-IJCNLP. ACL, Online, 907–914.
  122. OpenAI. 2023. GPT-4 Technical Report. arXiv:2303.08774 [cs.CL]
  123. Multilingual and Multi-Aspect Hate Speech Analysis. In Proc. of EMNLP-IJCNLP. ACL, Hong Kong, China, 4675–4684.
  124. ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora. In Proc. of EMNLP. ACL, Online and Punta Cana, Dominican Republic, 27–38.
  125. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. International Journal of Surgery 88 (2021), 105906.
  126. Misogyny Detection in Twitter: a Multilingual and Cross-Domain Study. Information Processing & Management 57, 6 (2020), 102360.
  127. A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection. Information Processing & Management 58, 4 (2021), 102544.
  128. Towards multidomain and multilingual abusive language detection: a survey. Personal and Ubiquitous Computing 27, 1 (2021), 17–43.
  129. Endang Wahyu Pamungkas and Viviana Patti. 2019. Cross-domain and Cross-lingual Abusive Language Detection: A Hybrid Approach with Deep Learning and a Multilingual Lexicon. In Proc. of ACL SRW. ACL, Florence, Italy, 363–370.
  130. Sinno Jialin Pan and Qiang Yang. 2010. A Survey on Transfer Learning. IEEE Trans. on Knowledge and Data Engineering 22, 10 (2010), 1345–1359.
  131. Kartikey Pant and Tanvi Dadu. 2020. Towards Code-switched Classification Exploiting Constituent Language Resources. In Proc. of AACL-IJCNLP SRW. ACL, Suzhou, China, 37–43.
  132. COVID-19 and cyberbullying: deep ensemble model to identify cyberbullying from code-switched languages during the pandemic. Multimedia Tools and Applications 82, 6 (2023), 8773–8789.
  133. SemEval-2021 Task 5: Toxic Spans Detection. In Proc. of the SemEval Workshop. ACL, Online, 59–69.
  134. Zero-shot Cross-lingual Content Filtering: Offensive Language and Hate Speech Detection. In Proc.of the EACL Hackashop. ACL, Online, 30–34.
  135. Investigating cross-lingual training for offensive language detection. PeerJ Computer Science 7 (2021), e559.
  136. Detecting and Monitoring Hate Speech in Twitter. Sensors 19, 21 (2019).
  137. Deep Contextualized Word Representations. In Proc. of NAACL-HLT. ACL, New Orleans, Louisiana, 2227–2237.
  138. Tung Minh Phung and Jan Cloos. 2022. An exploratory experiment on Hindi, Bengali hate-speech detection and transfer learning using neural networks. arXiv preprint arXiv:2201.01997 (2022).
  139. Cross-lingual learning for text processing: A survey. Expert Systems with Applications 165 (2021), 113765.
  140. Offensive Language Identification in Greek. In Proc. of LREC. ELRA, Marseille, France, 5113–5119.
  141. Overview of EXIST 2023: sEXism Identification in Social NeTworks. In Proc. of ECIR. Springer, 593–599.
  142. Respectful or Toxic? Using Zero-Shot Learning with Language Models to Detect Hate Speech. In Proc. of the WOAH workshop. ACL, Toronto, Canada, 60–68.
  143. Resources and benchmark corpora for hate speech detection: a systematic review. Language Resources and Evaluation 55 (2021), 477–523.
  144. Cyberbullying Detection–Technical Report 2/2018, Department of Computer Science AGH, University of Science and Technology. arXiv preprint arXiv:1808.00926 (2018).
  145. Results of the poleval 2019 shared task 6: First dataset and open shared task for automatic cyberbullying detection in polish twitter. (2019).
  146. Tharindu Ranasinghe and Marcos Zampieri. 2020. Multilingual Offensive Language Identification with Cross-lingual Embeddings. In Proc. of EMNLP. ACL, Online, 5838–5844.
  147. Tharindu Ranasinghe and Marcos Zampieri. 2021a. An Evaluation of Multilingual Offensive Language Identification Methods for the Languages of India. Information 12, 8 (2021).
  148. Tharindu Ranasinghe and Marcos Zampieri. 2021b. MUDES: Multilingual Detection of Offensive Spans. In Proc. of NAACL-HLT: Demonstrations. ACL, Online, 144–152.
  149. Tharindu Ranasinghe and Marcos Zampieri. 2021c. Multilingual Offensive Language Identification for Low-Resource Languages. ACM Transactions on Asian and Low-Resource Language Information Processing 21, 1, Article 4 (Nov. 2021), 13 pages.
  150. “Why Should I Trust You?”: Explaining the Predictions of Any Classifier. In Proc. of NAACL: Demonstrations. ACL, San Diego, California, 97–101.
  151. Bilingual Word Embeddings Rivaling. 2021. Llod-driven bilingual word embeddings rivaling cross-lingual transformers in quality of life concept detection from french online health communities. In Proc. of of SEMANTiCS, Vol. 53. IOS Press, 89.
  152. Hate-speech and offensive language detection in roman Urdu. In Proc. of EMNLP. 2512–2522.
  153. Detecting Hate Speech in Cross-Lingual and Multi-lingual Settings Using Language Agnostic Representations. In Proc. of CIARP, João Manuel R. S. Tavares, João Paulo Papa, and Manuel González Hidalgo (Eds.). Springer, Cham, 77–87.
  154. Overview of exist 2021: sexism identification in social networks. Procesamiento del Lenguaje Natural 67 (2021), 195–207.
  155. Overview of exist 2022: sexism identification in social networks. Procesamiento del Lenguaje Natural 69 (2022), 229–240.
  156. Hate speech detection in the bengali language: A dataset and its baseline evaluation. In Proc. of IJCACI. Springer, 457–468.
  157. Measuring the reliability of hate speech annotations: The case of the european refugee crisis. arXiv preprint arXiv:1701.08118 (2017).
  158. Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages. In Proc. of EMNLP. ACL, Abu Dhabi, United Arab Emirates, 5674–5691.
  159. Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models. In Proc. of the WOAH Workshop. ACL, Seattle, Washington (Hybrid), 154–169.
  160. Probing LLMs for hate speech detection: strengths and vulnerabilities. arXiv preprint arXiv:2310.12860 (2023).
  161. Training Multilingual and Adversarial Attack-Robust Models for Hate Detection on Social Media. Procedia Computer Science 213 (2022), 196–202.
  162. Offensive Content Detection via Synthetic Code-Switched Text. In Proc. of COLING. ICCL, Gyeongju, Republic of Korea, 6617–6624.
  163. Haspeede 2@ evalita2020: Overview of the evalita 2020 hate speech detection task. EVALITA (2020).
  164. An Italian Twitter Corpus of Hate Speech against Immigrants. In Proc. of LREC. ELRA, Miyazaki, Japan.
  165. Overview of the HASOC Subtrack at FIRE 2022: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages. In Proc. of FIRE (Kolkata, India). ACM, New York, NY, USA, 4–7.
  166. Anna Schmidt and Michael Wiegand. 2017. A survey on hate speech detection using natural language processing. In Proc. of the SocialNLP Workshop. 1–10.
  167. Automating News Comment Moderation with Limited Resources: Benchmarking in Croatian and Estonian. Journal for Language Technology and Computational Linguistics (2020).
  168. Cross-lingual offensive speech identification with transfer learning for low-resource languages. Computers and Electrical Engineering 101 (2022), 108005.
  169. Gudbjartur Ingi Sigurbergsson and Leon Derczynski. 2020. Offensive Language and Hate Speech Detection for Danish. In Proc. of LREC. ELRA, Marseille, France, 3498–3508.
  170. Offline bilingual word vectors, orthogonal transformations and the inverted softmax. In Proc. of ICLR.
  171. Hajung Sohn and Hyunju Lee. 2019. MC-BERT4HATE: Hate Speech Detection using Multi-channel BERT for Different Languages and Translations. In 2019 International Conference on Data Mining Workshops (ICDMW). 551–559.
  172. A Study of Multilingual Toxic Text Detection Approaches under Imbalanced Sample Distribution. Information 12, 5 (2021).
  173. Cleansing & expanding the HURTLEX(el) with a multidimensional categorization of offensive words. In Proc. of the WOAH Workshop. ACL, Seattle, Washington (Hybrid), 102–108.
  174. Cross-lingual zero-and few-shot hate speech detection utilising frozen transformer language models and AXEL. arXiv preprint arXiv:2004.13850 (2020).
  175. Overview of GermEval Task 2, 2019 shared task on the identification of offensive language. In Proc. of KONVENS. München [u.a.], 352 – 363.
  176. A corpus of english-hindi code-mixed tweets for sarcasm detection. arXiv preprint arXiv:1805.11869 (2018).
  177. Teodor Tiţa and Arkaitz Zubiaga. 2021. Cross-lingual Hate Speech Detection using Transformer Models. arXiv preprint arXiv:2111.00981 (2021).
  178. Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971 (2023).
  179. UNSPAHS. 2019. UN Strategy and Plan of Action on Hate Speech. https://www.un.org/en/genocideprevention/hate-speech-strategy.shtml. Accessed: 2023-10-24.
  180. Automatic detection and prevention of cyberbullying. In Proc. of HUSO. IARIA, 13–18.
  181. Neeraj Vashistha and Arkaitz Zubiaga. 2021. Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media. Information 12, 1 (2021).
  182. Bertie Vidgen and Leon Derczynski. 2020. Directions in abusive language training data, a systematic review: Garbage in, garbage out. Plos One 15, 12 (2020), e0243300.
  183. Learning from the worst: Dynamically generated datasets to improve online hate detection. arXiv preprint arXiv:2012.15761 (2020).
  184. Efficient Detection of Multilingual Hate Speech by Using Interactive Attention Network with Minimal Human Feedback. In Proc. of ACM WebSci (Virtual Event, United Kingdom). ACM, New York, NY, USA, 130–138.
  185. Dong Wang and Thomas Fang Zheng. 2015. Transfer learning for speech and language processing. In Proc. of APSIPA. IEEE, 1225–1237.
  186. Evaluating GPT-3 Generated Explanations for Hateful Content Moderation. In Proc. of IJCAI. 6255–6263.
  187. Zeerak Waseem and Dirk Hovy. 2016. Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter. In Proc. of NAACL SRW. ACL, San Diego, California, 88–93.
  188. Finetuned language models are zero-shot learners. arXiv preprint arXiv:2109.01652 (2021).
  189. Overview of the germeval 2018 shared task on the identification of offensive language. (2018).
  190. Ex machina: Personal attacks seen at scale. In Proc. of WWW. 1391–1399.
  191. mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer. In Proc. of NAACL-HLT. ACL, Online, 483–498.
  192. Peiling Yi and Arkaitz Zubiaga. 2023. Session-based cyberbullying detection in social media: A survey. Online Social Networks and Media 36 (2023), 100250.
  193. Wenjie Yin and Arkaitz Zubiaga. 2021. Towards generalisable hate speech detection: a review on obstacles and solutions. PeerJ Computer Science 7 (2021), e598.
  194. A multilingual offensive language detection method based on transfer learning from transformer fine-tuning model. Computer and Information Sciences 34, 8, Part B (2022), 6048–6056.
  195. SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval). In Proc. of the SemEval Workshop, Jonathan May, Ekaterina Shutova, Aurelie Herbelot, Xiaodan Zhu, Marianna Apidianaki, and Saif M. Mohammad (Eds.). ACL, Minneapolis, Minnesota, USA, 75–86.
  196. SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020). In Proc. of the SemEval Workshop. ICCL, Barcelona (online), 1425–1447.
  197. Interpretable unified language checking. arXiv preprint arXiv:2304.03728 (2023).
  198. A survey of large language models. arXiv preprint arXiv:2303.18223 (2023).
  199. Cross-Cultural Transfer Learning for Chinese Offensive Language Detection. In Proc. of the C3NLP Workshop). ACL, Dubrovnik, Croatia, 8–15.
  200. Improving Zero-Shot Cross-Lingual Hate Speech Detection with Pseudo-Label Fine-Tuning of Transformer Language Models. Proc. of ICWSM 16, 1 (May 2022), 1435–1439.
Citations (2)

Summary

  • The paper offers a systematic review of 67 studies, evaluating datasets, transfer techniques, and challenges in detecting offensive language across languages.
  • It categorizes transfer methods into instance, feature, and parameter levels, highlighting strategies like machine translation and zero-shot learning.
  • The survey identifies key challenges such as data scarcity, linguistic nuances, and annotation inconsistencies, suggesting paths for future research.

Introduction

Cross-Lingual Transfer Learning (CLTL) is an evolving subfield within the domain of offensive language detection on social media platforms. The challenge in this area is amplified by the need to identify offensive content that can vary significantly with linguistic nuances and cultural contexts. CLTL strategies are crucial in mitigating data scarcity issues encountered in low-resource languages. This survey inspects the techniques of cross-lingual detection by examining 67 studies, dissecting them based on datasets leveraged, resources applied, and the dimensions of transfer—instance, feature, and parameter.

Existing Datasets and Cross-Lingual Resources

Multilingual datasets serve as the foundation for cross-lingual studies, but they are often limited by factors such as data scarcity, linguistic diversity, and annotation challenges. This review has shed light on 82 multilingual datasets, noting varying topics, source platforms, language families, size, availability, and typologies of labels. The datasets are predominantly in Indo-European languages, followed by semitic languages like Arabic. In addition to existing datasets, cross-lingual resources such as multilingual lexicons, parallel corpora, and machine translation tools are critical to aligning linguistic features and facilitating CLTL.

Transfer Approaches in CLTL

The paper categorizes CLTL approaches into three main levels:

  1. Instance Transfer: This focuses on transferring data elements, such as text or labels, across languages, using techniques like annotation projection, pseudo-labelling, machine translation, and text alignment.
  2. Feature Transfer: It leverages cross-lingual word embeddings and contextualized representations to maintain a shared feature space across languages. Retrofitting and additional features integration also fall within this spectrum.
  3. Parameter Transfer: This level encompasses the transfer of model parameters or behaviors across languages. The paper breaks down parameter transfer into zero-shot, joint, and cascade learning scenarios accompanied by hybrid strategies like ensemble and meta-learning.

Challenges and Future Prospects

The myriad challenges identified pertain to linguistic structures, dataset limitations, and methodological hurdles. These include the adaptability problems posed by language-specific nuances, limited labelled datasets, inconsistent definitions and annotations, imbalance in datasets, and the limited generalization capabilities of CLTL models. Future research directions point towards creating balanced and comprehensive datasets, improving annotation strategies, integrating additional language-agnostic features, optimizing multilingual PLMs, and experimenting with advanced training strategies such as meta-learning and adversarial training.

This survey points to a need for a continued focus on CLTL in offensive language detection to bridge the language resource gap and enable robust moderation systems in multilingual online environments. It emphasizes the combined utility of multilingual datasets, cross-lingual resources, and innovative learning strategies while highlighting complexities that arise from cultural specificity and rapid linguistic evolution in digital communication.

X Twitter Logo Streamline Icon: https://streamlinehq.com