Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Learning for Hate Speech Detection: A Comparative Study (2202.09517v2)

Published 19 Feb 2022 in cs.CL, cs.AI, cs.IR, and cs.LG

Abstract: Automated hate speech detection is an important tool in combating the spread of hate speech, particularly in social media. Numerous methods have been developed for the task, including a recent proliferation of deep-learning based approaches. A variety of datasets have also been developed, exemplifying various manifestations of the hate-speech detection problem. We present here a large-scale empirical comparison of deep and shallow hate-speech detection methods, mediated through the three most commonly used datasets. Our goal is to illuminate progress in the area, and identify strengths and weaknesses in the current state-of-the-art. We particularly focus our analysis on measures of practical performance, including detection accuracy, computational efficiency, capability in using pre-trained models, and domain generalization. In doing so we aim to provide guidance as to the use of hate-speech detection in practice, quantify the state-of-the-art, and identify future research directions. Code and dataset are available at https://github.com/jmjmalik22/Hate-Speech-Detection.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (62)
  1. Deep learning for detecting cyberbullying across multiple social media platforms. In European conference on information retrieval, pages 141–153. Springer, 2018.
  2. Akiko Aizawa. An information-theoretic perspective of tf–idf measures. Information Processing & Management, 39(1):45–65, 2003.
  3. Hate speech detection is not as easy as you may think: A closer look at model validation. In Proceedings of the 42nd international acm sigir conference on research and development in information retrieval, pages 45–54, 2019.
  4. Classifying sensitive content in online advertisements with deep learning. International Journal of Data Science and Analytics, 10:265–276, 2020.
  5. Angrybert: Joint learning target and emotion for hate speech detection. arXiv preprint arXiv:2103.11800, 2021.
  6. A probabilistic clustering model for hate speech classification in twitter. Expert Systems with Applications, 173:114762, 2021.
  7. Deep learning for hate speech detection in tweets. In Proceedings of the 26th international conference on World Wide Web companion, pages 759–760, 2017.
  8. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473, 2014.
  9. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135–146, 2017.
  10. Cyber hate speech on twitter: An application of machine classification and statistical modeling for policy and decision making. Policy & internet, 7(2):223–242, 2015.
  11. From word to sense embeddings: A survey on vector representations of meaning. Journal of Artificial Intelligence Research, 63:743–788, 2018.
  12. Bharathi Raja Chakravarthi. Multilingual hope speech detection in english and dravidian languages. International Journal of Data Science and Analytics, 14(4):389–406, 2022.
  13. Detecting offensive language in social media to protect adolescent online safety. In 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing, pages 71–80. IEEE, 2012.
  14. The advantages of the matthews correlation coefficient (mcc) over f1 score and accuracy in binary classification evaluation. BMC genomics, 21(1):1–13, 2020.
  15. Electra: Pre-training text encoders as discriminators rather than generators. arXiv preprint arXiv:2003.10555, 2020.
  16. A multilingual evaluation for online hate speech detection. ACM Transactions on Internet Technology (TOIT), 20(2):1–22, 2020.
  17. Automated hate speech detection and the problem of offensive language. In Proceedings of the International AAAI Conference on Web and Social Media, volume 11, 2017.
  18. Hate me, hate me not: Hate speech detection on facebook. In Proceedings of the First Italian Conference on Cybersecurity (ITASEC17), pages 86–95, 2017.
  19. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
  20. A survey on automatic detection of hate speech in text. ACM Computing Surveys (CSUR), 51(4):1–30, 2018.
  21. Large scale crowdsourcing and characterization of twitter abusive behavior. In Twelfth International AAAI Conference on Web and Social Media, 2018.
  22. Using convolutional neural networks to classify hate-speech. In Proceedings of the first workshop on abusive language online, pages 85–90, 2017.
  23. A lexicon-based approach for hate speech detection. International Journal of Multimedia and Ubiquitous Engineering, 10(4):215–230, 2015.
  24. Classifying racist texts using a support vector machine. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pages 468–469, 2004.
  25. Multi-label hate speech and abusive language detection in indonesian twitter. In Proceedings of the Third Workshop on Abusive Language Online, pages 46–57, 2019.
  26. When does a compliment become sexist? analysis and classification of ambivalent sexism using twitter data. In Proceedings of the second workshop on NLP and computational social science, pages 7–16, 2017.
  27. Fasttext.zip: Compressing text classification models. arXiv preprint arXiv:1612.03651, 2016.
  28. A deep neural network based multi-task learning approach to hate speech detection. Knowledge-Based Systems, 210:106458, 2020.
  29. Locate the hate: Detecting tweets against blacks. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 27, pages 1621–1622, 2013.
  30. Distributed representations of sentences and documents. In International conference on machine learning, pages 1188–1196. PMLR, 2014.
  31. K-mhas: A multi-label hate speech detection dataset in korean online news comment. arXiv preprint arXiv:2208.10684, 2022.
  32. Hate speech detection: Challenges and solutions. PloS one, 14(8):e0221152, 2019.
  33. Hatexplain: A benchmark dataset for explainable hate speech detection. arXiv preprint arXiv:2012.10289, 2020.
  34. Do characters abuse more than words? In Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 299–303, 2016.
  35. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111–3119, 2013.
  36. Ethos: a multi-label hate speech detection dataset. Complex & Intelligent Systems, 8(6):4663–4678, 2022.
  37. Vulnerable community identification using hate speech detection on social media. Information Processing & Management, 57(3):102087, 2020.
  38. A bert-based transfer learning approach for hate speech detection in online social media. In International Conference on Complex Networks and Their Applications, pages 928–940. Springer, 2019.
  39. Ru Ni and Huan Cao. Sentiment analysis based on glove and lstm-gru. In 2020 39th Chinese control conference (CCC), pages 7492–7497. IEEE, 2020.
  40. Abusive language detection in online user content. In Proceedings of the 25th international conference on world wide web, pages 145–153, 2016.
  41. Misogyny detection in twitter: a multilingual and cross-domain study. Information Processing & Management, 57(6):102360, 2020.
  42. One-step and two-step classification for abusive language detection on twitter. arXiv preprint arXiv:1706.01206, 2017.
  43. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pages 1532–1543, 2014.
  44. Resources and benchmark corpora for hate speech detection: a systematic review. Language Resources and Evaluation, 55:477–523, 2021.
  45. Using tf-idf n-gram and word embedding cluster ensembles for author profiling: Notebook for pan at clef 2017. In CEUR Workshop Proceedings, volume 1866. CEUR, 2017.
  46. Fake news detection based on news content and social contexts: a transformer-based approach. International Journal of Data Science and Analytics, 13(4):335–362, 2022.
  47. Systematic literature review of hate speech detection with text mining. In 2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS), pages 1–6. IEEE, 2020.
  48. Leveraging multilingual transformers for hate speech detection. arXiv preprint arXiv:2101.03207, 2021.
  49. Irlab@ iitbhu at hasoc 2019: Traditional machine learning for hate speech and offensive content identification. In FIRE (Working Notes), pages 308–314, 2019.
  50. A survey on hate speech detection using natural language processing. In Proceedings of the fifth international workshop on natural language processing for social media, pages 1–10, 2017.
  51. Vector representation of words for sentiment analysis using glove. In 2017 international conference on intelligent communication and computational techniques (icct), pages 279–284. IEEE, 2017.
  52. Several alternative term weighting methods for text representation and classification. Knowledge-Based Systems, 207:106399, 2020.
  53. Efficient transformers: A survey. arXiv preprint arXiv:2009.06732, 2020.
  54. Well-read students learn better: The impact of student initialization on knowledge distillation. arXiv preprint arXiv:1908.08962, 13, 2019.
  55. Interpretable multi-modal hate speech detection. arXiv preprint arXiv:2103.01616, 2021.
  56. Detecting hate speech on the world wide web. In Proceedings of the second workshop on language in social media, pages 19–26, 2012.
  57. Hateful symbols or hateful people? predictive features for hate speech detection on twitter. In Proceedings of the NAACL student research workshop, pages 88–93, 2016.
  58. Detecting offensive tweets via topical feature discovery over a large scale twitter corpus. In Proceedings of the 21st ACM international conference on Information and knowledge management, pages 1980–1984, 2012.
  59. Deep learning for detecting inappropriate content in text. International Journal of Data Science and Analytics, 6:273–286, 2018.
  60. Towards generalisable hate speech detection: a review on obstacles and solutions. PeerJ Computer Science, 7:e598, 2021.
  61. Detecting hate speech on twitter using a convolution-gru based deep neural network. In European semantic web conference, pages 745–760. Springer, 2018.
  62. Content-driven detection of cyberbullying on the instagram social network. In IJCAI International Joint Conference on Artificial Intelligence, volume 2016, pages 3952–3958, 2016.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Jitendra Singh Malik (1 paper)
  2. Hezhe Qiao (14 papers)
  3. Guansong Pang (82 papers)
  4. Anton van den Hengel (188 papers)
Citations (36)

Summary

We haven't generated a summary for this paper yet.