
Author Profiling for Hate Speech Detection (1902.06734v1)

Published 14 Feb 2019 in cs.CL

Abstract: The rapid growth of social media in recent years has fed into some highly undesirable phenomena, such as the proliferation of abusive and offensive language on the Internet. Previous research suggests that such hateful content tends to come from users who share a set of common stereotypes and form communities around them. The current state-of-the-art approaches to hate speech detection are oblivious to user and community information and rely entirely on textual (i.e., lexical and semantic) cues. In this paper, we propose a novel approach to this problem that incorporates community-based profiling features of Twitter users. Experimenting with a dataset of 16k tweets, we show that our methods significantly outperform the current state of the art in hate speech detection. Further, we conduct a qualitative analysis of model characteristics. We release our code, pre-trained models and all the resources used in the public domain.

Authors (4)
  1. Pushkar Mishra (23 papers)
  2. Helen Yannakoudakis (32 papers)
  3. Ekaterina Shutova (52 papers)
  4. Marco del Tredici (13 papers)
Citations (17)
