Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Multimodal Framework for the Detection of Hateful Memes (2012.12871v2)

Published 23 Dec 2020 in cs.CL and cs.AI

Abstract: An increasingly common expression of online hate speech is multimodal in nature and comes in the form of memes. Designing systems to automatically detect hateful content is of paramount importance if we are to mitigate its undesirable effects on the society at large. The detection of multimodal hate speech is an intrinsically difficult and open problem: memes convey a message using both images and text and, hence, require multimodal reasoning and joint visual and language understanding. In this work, we seek to advance this line of research and develop a multimodal framework for the detection of hateful memes. We improve the performance of existing multimodal approaches beyond simple fine-tuning and, among others, show the effectiveness of upsampling of contrastive examples to encourage multimodality and ensemble learning based on cross-validation to improve robustness. We furthermore analyze model misclassifications and discuss a number of hypothesis-driven augmentations and their effects on performance, presenting important implications for future research in the field. Our best approach comprises an ensemble of UNITER-based models and achieves an AUROC score of 80.53, placing us 4th on phase 2 of the 2020 Hateful Memes Challenge organized by Facebook.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Phillip Lippe (21 papers)
  2. Nithin Holla (4 papers)
  3. Shantanu Chandra (2 papers)
  4. Santhosh Rajamanickam (2 papers)
  5. Georgios Antoniou (18 papers)
  6. Ekaterina Shutova (52 papers)
  7. Helen Yannakoudakis (32 papers)
Citations (64)

Summary

We haven't generated a summary for this paper yet.