
M2H2: A Multimodal Multiparty Hindi Dataset For Humor Recognition in Conversations (2108.01260v1)

Published 3 Aug 2021 in cs.CL

Abstract: Humor recognition in conversations is a challenging task that has recently gained popularity due to its importance in dialogue understanding, including in multimodal settings (i.e., text, acoustic, and visual). The few existing datasets for humor are mostly in English. However, due to the tremendous growth in multilingual content, there is a great demand for models and systems that support multilingual information access. To this end, we propose a dataset for Multimodal Multiparty Hindi Humor (M2H2) recognition in conversations, containing 6,191 utterances from 13 episodes of the very popular TV series "Shrimaan Shrimati Phir Se". Each utterance is annotated with a humor/non-humor label and encompasses acoustic, visual, and textual modalities. We propose several strong multimodal baselines and show the importance of contextual and multimodal information for humor recognition in conversations. The empirical results on the M2H2 dataset demonstrate that multimodal information complements unimodal information for humor recognition. The dataset and the baselines are available at http://www.iitp.ac.in/~ai-nlp-ml/resources.html and https://github.com/declare-lab/M2H2-dataset.

Authors (8)
  1. Dushyant Singh Chauhan (3 papers)
  2. Gopendra Vikram Singh (3 papers)
  3. Navonil Majumder (48 papers)
  4. Amir Zadeh (36 papers)
  5. Asif Ekbal (74 papers)
  6. Pushpak Bhattacharyya (153 papers)
  7. Soujanya Poria (138 papers)
  8. Louis-Philippe Morency (123 papers)
Citations (16)
