
Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition (2406.05065v2)

Published 7 Jun 2024 in eess.AS

Abstract: Speech Emotion Recognition (SER) is growing rapidly and has diverse global applications, from improving human-computer interaction to aiding mental health diagnostics. However, SER models might contain social bias toward gender, leading to unfair outcomes. This study analyzes gender bias in SER models trained with Self-Supervised Learning (SSL) at scale and explores the factors that influence it. SSL-based SER models are chosen for their cutting-edge performance. Our research pioneers the study of gender bias in SER from both upstream model and data perspectives. Our findings reveal that SER models perform slightly better overall on female speakers than on male speakers. Modified CPC and XLS-R, two well-known SSL models, notably exhibit significant bias. Moreover, models trained with Mandarin datasets display a pronounced bias toward valence. Lastly, we find that gender-wise emotion distribution differences in training data significantly affect gender bias, while upstream model representation has a limited impact.
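
As an illustration of what a gender-wise performance comparison can look like, the following is a minimal sketch and not the paper's evaluation protocol. It assumes macro-F1 as the per-group score and defines the gap as the female score minus the male score; the abstract does not name the metric, so both choices, along with the function names `macro_f1` and `gender_gap` and the label strings, are hypothetical.

```python
from collections import defaultdict


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over the labels seen in this group."""
    labels = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores) if f1_scores else 0.0


def gender_gap(y_true, y_pred, genders):
    """Return per-gender macro-F1 and the (assumed) gap: female minus male."""
    groups = defaultdict(lambda: ([], []))
    for t, p, g in zip(y_true, y_pred, genders):
        groups[g][0].append(t)
        groups[g][1].append(p)
    scores = {g: macro_f1(ts, ps) for g, (ts, ps) in groups.items()}
    gap = scores.get("female", 0.0) - scores.get("male", 0.0)
    return scores, gap


# Toy data for illustration only; not taken from the paper.
y_true = ["happy", "sad", "happy", "sad"]
y_pred = ["happy", "sad", "sad", "sad"]
genders = ["female", "female", "male", "male"]
scores, gap = gender_gap(y_true, y_pred, genders)
```

Under this assumed definition, a positive gap means the model scores higher on female speakers and a negative gap means it scores higher on male speakers.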

Authors (5)
  1. Yi-Cheng Lin (24 papers)
  2. Haibin Wu (85 papers)
  3. Huang-Cheng Chou (9 papers)
  4. Chi-Chun Lee (11 papers)
  5. Hung-yi Lee (327 papers)
Citations (3)
