
Augmenting Generative Adversarial Networks for Speech Emotion Recognition (2005.08447v3)

Published 18 May 2020 in cs.SD and eess.AS

Abstract: Generative adversarial networks (GANs) have shown potential in learning emotional attributes and generating new data samples. However, their performance is usually hindered by the unavailability of larger speech emotion recognition (SER) datasets. In this work, we propose a framework that utilises the mixup data augmentation scheme to augment the GAN in feature learning and generation. To show the effectiveness of the proposed framework, we present results for SER on (i) synthetic feature vectors, (ii) augmentation of the training data with synthetic features, and (iii) encoded features in a compressed representation. Our results show that the proposed framework can effectively learn compressed emotional representations and can generate synthetic samples that help improve performance in within-corpus and cross-corpus evaluation.
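
The abstract names the mixup data augmentation scheme without giving details. As a rough illustration only, the standard mixup formulation blends two training examples and their labels with a weight drawn from a Beta distribution. The sketch below shows that generic formulation and is not the paper's implementation; the function name `mixup_pair`, the default `alpha=0.2`, and the toy feature vectors are assumptions made for the example.

```python
import random


def mixup_pair(x_i, y_i, x_j, y_j, alpha=0.2, rng=random):
    """Blend two (feature, one-hot label) pairs using generic mixup.

    Hypothetical helper for illustration; alpha=0.2 is an assumed default,
    not a value taken from the paper.
    """
    # Mixing weight lam ~ Beta(alpha, alpha), always within [0, 1].
    lam = rng.betavariate(alpha, alpha)
    # Convex combination of the two feature vectors.
    x = [lam * a + (1 - lam) * b for a, b in zip(x_i, x_j)]
    # The same convex combination applied to the one-hot labels.
    y = [lam * a + (1 - lam) * b for a, b in zip(y_i, y_j)]
    return x, y, lam


if __name__ == "__main__":
    # Toy two-dimensional features with two emotion classes (made-up data).
    mixed_x, mixed_y, lam = mixup_pair([1.0, 0.0], [1.0, 0.0],
                                       [0.0, 1.0], [0.0, 1.0])
    print(lam, mixed_x, mixed_y)
```

Because the labels are mixed with the same weight as the features, the blended label remains a valid probability distribution over classes.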

Authors (6)
  1. Siddique Latif (38 papers)
  2. Muhammad Asim (15 papers)
  3. Rajib Rana (52 papers)
  4. Sara Khalifa (21 papers)
  5. Raja Jurdak (108 papers)
  6. Björn W. Schuller (153 papers)
Citations (25)
