Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition (2407.01143v1)

Published 1 Jul 2024 in cs.SD, cs.AI, and eess.AS

Abstract: Uncertainty Quantification (UQ) is an important building block for the reliable use of neural networks in real-world scenarios, as it can be a useful tool in identifying faulty predictions. Speech emotion recognition (SER) models can suffer from particularly many sources of uncertainty, such as the ambiguity of emotions, Out-of-Distribution (OOD) data or, in general, poor recording conditions. Reliable UQ methods are thus of particular interest as in many SER applications no prediction is better than a faulty prediction. While the effects of label ambiguity on uncertainty are well documented in the literature, we focus our work on an evaluation of UQ methods for SER under common challenges in real-world application, such as corrupted signals, and the absence of speech. We show that simple UQ methods can already give an indication of the uncertainty of a prediction and that training with additional OOD data can greatly improve the identification of such signals.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Oliver Schrüfer (2 papers)
  2. Manuel Milling (20 papers)
  3. Felix Burkhardt (11 papers)
  4. Florian Eyben (14 papers)
  5. Björn Schuller (83 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.