Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Two-Stream Aural-Visual Affect Analysis in the Wild (2002.03399v2)

Published 9 Feb 2020 in cs.CV, cs.LG, and stat.ML

Abstract: Human affect recognition is an essential part of natural human-computer interaction. However, current methods are still in their infancy, especially for in-the-wild data. In this work, we introduce our submission to the Affective Behavior Analysis in-the-wild (ABAW) 2020 competition. We propose a two-stream aural-visual analysis model to recognize affective behavior from videos. Audio and image streams are first processed separately and fed into a convolutional neural network. Instead of applying recurrent architectures for temporal analysis we only use temporal convolutions. Furthermore, the model is given access to additional features extracted during face-alignment. At training time, we exploit correlations between different emotion representations to improve performance. Our model achieves promising results on the challenging Aff-Wild2 database.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Felix Kuhnke (3 papers)
  2. Lars Rumberg (1 paper)
  3. Jörn Ostermann (11 papers)
Citations (71)

Summary

We haven't generated a summary for this paper yet.