Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improving Deep Learning Sound Events Classifiers using Gram Matrix Feature-wise Correlations (2102.11771v1)

Published 23 Feb 2021 in cs.SD, cs.LG, and eess.AS

Abstract: In this paper, we propose a new Sound Event Classification (SEC) method which is inspired in recent works for out-of-distribution detection. In our method, we analyse all the activations of a generic CNN in order to produce feature representations using Gram Matrices. The similarity metrics are evaluated considering all possible classes, and the final prediction is defined as the class that minimizes the deviation with respect to the features seeing during training. The proposed approach can be applied to any CNN and our experimental evaluation of four different architectures on two datasets demonstrated that our method consistently improves the baseline models.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Antonio Joia Neto (6 papers)
  2. Andre G C Pacheco (1 paper)
  3. Diogo C Luvizon (3 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.