A Comparison of Audio Preprocessing Techniques and Deep Learning Algorithms for Raga Recognition (2212.05335v1)

Published 10 Dec 2022 in cs.SD, cs.IR, and eess.AS

Abstract: Ragas form the foundation of Indian Classical Music. The task of Raga Recognition has recently gained traction in the Music Information Retrieval community, which can be attributed to the nuances of Indian Classical Music that have given rise to a plethora of research problems in Computing. In this work, we used two different digital audio signal processing (DASP) techniques to preprocess audio samples of Carnatic classical ragas, which were then processed by various Deep Learning models. Their results were compared in order to infer which DASP technique is better suited to the task of raga recognition. We obtained state-of-the-art results, with our best model reaching a testing accuracy of 98.1%. We also compared each model's ability to distinguish between similar ragas.
