Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
166 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Music Genre Classification: Training an AI model (2405.15096v1)

Published 23 May 2024 in cs.SD, cs.LG, and eess.AS

Abstract: Music genre classification is an area that utilizes machine learning models and techniques for the processing of audio signals, in which applications range from content recommendation systems to music recommendation systems. In this research I explore various machine learning algorithms for the purpose of music genre classification, using features extracted from audio signals.The systems are namely, a Multilayer Perceptron (built from scratch), a k-Nearest Neighbours (also built from scratch), a Convolutional Neural Network and lastly a Random Forest wide model. In order to process the audio signals, feature extraction methods such as Short-Time Fourier Transform, and the extraction of Mel Cepstral Coefficients (MFCCs), is performed. Through this extensive research, I aim to asses the robustness of machine learning models for genre classification, and to compare their results.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (10)
  1. Understanding of a convolutional neural network. In 2017 international conference on engineering and technology (ICET), pages 1–6. Ieee, 2017.
  2. Ambiguity modelling with label distribution learning for music classification. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 611–615. IEEE, 2022.
  3. Aman Kataria and MD Singh. A review of data classification using k-nearest neighbour algorithm. International Journal of Emerging Technology and Advanced Engineering, 3(6):354–360, 2013.
  4. Andrada Olteanu. Gtzan dataset-music genre classification. Kaggle. com, 2020.
  5. Y-S Park and S Lek. Artificial neural networks: Multilayer perceptron for ecological modeling. In Developments in environmental modelling, volume 28, pages 123–140. Elsevier, 2016.
  6. Michael Portnoff. Time-frequency representation of digital signals and systems based on short-time fourier analysis. IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(1):55–69, 1980.
  7. Comparison of svm, knn, and nb classifier for genre music classification based on metadata. In 2020 international seminar on application for technology of information and communication (iSemantic), pages 12–16. IEEE, 2020.
  8. A machine learning approach to automatic music genre classification. Journal of the Brazilian Computer Society, 14:7–18, 2008.
  9. Sadie A Stafford. Music in the digital age: The emergence of digital music and its repercussions on the music industry. The Elon Journal of Undergraduate Research in Communications, 1(2):112–120, 2010.
  10. Musical genre classification of audio signals. IEEE Transactions on speech and audio processing, 10(5):293–302, 2002.
Citations (1)

Summary

We haven't generated a summary for this paper yet.