Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards Proper Contrastive Self-supervised Learning Strategies For Music Audio Representation (2207.04471v1)

Published 10 Jul 2022 in cs.SD, cs.AI, cs.MM, and eess.AS

Abstract: The common research goal of self-supervised learning is to extract a general representation which an arbitrary downstream task would benefit from. In this work, we investigate music audio representation learned from different contrastive self-supervised learning schemes and empirically evaluate the embedded vectors on various music information retrieval (MIR) tasks where different levels of the music perception are concerned. We analyze the results to discuss the proper direction of contrastive learning strategies for different MIR tasks. We show that these representations convey a comprehensive information about the auditory characteristics of music in general, although each of the self-supervision strategies has its own effectiveness in certain aspect of information.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Jeong Choi (4 papers)
  2. Seongwon Jang (3 papers)
  3. Hyunsouk Cho (11 papers)
  4. Sehee Chung (5 papers)
Citations (6)
Youtube Logo Streamline Icon: https://streamlinehq.com