Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research (2503.18802v1)

Published 24 Mar 2025 in cs.IR and cs.SD

Abstract: Data are crucial in various computer-related fields, including music information retrieval (MIR), an interdisciplinary area bridging computer science and music. This paper introduces CCMusic, an open and diverse database comprising multiple datasets specifically designed for tasks related to Chinese music, highlighting our focus on this culturally rich domain. The database integrates both published and unpublished datasets, with steps taken such as data cleaning, label refinement, and data structure unification to ensure data consistency and create ready-to-use versions. We conduct benchmark evaluations for all datasets using a unified evaluation framework developed specifically for this purpose. This publicly available framework supports both classification and detection tasks, ensuring standardized and reproducible results across all datasets. The database is hosted on HuggingFace and ModelScope, two open and multifunctional data and model hosting platforms, ensuring ease of accessibility and usability.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Monan Zhou (5 papers)
  2. Shenyang Xu (1 paper)
  3. Zhaorui Liu (2 papers)
  4. Zhaowen Wang (55 papers)
  5. Feng Yu (58 papers)
  6. Wei Li (1122 papers)
  7. Baoqiang Han (3 papers)

Summary

We haven't generated a summary for this paper yet.