Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Cross-Modal Music Retrieval and Applications: An Overview of Key Methodologies (1902.04397v1)

Published 12 Feb 2019 in cs.IR and cs.MM

Abstract: There has been a rapid growth of digitally available music data, including audio recordings, digitized images of sheet music, album covers and liner notes, and video clips. This huge amount of data calls for retrieval strategies that allow users to explore large music collections in a convenient way. More precisely, there is a need for cross-modal retrieval algorithms that, given a query in one modality (e.g., a short audio excerpt), find corresponding information and entities in other modalities (e.g., the name of the piece and the sheet music). This goes beyond exact audio identification and subsequent retrieval of metainformation as performed by commercial applications like Shazam [1].

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Meinard Müller (12 papers)
  2. Andreas Arzt (14 papers)
  3. Stefan Balke (2 papers)
  4. Matthias Dorfer (21 papers)
  5. Gerhard Widmer (144 papers)
Citations (45)

Summary

We haven't generated a summary for this paper yet.