Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-Source Diffusion Models for Simultaneous Music Generation and Separation (2302.02257v4)

Published 4 Feb 2023 in cs.SD, cs.LG, and eess.AS

Abstract: In this work, we define a diffusion-based generative model capable of both music synthesis and source separation by learning the score of the joint probability density of sources sharing a context. Alongside the classic total inference tasks (i.e., generating a mixture, separating the sources), we also introduce and experiment on the partial generation task of source imputation, where we generate a subset of the sources given the others (e.g., play a piano track that goes well with the drums). Additionally, we introduce a novel inference method for the separation task based on Dirac likelihood functions. We train our model on Slakh2100, a standard dataset for musical source separation, provide qualitative results in the generation settings, and showcase competitive quantitative results in the source separation setting. Our method is the first example of a single model that can handle both generation and separation tasks, thus representing a step toward general audio models.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Giorgio Mariani (7 papers)
  2. Irene Tallini (4 papers)
  3. Emilian Postolache (11 papers)
  4. Michele Mancusi (12 papers)
  5. Luca Cosmo (24 papers)
  6. Emanuele RodolĂ  (90 papers)
Citations (32)

Summary

We haven't generated a summary for this paper yet.