Disentangling Timbre and Singing Style with Multi-singer Singing Synthesis System (1910.13069v1)

Published 29 Oct 2019 in cs.SD and eess.AS

Abstract: In this study, we define the identity of the singer with two independent concepts, timbre and singing style, and propose a multi-singer singing synthesis system that can model them separately. To this end, we extend our single-singer model into a multi-singer model in the following ways: first, we design a singer identity encoder that can adequately reflect the identity of a singer. Second, we use the encoded singer identity to condition two independent decoders that model timbre and singing style, respectively. Through a user study with listening tests, we experimentally verify that the proposed framework is capable of generating a natural singing voice of high quality while independently controlling the timbre and singing style. Also, by changing singing styles while fixing the timbre, we suggest that our proposed network can produce a more expressive singing voice.

Authors (4)
  1. Juheon Lee (24 papers)
  2. Hyeong-Seok Choi (16 papers)
  3. Junghyun Koo (20 papers)
  4. Kyogu Lee (75 papers)
Citations (16)
