GANStrument: Adversarial Instrument Sound Synthesis with Pitch-invariant Instance Conditioning (2211.05385v2)

Published 10 Nov 2022 in cs.SD, cs.LG, and eess.AS

Abstract: We propose GANStrument, a generative adversarial model for instrument sound synthesis. Given a one-shot sound as input, it is able to generate pitched instrument sounds that reflect the timbre of the input within an interactive time. By exploiting instance conditioning, GANStrument achieves better fidelity and diversity of synthesized sounds and generalization ability to various inputs. In addition, we introduce an adversarial training scheme for a pitch-invariant feature extractor that significantly improves the pitch accuracy and timbre consistency. Experimental results show that GANStrument outperforms strong baselines that do not use instance conditioning in terms of generation quality and input editability. Qualitative examples are available online.

Authors (3)
  1. Gaku Narita (3 papers)
  2. Junichi Shimizu (1 paper)
  3. Taketo Akama (13 papers)
Citations (8)
