Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Neural Percussive Synthesis Parameterised by High-Level Timbral Features (1911.11853v2)

Published 25 Nov 2019 in eess.AS, cs.LG, cs.SD, and stat.ML

Abstract: We present a deep neural network-based methodology for synthesising percussive sounds with control over high-level timbral characteristics of the sounds. This approach allows for intuitive control of a synthesizer, enabling the user to shape sounds without extensive knowledge of signal processing. We use a feedforward convolutional neural network-based architecture, which is able to map input parameters to the corresponding waveform. We propose two datasets to evaluate our approach on both a restrictive context, and in one covering a broader spectrum of sounds. The timbral features used as parameters are taken from recent literature in signal processing. We also use these features for evaluation and validation of the presented model, to ensure that changing the input parameters produces a congruent waveform with the desired characteristics. Finally, we evaluate the quality of the output sound using a subjective listening test. We provide sound examples and the system's source code for reproducibility.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. António Ramires (7 papers)
  2. Pritish Chandna (10 papers)
  3. Xavier Favory (12 papers)
  4. Emilia Gómez (49 papers)
  5. Xavier Serra (82 papers)
Citations (21)

Summary

We haven't generated a summary for this paper yet.