Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models (2403.16331v1)
Abstract: We describe a novel approach for developing realistic digital models of dynamic range compressors for digital audio production by analyzing their analog prototypes. While realistic digital dynamic range compressors are potentially useful for many applications, the design process is challenging because the compressors operate nonlinearly over long time scales. Our approach is based on the structured state space sequence model (S4). Its underlying state-space model (SSM) has proven efficient at learning long-range dependencies, which makes it promising for modeling dynamic range compressors. In this paper we present a deep learning model with S4 layers that models the Teletronix LA-2A analog dynamic range compressor. The model is causal, executes efficiently in real time, and achieves roughly the same quality as previous deep-learning models but with fewer parameters.
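To make the "causal" and "long time scales" claims concrete, here is a minimal sketch of the linear recurrence at the core of a diagonal state-space layer, run one sample at a time. It is not the architecture from the paper: a full model would stack such layers with nonlinearities, and S4 uses a structured parameterization that is not reproduced here. The function names, the zero-order-hold discretization, the state size, and all parameter values (including the 44.1 kHz step size) are assumptions chosen for illustration.

```python
import numpy as np

def discretize_zoh(lam, b, dt):
    """Zero-order-hold discretization of a diagonal continuous-time SSM
    x'(t) = lam * x(t) + b * u(t)  (lam and b are vectors, one entry per state)."""
    a_bar = np.exp(lam * dt)
    b_bar = (a_bar - 1.0) / lam * b
    return a_bar, b_bar

def ssm_step(state, u_k, a_bar, b_bar, c, d):
    """Advance one sample: x[k] = a_bar * x[k-1] + b_bar * u[k],
    y[k] = Re(c . x[k]) + d * u[k]."""
    state = a_bar * state + b_bar * u_k
    y_k = float(np.real(np.dot(c, state))) + d * u_k
    return state, y_k

def run_causal(u, lam, b, c, d, dt):
    """Process a signal sample by sample; y[k] depends only on u[0..k]."""
    a_bar, b_bar = discretize_zoh(lam, b, dt)
    state = np.zeros_like(lam)  # complex state vector, starts at rest
    y = np.empty(len(u))
    for k, u_k in enumerate(u):
        state, y[k] = ssm_step(state, u_k, a_bar, b_bar, c, d)
    return y

# Hypothetical parameters (illustrative only, not taken from the paper).
rng = np.random.default_rng(0)
n_states = 4
# Poles with negative real part keep the recurrence stable.
lam = -(np.abs(rng.standard_normal(n_states)) + 0.5) + 1j * rng.standard_normal(n_states)
b = np.ones(n_states, dtype=complex)
c = rng.standard_normal(n_states) + 1j * rng.standard_normal(n_states)
d = 0.5
dt = 1.0 / 44100.0  # assumed sample period

u = rng.standard_normal(64)
y = run_causal(u, lam, b, c, d, dt)
```

Because each output sample is computed from the current input and a fixed-size state, the per-sample cost is constant, which is what makes this kind of recurrence suitable for real-time use. With poles close to the imaginary axis, `a_bar` is close to 1 in magnitude, so the state decays slowly and can carry information across many thousands of samples.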