Frequency estimation via spectrogram-based losses using gradient descent
Determine whether frequency estimation from audio signals can be achieved reliably by gradient-descent optimization that minimizes spectrogram-based loss functions in differentiable synthesizer sound matching, and identify the conditions under which such optimization converges to accurate frequency values.
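One intuition for why this is hard can be sketched with a toy loss sweep. The snippet below is only an illustration under stated assumptions, not the setup used in the cited paper: it uses a single Hann-windowed frame instead of a full spectrogram, a pure sine as the "synthesizer", and an L1 distance between magnitude spectra. The sample rate, frame length, target frequency, and the helper names `magnitude_spectrum` and `spectral_l1_loss` are all hypothetical choices made for the example.

```python
import cmath
import math

SAMPLE_RATE = 1280.0  # Hz, assumed for illustration
N = 128               # frame length, so DFT bins are 10 Hz apart


def magnitude_spectrum(freq_hz):
    """Magnitude DFT (bins 0..N/2) of one Hann-windowed sine frame."""
    frame = [
        (0.5 - 0.5 * math.cos(2 * math.pi * n / N))          # periodic Hann window
        * math.sin(2 * math.pi * freq_hz * n / SAMPLE_RATE)  # sine at freq_hz
        for n in range(N)
    ]
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * n / N) for n, x in enumerate(frame)))
        for k in range(N // 2 + 1)
    ]


def spectral_l1_loss(candidate_hz, target_spectrum):
    """Mean absolute difference between candidate and target magnitude spectra."""
    candidate = magnitude_spectrum(candidate_hz)
    return sum(abs(c - t) for c, t in zip(candidate, target_spectrum)) / len(target_spectrum)


TARGET_HZ = 200.0  # hypothetical ground-truth frequency
target = magnitude_spectrum(TARGET_HZ)

# Sweep a few candidate frequencies, from exact match to far away.
for f in (200.0, 205.0, 300.0, 400.0, 500.0):
    print(f"{f:6.1f} Hz  loss = {spectral_l1_loss(f, target):.4f}")
```

In this toy setting the loss is zero at the target and smaller for a candidate half a bin away, but candidates at 300, 400, and 500 Hz should all report essentially the same loss, because their spectral peaks no longer overlap the target's peak. On such a plateau the loss carries little information about the direction of the target, which is one plausible reason a gradient-based estimate started far from the true frequency may fail to converge.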
References
We note that in the context of sound matching, the sub-task of frequency estimation through gradient descent techniques via minimizing spectrogram-based losses is an intrinsic challenge that remains open, as we discovered through our own experimentation.
— DiffMoog: a Differentiable Modular Synthesizer for Sound Matching
(2401.12570 - Uzrad et al., 23 Jan 2024) in Section 1 (Introduction)