2000 character limit reached
A Hybrid Approach to Audio-to-Score Alignment (2007.14333v1)
Published 28 Jul 2020 in eess.AS, cs.LG, and cs.SD
Abstract: Audio-to-score alignment aims at generating an accurate mapping between a performance audio and the score of a given piece. Standard alignment methods are based on Dynamic Time Warping (DTW) and employ handcrafted features. We explore the usage of neural networks as a preprocessing step for DTW-based automatic alignment methods. Experiments on music data from different acoustic conditions demonstrate that this method generates robust alignments whilst being adaptable at the same time.