Engraving Oriented Joint Estimation of Pitch Spelling and Local and Global Keys (2402.10247v1)
Abstract: We revisit the problems of pitch spelling and tonality guessing with a new algorithm for their joint estimation from a MIDI file that includes information about the measure boundaries. Our algorithm identifies not only a global key but also local keys throughout the analyzed piece. It uses dynamic programming techniques to search for an optimal spelling in terms, roughly, of the number of accidental symbols that would be displayed in the engraved score. The evaluation of this number is coupled with an estimation of the global key and of local keys, one for each measure. Each of these three pieces of information is used to estimate the others, in a multi-step procedure. An evaluation conducted on a monophonic and a piano dataset, comprising 216 464 notes in total, shows a high degree of accuracy, both for pitch spelling (99.5% on average on the Bach corpus and 98.2% on the whole dataset) and for global key signature estimation (93.0% on average, 95.58% on the piano dataset). Originally designed as a backend tool in a music transcription framework, this method should also be useful in other tasks related to music notation processing.
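The core idea of the abstract, choosing spellings that minimize the accidentals printed in each measure and letting that count inform the key signature, can be illustrated with a small Python sketch. This is not the authors' implementation: the function names (`spell_measure`, `guess_signature`), the cost of one unit per printed accidental, and the restriction to alterations between double flat and double sharp are all assumptions made for illustration, and local key estimation is left out entirely.

```python
from functools import lru_cache

LETTERS = "CDEFGAB"
NATURAL_PC = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}
SHARP_ORDER = "FCGDAEB"  # order in which sharps enter a key signature
ALT_SYMBOL = {-2: "bb", -1: "b", 0: "", 1: "#", 2: "##"}


def key_signature(fifths):
    """Default alteration of each letter for a signature with `fifths`
    sharps (positive) or flats (negative)."""
    sig = {letter: 0 for letter in LETTERS}
    order = SHARP_ORDER if fifths > 0 else SHARP_ORDER[::-1]
    for letter in order[:abs(fifths)]:
        sig[letter] = 1 if fifths > 0 else -1
    return sig


def candidates(midi):
    """All (letter, alteration, octave) spellings of a MIDI pitch."""
    found = []
    for letter in LETTERS:
        for alt in range(-2, 3):
            if (NATURAL_PC[letter] + alt) % 12 == midi % 12:
                octave = (midi - alt - NATURAL_PC[letter]) // 12 - 1
                found.append((letter, alt, octave))
    return found


def spell_measure(pitches, fifths):
    """Return (cost, spellings) minimizing printed accidentals in one
    measure. Assumed cost model: 1 per note whose alteration differs
    from the one currently in force at its staff position."""
    sig = key_signature(fifths)

    @lru_cache(maxsize=None)
    def best(i, state):
        # `state` records accidentals already printed in this measure.
        if i == len(pitches):
            return 0, ()
        in_measure = dict(state)
        result = None
        for letter, alt, octave in candidates(pitches[i]):
            position = (letter, octave)
            in_force = in_measure.get(position, sig[letter])
            cost = 0 if alt == in_force else 1
            updated = dict(in_measure)
            updated[position] = alt
            rest_cost, rest = best(i + 1, tuple(sorted(updated.items())))
            if result is None or cost + rest_cost < result[0]:
                result = (cost + rest_cost, ((letter, alt, octave),) + rest)
        return result

    total, spelling = best(0, ())
    return total, list(spelling)


def guess_signature(measures):
    """Pick the signature (-7..7 fifths) with the fewest accidentals."""
    return min(range(-7, 8),
               key=lambda k: sum(spell_measure(m, k)[0] for m in measures))


def name(spelling):
    letter, alt, octave = spelling
    return f"{letter}{ALT_SYMBOL[alt]}{octave}"
```

Under these assumptions, `guess_signature([[62, 66, 67, 69], [73, 74, 71, 69]])` selects two sharps, and `spell_measure([61, 61], 0)` costs a single accidental because the second note inherits the first one's sharp. The method described in the abstract goes further by also estimating a local key for each measure and by using each estimate to refine the others.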