Motif-Centric Representation Learning for Symbolic Music (2309.10597v1)

Published 19 Sep 2023 in cs.SD, cs.LG, and eess.AS

Abstract: The music motif, a conceptual building block of composition, is crucial for music structure analysis and automatic composition. While human listeners can identify motifs easily, existing computational models fall short in representing motifs and their developments. The reason is that motifs are implicit in nature, and motif variations extend well beyond simple repetitions and modulations. In this study, we aim to learn the implicit relationship between motifs and their variations via representation learning, using a Siamese network architecture and a pretraining and fine-tuning pipeline. A regularization-based method, VICReg, is adopted for pretraining, while contrastive learning is used for fine-tuning. Experimental results on a retrieval-based task show that these two methods complement each other, yielding an improvement of 12.6% in the area under the precision-recall curve. Lastly, we visualize the learned motif representations, offering an intuitive view of the overall structure of a music piece. As far as we know, this work marks a noteworthy step forward in the computational modeling of music motifs. We believe that it lays the foundations for future applications of motifs in automatic music composition and music information retrieval.
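For readers unfamiliar with the pretraining objective named in the abstract, the following is a minimal NumPy sketch of a VICReg-style loss applied to the two branches of a Siamese encoder. It is an illustration only, not the authors' implementation: the function name `vicreg_loss`, the weights (25, 25, 1), the epsilon, and the embedding shapes are all assumptions chosen for the example, and the abstract does not state which values the paper uses.

```python
import numpy as np


def vicreg_loss(z_a, z_b, inv_w=25.0, var_w=25.0, cov_w=1.0, eps=1e-4):
    """VICReg-style loss for two batches of embeddings of shape (n, d).

    z_a and z_b are assumed to be the outputs of the two Siamese branches
    for paired inputs (for example, a motif and one of its variations).
    All weights are illustrative assumptions, not values from the paper.
    """
    n, d = z_a.shape

    # Invariance: paired embeddings should be close to each other.
    invariance = np.mean((z_a - z_b) ** 2)

    def variance_term(z):
        # Hinge on the per-dimension standard deviation: penalizes any
        # dimension whose spread across the batch falls below 1.
        std = np.sqrt(z.var(axis=0, ddof=1) + eps)
        return np.mean(np.maximum(0.0, 1.0 - std))

    def covariance_term(z):
        # Penalize off-diagonal covariance so dimensions stay decorrelated.
        centered = z - z.mean(axis=0)
        cov = centered.T @ centered / (n - 1)
        off_diag = cov - np.diag(np.diag(cov))
        return np.sum(off_diag ** 2) / d

    return (
        inv_w * invariance
        + var_w * (variance_term(z_a) + variance_term(z_b))
        + cov_w * (covariance_term(z_a) + covariance_term(z_b))
    )


# Usage: collapsed embeddings (all zeros) are penalized by the variance
# term, while well-spread embeddings incur a much smaller loss.
rng = np.random.default_rng(0)
spread = rng.standard_normal((256, 8))
collapsed = np.zeros((256, 8))
loss_spread = vicreg_loss(spread, spread)
loss_collapsed = vicreg_loss(collapsed, collapsed)
```

The variance and covariance terms are what let this objective avoid collapse without negative pairs; a contrastive objective, which the abstract says is used for fine-tuning, instead relies on explicitly pushing non-matching pairs apart.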

