Reinforcement Learning Jazz Improvisation: When Music Meets Game Theory (2403.03224v1)
Abstract: Live performances of music are always charming, with the unpredictability of improvisation due to the dynamic between musicians and interactions with the audience. Jazz improvisation is a particularly noteworthy example for further investigation from a theoretical perspective. Here, we introduce a novel mathematical game theory model for jazz improvisation, providing a framework for studying music theory and improvisational methodologies. We use computational modeling, mainly reinforcement learning, to explore diverse stochastic improvisational strategies and their paired performance on improvisation. We find that the most effective strategy pair is a strategy that reacts to the most recent payoff (Stepwise Changes) with a reinforcement learning strategy limited to notes in the given chord (Chord-Following Reinforcement Learning). Conversely, a strategy that reacts to the partner's last note and attempts to harmonize with it (Harmony Prediction) strategy pair yields the lowest non-control payoff and highest standard deviation, indicating that picking notes based on immediate reactions to the partner player can yield inconsistent outcomes. On average, the Chord-Following Reinforcement Learning strategy demonstrates the highest mean payoff, while Harmony Prediction exhibits the lowest. Our work lays the foundation for promising applications beyond jazz: including the use of AI models to extract data from audio clips to refine musical reward systems, and training ML models on existing jazz solos to further refine strategies within the game.
- David Wright. Mathematics and music, volume 28. American Mathematical Soc., 2009.
- Improvisation experience predicts how musicians categorize musical structures. Psychology of Music, 48(1):18–34, 2020.
- Game theory. MIT press, 1991.
- A game theoretical model for musical interaction. In Proceedings of the International Computer Music Conference, 08 2008.
- Revisiting the illiac suite–a rule-based approach to stochastic processes. Sonic Ideas/Ideas Sonicas, 2:42–46, 2009.
- Marie-Hélène Serra. Stochastic composition and stochastic timbre: Gendy3 by iannis xenakis. Perspectives of New Music, pages 236–257, 1993.
- Linda M Arsenault. Iannis xenakis’s” achorripsis”: The matrix game. Computer Music Journal, 26(1):58–72, 2002.
- Philip N Johnson-Laird. How jazz musicians improvise. Music perception, 19(3):415–442, 2002.
- Jeff Pressing. Cognitive processes in improvisation. In Advances in Psychology, volume 19, pages 345–363. Elsevier, 1984.
- Computational systems for music improvisation. Digital Creativity, 29(1):19–36, 2018.
- Paul Doornbusch. Algorithmic composition: Paradigms of automated music generation. Computer Music Journal, 34(3):70–74, 2010.
- Improving algorithmic music composition with machine learning. In 9th International Conference on Music Perception and Cognition, pages 1848–1854. Citeseer, 2006.
- Symbolic music generation with non-differentiable rule guided diffusion. arXiv preprint arXiv:2402.14285, 2024.
- Ho-Chun Herbert Chang. Multi-issue negotiation with deep reinforcement learning. Knowledge-Based Systems, 211:106544, 2021.
- Risk-aware multi-armed bandit problem with application to portfolio selection. Royal Society open science, 4(11):171377, 2017.
- The role of financial spinning, learning, and predation in market failure. Journal of the Knowledge Economy, 14(1):517–543, 2023.
- Whither game theory? towards a theory of learning in games. Journal of Economic Perspectives, 30(4):151–170, 2016.
- Ingrid Monson. Saying something: Jazz improvisation and interaction. University of Chicago Press, 2009.
- Norman Cazden. The definition of consonance and dissonance. International Review of the Aesthetics and Sociology of Music, pages 123–168, 1980.
- Consonance theory part i: Consonance of dyads. The Journal of the Acoustical Society of America, 45(6):1451–1459, 1969.
- Frieder Stolzenburg. Harmony perception by periodicity detection. Journal of Mathematics and Music, 9(3):215–238, 2015.
- Quantifying the evolution of harmony and novelty in western classical music. arXiv preprint arXiv:2308.03224, 2023.
- Diachronic changes in jazz harmony: A cognitive perspective. Music Perception: An Interdisciplinary Journal, 31(1):32–45, 2013.
- Eddie S Meadows. Improvising jazz a beginner’s guide. Music Educators Journal, 78(4):41–44, 1991.
- Bryn Hughes. Harmonic expectation in twelve-bar blues progressions. PhD thesis, ProQuest Dissertations and Theses, 2011.
- Axel Berndt. Variance in repetitive games music. In Proceedings of the Int. Computer Music Conf. (ICMC), pages 412–415, 09 2012.
- Evelyn C Pielou. The measurement of diversity in different types of biological collections. Journal of theoretical biology, 13:131–144, 1966.
- Reinforcement learning: A survey. Journal of artificial intelligence research, 4:237–285, 1996.
- Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term. Games and economic behavior, 8(1):164–212, 1995.
- The relation of culture, socio-economics, and friendship to music preferences: A large-scale, cross-country study. PloS one, 13(12):e0208186, 2018.
- Personality traits and music genre preferences: how music taste varies over age groups. In 1st Workshop on Temporal Reasoning in Recommender Systems (RecTemp) at the 11th ACM Conference on Recommender Systems, Como, August 31, 2017., volume 1922, pages 16–20. CEUR-WS, 2017.
- Harmony and conflict: A cross-cultural investigation in china and australia. Journal of Cross-Cultural Psychology, 42(5):795–816, 2011.
- Indifference to dissonance in native amazonians reveals cultural variation in music perception. Nature, 535(7613):547–550, 2016.
- Automatic estimation of harmonic tension by distributed representation of chords. In Music Technology with Swing: 13th International Symposium, CMMR 2017, Matosinhos, Portugal, September 25-28, 2017, Revised Selected Papers 13, pages 23–34. Springer, 2018.
- Score one for jazz: Working memory in jazz and classical musicians. Psychomusicology: Music, Mind, and Brain, 28(2):101, 2018.
- Working memory and musical competence of musicians and non-musicians. Psychology of Music, 41(6):779–793, 2013.
- Godfried Toussaint. The euclidean algorithm generates traditional musical rhythms. In Renaissance Banff: Mathematics, Music, Art, Culture, pages 47–56, 2005.
- Large-scale brain networks emerge from dynamic processing of musical timbre, key and rhythm. Neuroimage, 59(4):3677–3689, 2012.
- Human brain basis of musical rhythm perception: common and distinct neural substrates for meter, tempo, and pattern. Brain sciences, 4(2):428–452, 2014.
- Inside the Jazzomat - New Perspectives for Jazz Research. Schott Campus, 2017.
- Ensuring the greater good in hybrid ai-human systems: Comment on” reputation and reciprocity” by xia et al. Physics of life reviews, 48:41–43, 2023.
- David Matthew Franz. Markov chains as tools for jazz improvisation analysis. PhD thesis, Virginia Tech, 1998.
- Matthew W Butterfield. Why do jazz musicians swing their eighth notes? Music theory spectrum, 33(1):3–26, 2011.