2000 character limit reached
PiCoGen: Generate Piano Covers with a Two-stage Approach (2407.20883v1)
Published 30 Jul 2024 in cs.SD and eess.AS
Abstract: Cover song generation stands out as a popular way of music making in the music-creative community. In this study, we introduce Piano Cover Generation (PiCoGen), a two-stage approach for automatic cover song generation that transcribes the melody line and chord progression of a song given its audio recording, and then uses the resulting lead sheet as the condition to generate a piano cover in the symbolic domain. This approach is advantageous in that it does not required paired data of covers and their original songs for training. Compared to an existing approach that demands such paired data, our evaluation shows that PiCoGen demonstrates competitive or even superior performance across songs of different musical genres.
- Song2Guitar: A difficulty-aware arrangement system for generating guitar solo covers from polyphonic audio of popular music. In Proc. International Society for Music Information Retrieval (ISMIR).
- Automatic music transcription: An overview. In Proc. IEEE Signal Processing Magazine (2019).
- Jongho Choi and Kyogu Lee. 2023. Pop2Piano: Pop audio-based piano cover generation. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
- Groove2groove: One-shot music style transfer with supervision from synthetic data. In Proc. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP) (2020).
- Melody transcription via generative pre-training. In Proc. International Society for Music Information Retrieval (ISMIR).
- Multitrack music Transformer. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
- MidiTok: A Python package for MIDI file tokenization. In Proc. International Society for Music Information Retrieval (ISMIR).
- MT3: Multi-task multitrack music transcription. In Proc. International Conference on Learning Representations (ICLR).
- Onsets and Frames: Dual-objective piano transcription. arXiv:1710.11153
- Sequence-to-sequence piano transcription with Transformers. In Proc. International Society for Music Information Retrieval (ISMIR).
- Spleeter: A fast and efficient music source separation tool with pre-trained models. In Proc. Journal of Open Source Software (2020).
- Compound word Transformer: Learning to compose full-song music over dynamic directed hypergraphs. In Proc. AAAI Conference on Artificial Intelligence (AAAI).
- Music Transformer. In Proc. International Conference on Learning Representations (ICLR).
- Yu-Siang Huang and Yi-Hsuan Yang. 2020. Pop music Transformer: Beat-based modeling and generation of expressive pop piano compositions. In Proc. ACM Multimedia (ACM MM).
- High-resolution piano transcription with pedals by regressing onset and offset times. In Proc. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP) (2021).
- Hao-Min Liu and Yi-Hsuan Yang. 2018. Lead sheet generation and arrangement by conditional generative adversarial network. In Proc. IEEE International Conference on Machine Learning and Applications (ICMLA). 722–727.
- MuseCoco: Generating symbolic music from text. arXiv:2306.00110
- Matthias Mauch and Simon Dixon. 2014. pYIN: A fundamental frequency estimator using probabilistic threshold distributions. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
- This time with feeling: Learning expressive musical performance. In Proc. Neural Computing and Applications (2018).
- A hierarchical latent vector model for learning long-term structure in music. In Proc. International Conference on Machine Learning (ICML).
- Matti P. Ryynänen and Anssi P. Klapuri. 2008. Automatic transcription of melody, bass line, and chords in polyphonic music. Computer Music Journal (2008).
- Adoption of AI technology in music mixing workflow: an investigation. Audio Engineering Society (AES).
- Audio cover song identification and similarity: background, approaches, evaluation, and beyond. In Proc. Advances in music information retrieval (2010).
- Summarizing and comparing music data and its application on cover song identification. In Proc. International Society for Music Information Retrieval (ISMIR).
- Audio-based automatic generation of a piano reduction score by considering the musical structure. In Proc. MultiMedia Modeling: 25th International Conference (MMM).
- Automatic piano transcription with hierarchical frequency-time Transformer. In Proc. International Society for Music Information Retrieval (ISMIR).
- George Tzanetakis and Perry Cook. 2002. Musical genre classification of audio signals. In Proc. IEEE Transactions on Speech and Audio Processing (2002).
- Alexandra L Uitdenbogerd and Justin Zobel. 1998. Manipulation of music for melody matching. In Proc. ACM Multimedia (ACM MM).
- FIGARO: Generating symbolic music with fine-grained artistic control. In Proc. International Conference on Learning Representations (ICLR).
- POP909: A pop-song dataset for music arrangement generation. In In Proc. International Society for Music Information Retrieval (ISMIR).
- Automatic generation of lead Sheets from polyphonic music signals. In Proc. International Society for Music Information Retrieval (ISMIR).
- Shih-Lun Wu and Yi-Hsuan Yang. 2021. MuseMorphose: Full-song and fine-grained piano music style transfer with one Transformer VAE. In Proc. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP) (2021).
- Shih-Lun Wu and Yi-Hsuan Yang. 2023. Compose & Embellish: Well-structured piano performance generation via a two-Stage Approach. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
- Da-TACOS: A dataset for cover song identification and understanding. In Proc. International Society for Music Information Retrieval (ISMIR).