Fine-Grained Music Plagiarism Detection: Revealing Plagiarists through Bipartite Graph Matching and a Comprehensive Large-Scale Dataset
Abstract: Music plagiarism detection is gaining more and more attention due to the popularity of music production and society's emphasis on intellectual property. We aim to find fine-grained plagiarism in music pairs since conventional methods are coarse-grained and cannot match real-life scenarios. Considering that there is no sizeable dataset designed for the music plagiarism task, we establish a large-scale simulated dataset, named Music Plagiarism Detection Dataset (MPD-Set) under the guidance and expertise of renowned researchers from national-level professional institutions in the field of music. MPD-Set considers diverse music plagiarism cases found in real life from the melodic, rhythmic, and tonal levels respectively. Further, we establish a Real-life Dataset for evaluation, where all plagiarism pairs are real cases. To detect the fine-grained plagiarism pairs effectively, we propose a graph-based method called Bipatite Melody Matching Detector (BMM-Det), which formulates the problem as a max matching problem in the bipartite graph. Experimental results on both the simulated and Real-life Datasets demonstrate that BMM-Det outperforms the existing plagiarism detection methods, and is robust to common plagiarism cases like transpositions, pitch shifts, duration variance, and melody change. Datasets and source code are open-sourced at https://github.com/xuan301/BMMDet_MPDSet.
- Searching digital music libraries. Information processing & management 41, 1 (2005), 41–56.
- Pitch contour tracking in music using Harmonic Locked Loops. In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 191–195. https://doi.org/10.1109/ICASSP.2017.7952144
- Music Plagiarism Detection using Audio Fingerprinting and Segment Matching. In 2021 Smart Technologies, Communication and Robotics (STCR). IEEE, 1–4.
- Content Based Singing Voice Extraction from a Musical Mixture. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 781–785. https://doi.org/10.1109/ICASSP40776.2020.9053024
- C Cronin. 2016. Columbia Law School & UCLA Law Copyright Infringement Project. update (2016).
- Music plagiarism at a glance: metrics of similarity and visualizations. In 2017 21st International Conference Information Visualisation (IV). IEEE, 410–415.
- Visualization of music plagiarism: Analysis and evaluation. In 2016 20th International Conference Information Visualisation (IV). IEEE, 177–182.
- A computational intelligence text-based detection system of music plagiarism. In 2017 4th International Conference on Systems and Informatics (ICSAI). IEEE, 519–524.
- Fuzzy vectorial-based similarity detection of music plagiarism. In 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE, 1–6.
- Audio forensics meets music information retrieval—a toolbox for inspection of music plagiarism. In 2012 Proceedings of the 20th European signal processing conference (EUSIPCO). IEEE, 1249–1253.
- Shyamala Doraisamy and Stefan Rüger. 2003. Robust polyphonic music retrieval with n-grams. Journal of Intelligent Information Systems 21, 1 (2003), 53–70.
- Audio Cover Song Identification: MIREX 2006-2007 Results and Analyses.. In ISMIR. 468–474.
- Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 241–245. https://doi.org/10.1109/ICASSP39728.2021.9414190
- Daniel Jurafsky and James H Martin. [n. d.]. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition.
- H. W. Kuhn. 1955. The Hungarian method for the assignment problem. Naval Research Logistics Quarterly 2, 1-2 (1955), 83–97. https://doi.org/10.1002/nav.3800020109 arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1002/nav.3800020109
- C. D. Manning and H Schütze. 1999. Foundations of Statistical Natural Language Processing. Foundations of Statistical Natural Language Processing.
- Cognitive adequacy in the measurement of melodic similarity: Algorithmic vs. human judgments. Computing in Musicology 13, 2003 (2004), 147–176.
- Daniel Müllensiefen and Marc Pendzich. 2009. Court decisions on music plagiarism and the predictive value of similarity algorithms. Musicae Scientiae 13, 1_suppl (2009), 257–295. https://doi.org/10.1177/102986490901300111 arXiv:https://doi.org/10.1177/102986490901300111
- The cuidado music browser: an end-to-end electronic music distribution system. Multimedia Tools and Applications 30, 3 (2006), 331–349.
- Music Similarity: Improvements of Edit-Based Algorithms by Considering Music Theory (MIR ’07). Association for Computing Machinery, New York, NY, USA, 135–142. https://doi.org/10.1145/1290082.1290103
- Adaptation of string matching algorithms for identification of near-duplicate music documents. In Workshop on Plagiarism Analysis, Authorship Identification, and Near-Duplicate Detection (PAN07). 37–43.
- DETECTING AND LOCATING PLAGIARISM OF MUSIC MELODIES BY PATH EXPLORATION OVER ABinary MASK. In CS & IT Conference Proceedings, Vol. 7. CS & IT Conference Proceedings.
- A. Tversky. 1988. Features of similarity. Readings in Cognitive Science 84, 4 (1988), 290–302.
- Deep neural network based instrument extraction from music. In 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2135–2139. https://doi.org/10.1109/ICASSP.2015.7178348
- Alexandra Uitdenbogerd and Justin Zobel. 1999. Melodic matching techniques for large music databases. Proceedings of the ACM International Multimedia Conference & Exhibition, 57–66. https://doi.org/10.1145/319463.319470
- Symbolic melodic similarity: State of the art and future challenges. Computer Music Journal 40, 2 (2016), 70–83.
- Pop909: A pop-song dataset for music arrangement generation. arXiv preprint arXiv:2008.07142 (2020).
- Shih-Lun Wu and Yi-Hsuan Yang. 2021. MuseMorphose: Full-Song and Fine-Grained Music Style Transfer with One Transformer VAE. arXiv preprint arXiv:2105.04090 (2021).
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.