Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Comparative Analysis of Poetry Reading Audio: Singing, Narrating, or Somewhere In Between? (2404.00789v1)

Published 31 Mar 2024 in cs.SD and eess.AS

Abstract: This paper provides a computational analysis of poetry reading audio signals at a large scale to unveil the musicality within professionally-read poems. Although the acoustic characteristics of other types of spoken language have been extensively studied, most of the literature is limited to narrative speech or singing voice, discussing how different they are from each other. In this work, we develop signal processing methods, which are tailored to capture the unique acoustic characteristics of poetry reading based on their silence patterns, temporal variations of local pitch, and beat stability. Our large-scale statistical analyses on three big corpora, each of which consists of narration (LibriSpeech), singing voice (Intonation), and poetry reading (from The Poetry Foundation), discover that poetry reading does share some musical characteristics with singing voice, although it may also resemble narrative speech.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. “US trends in arts attendance and literary reading: 2002–2017,” 2018.
  2. S. Iyengar, “New Survey Reports Size of Poetry’s Audience, Streaming Included,” 2023, Accessed: September 5, 2023.
  3. Author’s Name, “Behind The Grammy: Best Spoken Word Poetry Album Roundtable, New Category,” 2023.
  4. S. Nooteboom et al., “The prosody of speech: melody and rhythm,” The handbook of phonetic sciences, vol. 5, pp. 640–673, 1997.
  5. N. Francis, “Verbal art across language and culture: poetry as music,” Neohelicon, vol. 48, no. 2, pp. 539–552, 2021.
  6. A. E. Harvey, “The classification of Greek lyric poetry,” The Classical Quarterly, vol. 5, no. 3-4, pp. 157–175, 1955.
  7. Y. Hou and A. Frank, “Analyzing sentiment in classical chinese poetry,” in Proceedings of the 9th SIGHUM workshop on language Technology for Cultural Heritage, social sciences, and humanities (LaTeCH), 2015, pp. 15–24.
  8. “Melody in poems and songs: fundamental statistical properties predict aesthetic evaluation,” Psychology of Aesthetics, Creativity, and the Arts, vol. 17, no. 2, pp. 163, 2023.
  9. “Developmental changes in the categorization of speech and song,” Developmental Science, vol. 26, no. 5, pp. e13346, 2023.
  10. “Similarities and differences in a global sample of song and speech recordings [Stage 1 Registered Report],” 2022.
  11. G. List, “The boundaries of speech and song,” Ethnomusicology, vol. 7, no. 1, pp. 1–16, 1963.
  12. “Comparing the rhythm and melody of speech and music: The case of British English and French,” The Journal of the Acoustical Society of America, vol. 119, no. 5, pp. 3034–3047, 2006.
  13. “A computational approach to style in american poetry,” in Seventh IEEE International Conference on Data Mining (ICDM 2007). IEEE, 2007, pp. 553–558.
  14. J. Kao and D. Jurafsky, “A computational analysis of style, affect, and imagery in contemporary poetry,” in Proceedings of the NAACL-HLT 2012 workshop on computational linguistics for literature, 2012, pp. 8–17.
  15. “Automated analysis of Bangla poetry for classification and poet identification,” in Proceedings of the 12th International Conference on Natural Language Processing, 2015, pp. 247–253.
  16. J. Kaur and J. R. Saini, “Designing Punjabi poetry classifiers using machine learning and different textual features,” Int. Arab J. Inf. Technol., vol. 17, no. 1, pp. 38–44, 2020.
  17. K. Choi, “Computational thematic analysis of poetry via bimodal large language models,” in the Association for Information Science and Technology (ASIS&T), 2023.
  18. A. Singhi and D. G. Brown, “Are poetry and lyrics all that different?,” in Proceedings of the 15th International Society for Music Information Retrieval Conference (ISMIR), 2014, pp. 471–476.
  19. “Librispeech: An ASR corpus based on public domain audio books,” in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015, pp. 5206–5210.
  20. “Intonation: A dataset of quality vocal performances refined by spectral clustering on pitch congruence,” in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019.
  21. “Style detection for free verse poetry from text and speech,” in Proceedings of the 27th International Conference on Computational Linguistics, 2018, pp. 1929–1940.
  22. “WhisperX: Time-Accurate Speech Transcription of Long-Form Audio,” in Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech), 2023.
  23. “Robust speech recognition via large-scale weak supervision,” in International Conference on Machine Learning. PMLR, 2023, pp. 28492–28518.
  24. M. Mauch and S. Dixon, “PYIN: A fundamental frequency estimator using probabilistic threshold distributions,” 2014, pp. 659–663.
  25. D. P. W. Ellis, “Beat tracking by dynamic programming,” Journal of New Music Research, vol. 36, no. 1, pp. 51–60, 2007.
  26. “Librosa: Audio and music signal analysis in python,” in Proceedings of the 14th python in science conference, 2015, vol. 8, pp. 18–25.
  27. P. Masri, Computer Modeling of Sound for Transformation and Synthesis of Musical Signals, Ph.D. thesis, University of Bristol, UK, 1996.
  28. G. L. Dillon, “Clause, pause, and punctuation in poetry,” 1976.
  29. Z. Lissa, “Aesthetic functions of silence and rests in music,” The Journal of Aesthetics and Art Criticism, vol. 22, no. 4, pp. 443–454, 1964.
  30. F. J. Massey, “The Kolmogorov-Smirnov Test for Goodness of Fit,” Journal of the American Statistical Association, vol. 46, no. 253, pp. 68–78, 1951.
  31. L. V. Kantorovich, “Mathematical methods of organizing and planning production,” Management Science, vol. 6, no. 4, pp. 366–422, 1960.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Kahyun Choi (2 papers)
  2. Minje Kim (53 papers)