ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation (2404.02710v1)

Published 3 Apr 2024 in cs.CL and eess.AS

Abstract: We introduce the Alternating Reading Task (ART) Corpus, a collection of dyadic sentence readings for studying entrainment and imitation behaviour in speech communication. The ART corpus features three experimental conditions (solo reading, alternating reading, and deliberate imitation) and three sub-corpora covering French-, Italian-, and Slovak-accented English. This design allows systematic investigation of speech entrainment in a controlled, less spontaneous setting. Alongside detailed transcriptions, the corpus includes English proficiency scores, demographics, and in-experiment questionnaires for probing linguistic, personal, and interpersonal influences on entrainment. Our presentation covers the corpus's design, collection, and annotation processes, an initial analysis, and future research prospects.
