Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
175 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Can Authorship Attribution Models Distinguish Speakers in Speech Transcripts? (2311.07564v3)

Published 13 Nov 2023 in cs.CL and cs.LG

Abstract: Authorship verification is the task of determining if two distinct writing samples share the same author and is typically concerned with the attribution of written text. In this paper, we explore the attribution of transcribed speech, which poses novel challenges. The main challenge is that many stylistic features, such as punctuation and capitalization, are not informative in this setting. On the other hand, transcribed speech exhibits other patterns, such as filler words and backchannels (e.g., 'um', 'uh-huh'), which may be characteristic of different speakers. We propose a new benchmark for speaker attribution focused on human-transcribed conversational speech transcripts. To limit spurious associations of speakers with topic, we employ both conversation prompts and speakers participating in the same conversation to construct verification trials of varying difficulties. We establish the state of the art on this new benchmark by comparing a suite of neural and non-neural baselines, finding that although written text attribution models achieve surprisingly good performance in certain settings, they perform markedly worse as conversational topic is increasingly controlled. We present analyses of the impact of transcription style on performance as well as the ability of fine-tuning on speech transcripts to improve performance.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. Overview of PAN 2020: Authorship verification, celebrity profiling, profiling fake news spreaders on Twitter, and style change detection. In Experimental IR Meets Multilinguality, Multimodality, and Interaction, pages 372–383. Springer International Publishing.
  2. Explainable authorship verification in social media via attention-based similarity learning. In IEEE International Conference on Big Data (Big Data), pages 36–45.
  3. The Fisher Corpus: A resource for the next generations of speech-to-text.
  4. Mark my words! Linguistic style accommodation in social media. In Proceedings of the 20th International Conference on World Wide Web, pages 745–754.
  5. Learning stylometric representations for authorship analysis. IEEE Transactions on Cybernetics, 49(1):107–121.
  6. Starkey Duncan. 1974. On the structure of speaker-auditor interaction during speaking turns. Language in Society, 3(2):161–180.
  7. Speaker anonymization using X-vector and neural waveform models. eess.AS/1905.13561v1.
  8. Communication accommodation theory: Past accomplishments, current trends, and future prospects. Language Sciences, 99.
  9. Erica Gold and John Peter French. 2019. International practices in forensic speaker comparisons: Second survey. International Journal of Speech, Language and the Law, 26:1–20.
  10. Identification of speakers in novels. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1312–1320, Sofia, Bulgaria. Association for Computational Linguistics.
  11. A deep metric learning approach to account linking. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 5275–5287, Online. Association for Computational Linguistics.
  12. Quick transcription of Fisher data with WordWave.
  13. David D. Lewis. 1997. Reuters-21578 text categorization test collection, Distribution 1.0. AT&T Labs-Research.
  14. Frederick Mosteller and David L. Wallace. 1964. Inference and Disputed Authorship: The Federalist. Addison-Wesley, Reading, MA.
  15. Maryam Najafi and Ehsan Tavan. 2022. Text-to-text transformer in authorship verification via stylistic and semantical analysis. In Notebook for PAN at CLEF 2022, CLEF 2022–Conference and Labs of the Evaluation Forum.
  16. Vocal accommodation in speech communication. Journal of Phonetics, 95.
  17. Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-Networks. pages 3982–3992. Association for Computational Linguistics.
  18. Learning universal authorship representations. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 913–919. Association for Computational Linguistics.
  19. Harvey Sacks. 1992. Lectures on Conversation, volume 1. Blackwell.
  20. Nelleke Scheijen. 2020. Forensic speaker recognition: Based on text analysis of transcribed speech fragments. Master’s thesis, Delft University of Technology.
  21. An overview of voice conversion and its challenges: From statistical modeling to deep learning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:132–157.
  22. X-vectors: Robust DNN embeddings for speaker recognition. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 5329–5333.
  23. Efstathios Stamatatos. 2018. Masking topic-related information to enhance authorship attribution. Journal of the Association for Information Science and Technology, 69(3):461–473.
  24. Overview of the authorship verification task at PAN 2023. In CLEF 2023: Conference and Labs of the Evaluation Forum.
  25. HANSEN: Human and AI spoken text benchmark for authorship analysis.
  26. Can Authorship Representation Learning Capture Stylistic Features? Transactions of the Association for Computational Linguistics, 11:1416–1431.
  27. Dominic Watt and Georgina Brown. 2020. Forensic phonetics and automatic speaker recognition: The complementarity of human- and machine-based forensic speaker comparison, chapter 25. Routledge.
  28. Same author or just same topic? Towards content-independent style representations. In Proceedings of the 7th Workshop on Representation Learning for NLP, pages 249–268, Dublin, Ireland. Association for Computational Linguistics.
Citations (2)

Summary

We haven't generated a summary for this paper yet.

Youtube Logo Streamline Icon: https://streamlinehq.com