Empirical evidence of Large Language Model's influence on human spoken communication
Abstract: AI agents now interact with billions of humans in natural language, thanks to advances in LLMs like ChatGPT. This raises the question of whether AI has the potential to shape a fundamental aspect of human culture: the way we speak. Recent analyses revealed that scientific publications already exhibit evidence of AI-specific language. But this evidence is inconclusive, since scientists may simply be using AI to copy-edit their writing. To explore whether AI has influenced human spoken communication, we transcribed and analyzed about 280,000 English-language videos of presentations, talks, and speeches from more than 20,000 YouTube channels of academic institutions. We find a significant shift in the trend of word usage specific to words distinctively associated with ChatGPT following its release. These findings provide the first empirical evidence that humans increasingly imitate LLMs in their spoken language. Our results raise societal and policy-relevant concerns about the potential of AI to unintentionally reduce linguistic diversity, or to be deliberately misused for mass manipulation. They also highlight the need for further investigation into the feedback loops between machine behavior and human culture.
- Toward a mechanistic psychology of dialogue. Behavioral and Brain Sciences, 27(2), 2004.
- Communication accommodation theory. In Theorizing about Intercultural Communication, pages 121–148. Sage, Thousand Oaks, USA, 2005.
- Penelope Eckert. Linguistic variation as social practice. Blackwell, Oxford, UK, 2000.
- Language acquisition meets language evolution. Cognitive Science, 34(7):1131–1157, 2010.
- David Crystal. The language revolution. Polity, Cambridge, UK, 2004.
- Chris Stokel-Walker. ChatGPT listed as author on research papers. Nature, 613(7945):620–621, 2023.
- Analysing the impact of ChatGPT in research. Applied Intelligence, 54(5):4172–4188, 2024.
- Mapping the increasing use of LLMs in scientific papers. In Proc. CoLM, pages 1–27, Amherst, USA, 2024. OpenReview.
- Is ChatGPT transforming academics’ writing style? In Proc. ICML NextGenAISafety Workshop, pages 1–14, Amherst, USA, 2024. OpenReview.
- Delving into ChatGPT usage in academic writing through excess vocabulary. arXiv, 2406.07016:1–13, 2024.
- Word embeddings quantify 100 years of gender and ethnic stereotypes. Proceedings of the National Academy of Sciences, 115(16):E3635–E3644, 2018.
- Large language models show human-like content biases in transmission chain experiments. Proceedings of the National Academy of Sciences, 120(44):e2313790120, 2023.
- Machine culture. Nature Human Behaviour, 7(11):1855–1868, 2023.
- Superhuman artificial intelligence can improve human decision-making by increasing novelty. Proceedings of the National Academy of Sciences, 120(12):e2214840120, 2023.
- The curious decline of linguistic diversity. In Findings of ACL NAACL, pages 3589–3604, Kerrville, USA, 2024. ACL.
- Research Organization Registry. ROR Data, 2024. https://doi.org/10.5281/zenodo.11186879.
- Shuyo Nakatani. Language detection library for Java, 2010. https://www.slideshare.net/slideshow/language-detection-library-for-java/6014274 (Accessed on July 31, 2024).
- Robust speech recognition via large-scale weak supervision. arXiv, 2212.04356:1–28, 2022.
- M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130–137, 1980.
- Introduction to Information Retrieval. Cambridge University Press, Cambridge, UK, 2008.
- R. Harald Baayen. Word Frequency Distributions, volume 18 of Text, Speech and Language Technology. Springer Netherlands, Dordrecht, Netherlands, 2001.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.