Exploring the Impact of ChatGPT on Wikipedia Engagement (2405.10205v3)
Abstract: Wikipedia is one of the most popular websites in the world, serving as a major source of information and a learning resource for millions of users. While motivations for its usage vary, prior research suggests that shallow information gathering (looking up facts and information or answering questions) dominates over more in-depth usage. On the 30th of November 2022, ChatGPT was released to the public and quickly became a popular source of information, serving as an effective question-answering and knowledge-gathering resource. Early indications suggest that it may be drawing users away from traditional question-answering services such as Stack Overflow, raising the question of how it may have affected Wikipedia. In this paper, we explore Wikipedia user metrics across four areas (page views, unique visitor numbers, edit counts and editor numbers) within twelve language instances of Wikipedia. We perform pairwise comparisons of these metrics before and after the release of ChatGPT and implement a panel regression model to observe and quantify longer-term trends. We find no evidence of a fall in engagement across any of the four metrics; instead, page views and visitor numbers increased in the period following ChatGPT's launch. However, the increase was smaller in languages where ChatGPT was available than in languages where it was not, which may suggest that ChatGPT's availability limited growth in those languages. Our results contribute to the understanding of how emerging generative AI tools are disrupting the Web ecosystem.