Curious Rhythms: Temporal Regularities of Wikipedia Consumption (2305.09497v4)
Abstract: Wikipedia, in its role as the world's largest encyclopedia, serves a broad range of information needs. Although previous studies have noted that Wikipedia users' information needs vary throughout the day, there is to date no large-scale, quantitative study of the underlying dynamics. The present paper fills this gap with a large-scale analysis of temporal regularities in daily consumption patterns, based on billions of timezone-corrected page requests mined from English Wikipedia's server logs, with the goal of understanding how context and time relate to the kind of information consumed. First, we show that even after removing the global pattern of day-night alternation, the consumption habits of individual articles maintain strong diurnal regularities. Then, we characterize the prototypical shapes of consumption patterns, finding a particularly strong distinction between articles preferred during the evening/night and articles preferred during working hours. Finally, we investigate topical and contextual correlates of Wikipedia articles' access rhythms, finding that article topic, reader country, and access device (mobile vs. desktop) are all important predictors of daily attention patterns. These findings shed new light on how humans seek information on the Web by focusing on Wikipedia, one of the largest open platforms for knowledge and learning. They emphasize Wikipedia's role as a rich knowledge base that fulfills information needs spread throughout the day, with implications for understanding information seeking across the globe and for designing appropriate information systems.