Papers
Topics
Authors
Recent
Search
2000 character limit reached

Accurate Measures of Vaccination and Concerns of Vaccine Holdouts from Web Search Logs

Published 12 Jun 2023 in cs.CY and cs.AI | (2306.07457v1)

Abstract: To design effective vaccine policies, policymakers need detailed data about who has been vaccinated, who is holding out, and why. However, existing data in the US are insufficient: reported vaccination rates are often delayed or missing, and surveys of vaccine hesitancy are limited by high-level questions and self-report biases. Here, we show how large-scale search engine logs and machine learning can be leveraged to fill these gaps and provide novel insights about vaccine intentions and behaviors. First, we develop a vaccine intent classifier that can accurately detect when a user is seeking the COVID-19 vaccine on search. Our classifier demonstrates strong agreement with CDC vaccination rates, with correlations above 0.86, and estimates vaccine intent rates to the level of ZIP codes in real time, allowing us to pinpoint more granular trends in vaccine seeking across regions, demographics, and time. To investigate vaccine hesitancy, we use our classifier to identify two groups, vaccine early adopters and vaccine holdouts. We find that holdouts, compared to early adopters matched on covariates, are 69% more likely to click on untrusted news sites. Furthermore, we organize 25,000 vaccine-related URLs into a hierarchical ontology of vaccine concerns, and we find that holdouts are far more concerned about vaccine requirements, vaccine development and approval, and vaccine myths, and even within holdouts, concerns vary significantly across demographic groups. Finally, we explore the temporal dynamics of vaccine concerns and vaccine seeking, and find that key indicators emerge when individuals convert from holding out to preparing to accept the vaccine.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (85)
  1. Safety and efficacy of the bnt162b2 mrna covid-19 vaccine. New England Journal of Medicine, 383(27):2603–2615, 2020.
  2. Effectiveness of covid-19 vaccines against the b.1.617.2 (delta) variant. New England Journal of Medicine, 385(7):585–594, 2021.
  3. Data-driven real-time strategic placement of mobile vaccine distribution sites. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (IAAI’22), 2022.
  4. Identifying covid-19 vaccine deserts and ways to reduce them: A digital tool to support public health decision-making. American Journal of Public Health, 113(4):363–367, 2023.
  5. Considering emotion in covid-19 vaccine communication: Addressing vaccine hesitancy and fostering vaccine confidence. Health Communication, 35(14):1718–1722, 2020.
  6. Building public trust: a response to covid-19 vaccine hesitancy predicament. Journal of Public Health, 43(2):e291–e292, 2021.
  7. Behavioural nudges increase covid-19 vaccinations. Nature, 597:404–409, 2021.
  8. Evidence from a statewide vaccination rct shows the limits of nudges. Nature, 604:E1–E7, 2022.
  9. Digital public health interventions at scale: The impact of social media advertising on beliefs and outcomes related to covid vaccines. Proceedings of the National Academy of Science (PNAS), 120(5), 2023.
  10. Sharon LaFraniere. ‘very harmful’ lack of data blunts u.s. response to outbreaks. The New York Times, 2022. https://www.nytimes.com/2022/09/20/us/politics/covid-data-outbreaks.html.
  11. How cdc data problems put the u.s. behind on the delta variant. The Washington Post, 2021. https://www.washingtonpost.com/health/2021/08/18/cdc-data-delay-delta-variant/.
  12. Vaccination is local: Covid-19 vaccination rates vary by county and key characteristics. Kaiser Family Foundation (KFF), 2021. https://www.kff.org/coronavirus-covid-19/issue-brief/vaccination-is-local-covid-19-vaccination-rates-vary-by-county-and-key-characteristics/.
  13. Missing again: Us racial and ethnic data for covid-19 vaccination. The Lancet, 397(10281):1259–1260, 2021.
  14. State-level vaccine demographic data is messy and incomplete—we need federal data, now. The COVID Tracking Project, 2021. https://covidtracking.com/analysis-updates/state-level-vaccine-demographic-data-is-messy-and-incomplete.
  15. United States Census Bureau. Household pulse survey covid-19 vaccination tracker, 2021. https://www.census.gov/library/visualizations/interactive/household-pulse-survey-covid-19-vaccination-tracker.html.
  16. Vaccine hesitancy in the era of covid-19. Public Health, 194:245–251, 2021.
  17. Unrepresentative big surveys significantly overestimated us vaccine uptake. Nature, 600:695–700, 2021.
  18. Alaa Althubaiti. Information bias in health research: definition, pitfalls, and adjustment methods. Journal of Multidisciplinary Healthcare, 9:211–217, 2016.
  19. Seth Stephens-Davidowitz. The cost of racial animus on a black candidate: Evidence using google search data. Journal of Public Economics, 118:26–40, 2014.
  20. Comparison of self-report influenza vaccination coverage with data from a population based computerized vaccination registry and factors associated with discordance. Vaccine, 32(35):4386–4392, 2014.
  21. Understanding user behavior through log data and analysis. In Ways of Knowing in HCI, pages 349–372. Springer New York, New York, NY, 2014.
  22. Population-scale study of human needs during the covid-19 pandemic: Analysis and implications. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining (WSDM’21), page 4–12, 2021.
  23. Disparate impacts on online information access during the covid-19 pandemic. Nature Communications, 13(7094), 2022.
  24. Using search queries to understand health information needs in africa. In Proceedings of the Thirteenth International AAAI Conference on Web and Social Media (ICWSM ’19), 2019.
  25. Diagnoses, decisions, and outcomes: Web search as decision support for cancer. In Proceedings of the 24th international conference on World Wide Web (WWW’15), 2015.
  26. Search and breast cancer: On episodic shifts of attention over life histories of an illness. ACM Transactions on the Web, 10(2), 2016.
  27. Exploring time-dependent concerns about pregnancy and childbirth from search logs. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI’15), page 737–746, 2015.
  28. From cookies to cooks: Insights on dietary patterns via analysis of web usage logs. In Proceedings of the 22nd international conference on World Wide Web (WWW’13), page 1399–1410, 2013.
  29. Cyberchondria: Studies of the escalation of medical concerns in web search. ACM Transactions on Information Systems, 27(4), 2009.
  30. Covid-19 vaccine hesitancy on social media: Building a public twitter data set of antivaccine content, vaccine misinformation, and conspiracies. JMIR Public Health and Surveillance, 7(11), 2021.
  31. Online misinformation is linked to early covid-19 vaccination hesitancy and refusal. Scientific Reports, 12(5955), 2022.
  32. Winds of change: Impact of covid-19 on vaccine-related opinions of twitter users. In Proceedings of the 16th International AAAI Conference on Web and Social Media (ICWSM’22), 2022.
  33. Covid-19 vaccine hesitancy linked to increased internet search queries for side effects on fertility potential in the initial rollout phase following emergency use authorization. Andrologia, 53(9), 2021.
  34. Google covid-19 vaccination search insights: Anonymization process description. arXiv, 2021.
  35. Vaccine search patterns provide insights into vaccination intent. arXiv, 2021.
  36. Predicting depression via social media. In Proceedings of the 7th International AAAI Conference on Web and Social Media (ICWSM’13), 2013.
  37. Detecting influenza epidemics using search engine query data. Nature, 457:1012–1014, 2009.
  38. Social data: Biases, methodological pitfalls, and ethical boundaries. Frontiers in Big Data, 2, 2019.
  39. The parable of google flu: Traps in big data analysis. Science, 343:1203–1205, 2014.
  40. Inferring query intent from reformulations and clicks. In Proceedings of the 19th International Conference on World Wide Web (WWW’10), 2010.
  41. Random walks on the click graph. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR ’07), 2007.
  42. Learning query intent from regularized click graphs. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR ’08), 2008.
  43. The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems, 1998.
  44. Community membership identification from small seed sets. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’14), page 1366–1375, 2014.
  45. Semi-supervised classification with graph convolutional networks. In Proceedings of the 5th International Conference on Learning Representations (ICLR ’17), 2017.
  46. See who has been vaccinated so far in new york city. The New York Times, 2021. https://www.nytimes.com/interactive/2021/03/26/nyregion/nyc-vaccination-rates-map.html.
  47. Factors Associated With US Adults’ Likelihood of Accepting COVID-19 Vaccination. JAMA Network Open, 3(10):e2025594–e2025594, 2020.
  48. Predictors of covid-19 vaccine acceptance, intention, and hesitancy: A scoping review. Frontiers in Public Health, 9, 2021.
  49. Covid-19 vaccine hesitancy in the united states: A systematic review. Frontiers in Public Health, 9, 2021.
  50. Berkeley Lovelace Jr. Cdc expands covid vaccination guidelines to everyone 65 and older. CNBC, 2021. https://www.cnbc.com/2021/01/12/covid-vaccine-trump-administration-to-expand-eligibility-to-everyone-65-and-older.html.
  51. Growing number of republicans urge vaccinations amid delta surge. The Washington Post, 2021. https://www.washingtonpost.com/politics/growing-number-of-republicans-urge-vaccinations-amid-delta-surge/2021/07/20/52a06e9c-e999-11eb-8950-d73b3e93ff7f_story.html.
  52. Lydia Saad. More in u.s. vaccinated after delta surge, fda decision. Gallup, 2021. https://news.gallup.com/poll/355073/vaccinated-delta-surge-fda-decision.aspx.
  53. Newsguard. Rating process and criteria. https://www.newsguardtech.com/ratings/rating-process-criteria/.
  54. Alex jones is told to stop selling sham anti-coronavirus toothpaste. The New York Times, 2020. https://www.nytimes.com/2020/03/13/nyregion/alex-jones-coronavirus-cure.html.
  55. Julian E. Barnes. Russian disinformation targets vaccines and the biden administration. The New York Times, 2021. https://www.nytimes.com/2021/08/05/us/politics/covid-vaccines-russian-disinformation.html.
  56. Sheera Frenkel. The most influential spreader of coronavirus misinformation online. The New York Times, 2021. https://www.nytimes.com/2021/07/24/technology/joseph-mercola-coronavirus-misinformation-online.html.
  57. Measuring the impact of covid-19 vaccine misinformation on vaccination intent in the uk and usa. Nature Human Behaviour, 5:337–348, 2021.
  58. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008.
  59. Alexis Benveniste. New york city will require vaccines for entry to restaurants and gyms. CNN Business, 2021. https://www.cnn.com/2021/08/03/business/new-york-city-vaccine-requirements/index.html.
  60. Tom Tapp. Los angeles city council votes 13-0 to create vaccination requirement for indoor public spaces such as restaurants, movie theaters, concert venues. Deadline, 2021. https://deadline.com/2021/08/los-angeles-city-requires-vaccination-vaccine-indoors-1234813086/.
  61. Myths and facts about covid-19 vaccines, 2023. https://www.cdc.gov/coronavirus/2019-ncov/vaccines/facts.html.
  62. Kaia Hubbard. Want free beer or a chance at $1 million? get your covid-19 vaccine. US News, 2021. https://www.usnews.com/news/best-states/articles/2021-05-07/states-cities-and-companies-offer-incentives-to-get-covid-19-vaccine.
  63. Noah Weiland. One and done: Why people are eager for johnson & johnson’s vaccine. The New York Times, 2021. https://www.nytimes.com/2021/03/04/health/covid-vaccine-johnson-and-johnson-rollout.html.
  64. Bob Curley. Why some people still prefer the johnson & johnson covid-19 vaccine. Healthline, 2021. https://www.healthline.com/health-news/why-some-people-still-prefer-the-johnson-johnson-covid-19-vaccine.
  65. StatCounter. Desktop search engine market share united states of america, jan - dec 2021. https://gs.statcounter.com/search-engine-market-share/desktop/united-states-of-america/2021.
  66. Covid-19 vaccinations in the united states,jurisdiction. https://data.cdc.gov/Vaccinations/COVID-19-Vaccinations-in-the-United-States-Jurisdi/unsk-b7fc.
  67. Covid-19 vaccinations in the united states,county. https://data.cdc.gov/Vaccinations/COVID-19-Vaccinations-in-the-United-States-County/8xkx-amqh.
  68. United States Census Bureau. Zip code tabulation areas (zctas). https://www.census.gov/programs-surveys/geography/guidance/geo-areas/zctas.html.
  69. United States Census Bureau. American community survey data. https://www.census.gov/programs-surveys/acs/data.html.
  70. United States Census Bureau. 2020 zip code tabulation area (zcta) relationship file record layouts. https://www.census.gov/programs-surveys/geography/technical-documentation/records-layout/2020-zcta-record-layout.html.
  71. United States Census Bureau. Tiger/line shapefiles. https://www.census.gov/cgi-bin/geo/shapefiles/index.php.
  72. Dave Leip’s Atlas of U.S. Elections. Store - election data. https://uselectionatlas.org/BOTTOM/store_data.php.
  73. Google. Google trends. https://trends.google.com/trends/?geo=US.
  74. Pharmacies participating in the federal retail pharmacy program. https://www.cdc.gov/vaccines/covid-19/retail-pharmacy-program/participating-pharmacies.html.
  75. Learning from positive and unlabeled data: a survey. Machine Learning, 109:719–760, 2020.
  76. Machine learning for social science: An agnostic approach. Annual Review of Political Science, 24:395–419, 2021.
  77. Computational analysis of 140 years of us political speeches reveals more positive but increasingly polarized framing of immigration. Proceedings of the National Academy of Science (PNAS), 119(31), 2022.
  78. Researcher reasoning meets computational capacity: Machine learning for social science. Social Science Research, 108:102807, 2022.
  79. Detecting disparities in police deployments using dashcam data. In Proceedings of the 6th ACM Conference on Fairness, Accountability, and Transparency 2023 (FAccT’23), 2023.
  80. Roderick J. A. Little. Post-stratification: A modeler’s perspective. Journal of the American Statistical Association, 88(423):1001–1012, 1993.
  81. Adults in all u.s. states are now eligible for vaccination, hitting biden’s target. half have had at least one dose. The New York Times, 2021. https://www.nytimes.com/2021/04/19/world/adults-eligible-covid-vaccine.html.
  82. An n5/2superscript𝑛52n^{5/2}italic_n start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT algorithm for maximum matchings in bipartite graphs. SIAM Journal on Computing, 2(4):225–231, 1973.
  83. Vaccine adverse event reporting system. https://vaers.hhs.gov/.
  84. Statistics in medicine: Calculating confidence intervals for regression and correlation. British Medical Journal (Clinical research Ed.), 296(6631):1238–1242, 1988.
  85. Racial/ethnic disparities in state-level covid-19 vaccination rates and their association with structural racism. Journal of Racial and Ethnic Health Disparities, 9(6):2361–2374, 2022.

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.