The Impact of Snippet Reliability on Misinformation in Online Health Search (2401.15720v1)
Abstract: Search result snippets are crucial in modern search engines, providing users with a quick overview of a website's content. Snippets help users determine the relevance of a document to their information needs, and in certain scenarios even enable them to satisfy those needs without visiting web documents. Hence, it is crucial for snippets to reliably represent the content of their corresponding documents. While this may be a straightforward requirement for some queries, it can become challenging in the complex domain of healthcare, and can lead to misinformation. This paper aims to examine snippets' reliability in representing their corresponding documents, specifically in the health domain. To achieve this, we conduct a series of user studies using Google's search results, where participants are asked to infer viewpoints of search results pertaining to queries about the effectiveness of a medical intervention for a medical condition, based solely on their titles and snippets. Our findings reveal that a considerable portion of Google's snippets (28%) failed to present any viewpoint on the intervention's effectiveness, and that 35% were interpreted by participants as having a different viewpoint compared to their corresponding documents. To address this issue, we propose a snippet extraction solution tailored directly to users' information needs, i.e., extracting snippets that summarize documents' viewpoints regarding the intervention and condition that appear in the query. User study demonstrates that our information need-focused solution outperforms the mainstream query-based approach. With only 19.67% of snippets generated by our solution reported as not presenting a viewpoint and a mere 20.33% misinterpreted by participants. These results strongly suggest that an information need-focused approach can significantly improve the reliability of extracted snippets in online health search.
- 1999. Acupuncture for chronic asthma. https://www.cochrane.org/CD000008/AIRWAYS_acupuncture-for-chronic-asthma.
- 2022. Control your snippets in search results. https://developers.google.com/search/docs/appearance/snippet.
- 2023. Cochrane Library: Cochrane Reviews. https://www.cochranelibrary.com/. https://www.cochranelibrary.com/
- 2023. Featured snippets and your website. https://developers.google.com/search/docs/appearance/featured-snippets.
- Improving search result summaries by using searcher behavior data. In Proceedings of the 36th international acm sigir conference on research and development in information retrieval. 13–22.
- Extractive snippet generation for arguments. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1969–1972.
- Investigating the Influence of Featured Snippets on User Attitudes. In Proceedings of the 2023 Conference on Human Information Interaction and Retrieval. 211–220.
- Featured Snippets and their Influence on Users’ Credibility Judgements. In ACM SIGIR Conference on Human Information Interaction and Retrieval. 113–122.
- Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual web search engine. Computer networks and ISDN systems 30, 1-7 (1998), 107–117.
- What is a Cochrane review? Epidemiology and psychiatric sciences 20, 3 (2011), 231–233.
- The influence of caption features on clickthrough patterns in web search. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. 135–142.
- Edward Cutrell and Zhiwei Guan. 2007. What are you looking for? An eye-tracking study of information usage in web search. In Proceedings of the SIGCHI conference on Human factors in computing systems. 407–416.
- Dave Davies. 2018. Meet the 7 most popular search engines in the world. Search Engine Journal (2018).
- Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
- Leaving so soon? Understanding and predicting web search abandonment rationales. In Proceedings of the 21st ACM international conference on Information and knowledge management. 1025–1034.
- A think-aloud study to understand factors affecting online health search. In Proceedings of the 2020 conference on human information interaction and retrieval. 273–282.
- Introducing rich snippets. Google Webmaster Central Blog (2009).
- Understanding and Mitigating Bias in Online Health Search. (2021), 265–274. https://doi.org/10.1145/3404835.3462930
- Not Just Skipping. Understanding the Effect of Sponsored Content on Users’ Decision-Making in Online Health Search. arXiv preprint arXiv:2207.04445 (2022).
- Automatic Web Page Annotation with Google Rich Snippets.. In OTM Conferences (2). 957–974.
- Tapas Kanungo and David Orr. 2009. Predicting the readability of short web summaries. In Proceedings of the Second ACM International Conference on Web Search and Data Mining. 202–211.
- Query-biased summary generation assisted by query expansion. Journal of the Association for Information Science and Technology 66, 5 (2015), 961–979.
- BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234–1240.
- Good abandonment in mobile and PC internet search. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval. 43–50.
- Understanding neural networks through representation erasure. arXiv preprint arXiv:1612.08220 (2016).
- Qing Li and Yuanzhu Peter Chen. 2010. Personalized text snippet extraction using statistical language models. Pattern Recognition 43, 1 (2010), 378–386.
- Marco Lippi and Paolo Torroni. 2016. Argumentation mining: State of the art and emerging trends. ACM Transactions on Internet Technology (TOIT) 16, 2 (2016), 1–25.
- A study of snippet length and informativeness: Behaviour, performance and user experience. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 135–144.
- Bruno Oliveira and Carla Teixeira Lopes. 2023. The Evolution of Web Search User Interfaces-An Archaeological Analysis of Google Search Engine Result Pages. In Proceedings of the 2023 Conference on Human Information Interaction and Retrieval. 55–68.
- Roselle for hypertension in adults. Cochrane Database of Systematic Reviews 11 (2021).
- The positive and negative influence of search results on people’s decisions about the efficacy of medical treatments. In Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval. 209–216.
- Okapi at TREC-3. Nist Special Publication Sp 109 (1995), 109.
- Summary attributes and perceived search quality. In Proceedings of the 16th international conference on World Wide Web. 1201–1202.
- Sofia Stamou and Efthimis N Efthimiadis. 2010. Interpreting user inactivity on search results. In European Conference on Information Retrieval. Springer, 100–113.
- Manfred Stede and Jodi Schneider. 2018. Argumentation mining. Synthesis Lectures on Human Language Technologies 11, 2 (2018), 1–191.
- Anastasios Tombros and Mark Sanderson. 1998. Advantages of query biased summaries in information retrieval. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval. 2–10.
- Peter C Wason. 1960. On the failure to eliminate hypotheses in a conceptual task. Quarterly journal of experimental psychology 12, 3 (1960), 129–140.
- Ryen White. 2013. Beliefs and biases in web search. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. 3–12.
- Ryen W White and Ahmed Hassan. 2014. Content bias in online health search. ACM Transactions on the Web (TWEB) 8, 4 (2014), 1–33.
- A task-oriented study on the influencing effects of query-biased summarisation in web searching. Information Processing & Management 39, 5 (2003), 707–733.
- Credibility assessment of good abandonment results in mobile search. Information Processing & Management 57, 6 (2020), 102350.
- Anat Hashavit (2 papers)
- Tamar Stern (2 papers)
- Hongning Wang (107 papers)
- Sarit Kraus (54 papers)