ChatGPT vs. Google: A Comparative Study of Search Performance and User Experience (2307.01135v1)

Published 3 Jul 2023 in cs.AI, cs.HC, and cs.IR

Abstract: The advent of ChatGPT, a LLM-powered chatbot, has prompted questions about its potential implications for traditional search engines. In this study, we investigate the differences in user behavior when employing search engines and chatbot tools for information-seeking tasks. We carry out a randomized online experiment, dividing participants into two groups: one using a ChatGPT-like tool and the other using a Google Search-like tool. Our findings reveal that the ChatGPT group consistently spends less time on all tasks, with no significant difference in overall task performance between the groups. Notably, ChatGPT levels user search performance across different education levels and excels in answering straightforward questions and providing general solutions but falls short in fact-checking tasks. Users perceive ChatGPT's responses as having higher information quality compared to Google Search, despite displaying a similar level of trust in both tools. Furthermore, participants using ChatGPT report significantly better user experiences in terms of usefulness, enjoyment, and satisfaction, while perceived ease of use remains comparable between the two tools. However, ChatGPT may also lead to overreliance and generate or replicate misinformation, yielding inconsistent results. Our study offers valuable insights for search engine management and highlights opportunities for integrating chatbot technologies into search engine designs.

PDF Abstract

A Comparative Analysis of ChatGPT and Google Search: Search Performance and User Experience

The paper authored by Ruiyun Xu, Yue Feng, and Hailiang Chen explores the comparative efficacy of ChatGPT and Google Search in fulfilling information-seeking tasks. The paper employs a randomized online experiment to draw distinctions in user behavior and perception when navigating these platforms. In this essay, we highlight the core findings and implications of this analysis, providing an expert perspective on its contributions to information retrieval research.

The experiment stratified participants into two groups, each using a simulated version of either ChatGPT or Google Search to complete three distinct search tasks of varying complexity. The tasks ranged from straightforward question answering to fact-checking challenges. A total of 95 participants were included in the final sample after filtering for data validity. One key outcome of the paper is the revelation that users spent significantly less time completing tasks with ChatGPT, without any substantial compromises in overall task performance compared to Google Search users.

An in-depth analysis indicates ChatGPT's proficiency in addressing straightforward queries, evidenced by superior performance in fact retrieval tasks. In contrast, the performance gap closes with more complex, fact-checking tasks, where ChatGPT exhibits shortcomings due to issues such as misinformation propagation and a lack of iterative query correction. This suggests that while ChatGPT is expedient, its propensity for providing inaccurate information remains a caveat necessitating user caution.

Interestingly, the paper underscores a leveling effect of ChatGPT on search performance across educational demographics. Compared to Google Search, where user performance positively correlates with educational attainment, ChatGPT equalizes performance irrespective of educational level, thereby reducing educational disparities in digital literacy.

Additionally, participants perceived higher information quality with ChatGPT, though trust levels in both platforms remained comparable. Users reported enhanced levels of satisfaction and enjoyment with ChatGPT, primarily attributed to its conversational interface and organized response format. However, these user experience benefits may simultaneously foster complacency towards cross-verifying information, particularly in contexts that require meticulous fact-checking.

From a theoretical standpoint, this research provides substantial insights into how LLM-powered chatbots may redefine parameters of user experience in search operations. Practically, it posits significant implications for search engine design, suggesting potential hybrid models integrating conversational interfaces to improve user engagement without losing the reliability associated with traditional search engines.

Despite its comprehensive nature, the research leaves fertile ground for further exploration. Future inquiries could examine how integration of conversational processes influences critical information-seeking behaviors across diverse population subsets. The findings also prompt questions regarding the persistence of ChatGPT's effectiveness across broader sets of search tasks and environments. Furthermore, an investigation into the long-term effects on user behavior and the search engine landscape could yield insights into optimizing AI-powered search utilities.

In conclusion, while the paper avoids hyperbolic assertions about ChatGPT's transformative potential, it contributes valuable empirical evidence to ongoing discourse around the evolving dynamics of search technology. It charts out potential pathways for future exploration in the domain of AI-integrated search solutions, emphasizing the need for balanced approaches that marry efficiency with accuracy in information dissemination.

PDF Markdown Bookmark Chat (Pro)

Authors (3)

Ruiyun Xu (1 paper)
Yue Feng (55 papers)
Hailiang Chen (15 papers)

Citations (27)

View on Semantic Scholar

Related Papers

Find Related Papers

YouTube

Show All Videos