Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems (2309.05680v2)

Published 9 Sep 2023 in cs.HC, cs.AI, and cs.SE

Abstract: Chatbots, the common moniker for collaborative assistants, are AI software that enables people to naturally interact with them to get tasks done. Although chatbots have been studied since the dawn of AI, they have particularly caught the imagination of the public and businesses since the launch of easy-to-use and general-purpose LLM-based chatbots like ChatGPT. As businesses look towards chatbots as a potential technology to engage users, who may be end customers, suppliers, or even their own employees, proper testing of chatbots is important to address and mitigate issues of trust related to service or product performance, user satisfaction and long-term unintended consequences for society. This paper reviews current practices for chatbot testing, identifies gaps as open problems in pursuit of user trust, and outlines a path forward.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Biplav Srivastava (57 papers)
  2. Kausik Lakkaraju (13 papers)
  3. Tarmo Koppel (2 papers)
  4. Vignesh Narayanan (20 papers)
  5. Ashish Kundu (36 papers)
  6. Sachindra Joshi (32 papers)
Citations (1)