How Does Conversation Length Impact User's Satisfaction? A Case Study of Length-Controlled Conversations with LLM-Powered Chatbots (2404.17025v1)
Abstract: Users can discuss a wide range of topics with LLMs, but they do not always prefer solving problems or getting information through lengthy conversations. This raises an intriguing HCI question: How does instructing LLMs to engage in longer or shorter conversations affect conversation quality? In this paper, we developed two Slack chatbots using GPT-4 with the ability to vary conversation lengths and conducted a user study. Participants asked the chatbots both highly and less conversable questions, engaging in dialogues with 0, 3, 5, and 7 conversational turns. We found that the conversation quality does not differ drastically across different conditions, while participants had mixed reactions. Our study demonstrates LLMs' ability to change conversation length and the potential benefits for users resulting from such changes, but we caution that changes in text form may not necessarily imply changes in quality or content.
- Theo Araujo. 2018. Living up to the chatbot hype: The influence of anthropomorphic design cues and communicative agency framing on conversational agent and company perceptions. Computers in Human Behavior 85 (2018), 183–189.
- Chatbots language design: The influence of language variation on user experience with tourist assistant chatbots. ACM Transactions on Computer-Human Interaction 29, 2 (2022), 1–38.
- Creating a chatbot for and with migrants: chatbot personality drives co-design activities. In Proceedings of the 2020 acm designing interactive systems conference. 219–230.
- Exploring language style in chatbots to increase perceived product value and user engagement. In Proceedings of the 2019 Conference on Human Information Interaction and Retrieval. 301–305.
- Towards Designing Cooperative and Social Conversational Agents for Customer Service.. In ICIS. 1–13.
- Herbert P Grice. 1975. Logic and conversation. In Speech acts. Brill, 41–58.
- Real conversations with artificial intelligence: A comparison between human–human online conversations and human–chatbot conversations. Computers in human behavior 49 (2015), 245–250.
- Touch your heart: A tone-aware chatbot for customer care on social media. In Proceedings of the 2018 CHI conference on human factors in computing systems. 1–12.
- What Types of Questions Require Conversation to Answer? A Case Study of AskReddit Questions. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems. 1–9.
- Evorus: A crowd-powered conversational assistant built to automate itself over time. In Proceedings of the 2018 CHI conference on human factors in computing systems. 1–13.
- The impact of language style accommodation during social media interactions on brand trust. Journal of Service Management 28, 3 (2017), 418–441.
- COBART: Controlled, optimized, bidirectional and auto-regressive transformer for ad headline generation. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3127–3136.
- Crowd AI Lab. 2023. What Types of Questions Require Conversation to Answer? AskReddit Questions Dataset. https://github.com/Crowd-AI-Lab/AskReddit.
- Kellie Morrissey and Jurek Kirakowski. 2013. ‘Realness’ in chatbots: establishing quantifiable criteria. In Human-Computer Interaction. Interaction Modalities and Techniques: 15th International Conference, HCI International 2013, Las Vegas, NV, USA, July 21-26, 2013, Proceedings, Part IV 15. Springer, 87–96.
- Vidya Setlur and Melanie Tory. 2022. How do you converse with an analytical chatbot? revisiting gricean maxims for designing analytical conversational behavior. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–17.
- Plug and Play Conversations: The Micro-Conversation Scheme for Modular Development of Hybrid Conversational Agent. In Companion Publication of the 2023 Conference on Computer Supported Cooperative Work and Social Computing. 50–55.
- The ethnobot: Gathering ethnographies in the age of IoT. In Proceedings of the 2018 CHI conference on human factors in computing systems. 1–13.
- A survey of controllable text generation using transformer-based pre-trained language models. Comput. Surveys 56, 3 (2023), 1–37.