Towards Proactive Interactions for In-Vehicle Conversational Assistants Utilizing Large Language Models (2403.09135v1)

Published 14 Mar 2024 in cs.HC

Abstract: Research demonstrates that the proactivity of in-vehicle conversational assistants (IVCAs) can help to reduce distractions and enhance driving safety, better meeting users' cognitive needs. However, existing IVCAs struggle with user intent recognition and context awareness, which leads to suboptimal proactive interactions. Large language models (LLMs) have shown potential for generalizing to various tasks with prompts, but their application in IVCAs and their use for proactive interaction remain under-explored. These gaps raise questions about how LLMs improve proactive interactions for IVCAs and influence user perception. To investigate these questions systematically, we establish a framework for IVCAs with five proactivity levels across two dimensions, assumption and autonomy. Based on this framework, we propose a "Rewrite + ReAct + Reflect" strategy, aiming to empower LLMs to fulfill the specific demands of each proactivity level when interacting with users. We conduct both feasibility and subjective experiments. The LLM outperforms the state-of-the-art model in success rate and achieves satisfactory results for each proactivity level. Subjective experiments with 40 participants validate the effectiveness of our framework and show that the proactivity level with strong assumptions and user confirmation is the most appropriate.
