Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
133 tokens/sec
GPT-4o
12 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Building for Speech: Designing the Next Generation of Social Robots for Audio Interaction (2311.01146v2)

Published 2 Nov 2023 in cs.HC

Abstract: There have been incredible advancements in robotics and spoken dialogue systems (SDSs) over the past few years, yet we still don't find social robots in public spaces like train stations, shopping malls, or hospital waiting rooms. In this paper, we argue that early-stage collaboration between robot designers and SDS researchers is crucial to create social robots that can legitimately be used in real-world environments. We draw from our experiences running experiments with social robots, and the surrounding literature, to highlight recurring issues. Robots need more speakers, more microphones, quieter motors, and quieter fans to enable human-robot spoken interaction in the wild and improve accessibility. More robust robot joints are also needed to limit potential harm to older adults and other more vulnerable groups.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Angus Addlesee. 2022. Securely Capturing People’s Interactions with Voice Assistants at Home: A Bespoke Tool for Ethical Data Collection. In Proceedings of the Second Workshop on NLP for Positive Impact (NLP4PI).
  2. Angus Addlesee. 2023. Voice Assistant Accessibility. In The International Workshop on Spoken Dialogue Systems Technology, IWSDS 2023.
  3. Angus Addlesee and Marco Damonte. 2023a. Understanding and Answering Incomplete Questions. In Proceedings of the 5th Conference on Conversational User Interfaces.
  4. Angus Addlesee and Marco Damonte. 2023b. Understanding Disrupted Sentences Using Underspecified Abstract Meaning Representation. In Interspeech.
  5. Data Collection for Multi-party Task-based Dialogue in Social Robotics. In The International Workshop on Spoken Dialogue Systems Technology, IWSDS 2023.
  6. A comprehensive evaluation of incremental speech recognition and diarization for conversational AI. In Proceedings of the 28th International Conference on Computational Linguistics.
  7. Beamforming Techniques for Multichannel audio Signal Separation. arXiv:1212.6080 [cs.OH]
  8. Do as i can, not as i say: Grounding language in robotic affordances. arXiv preprint arXiv:2204.01691 (2022).
  9. Why is My Social Robot so Slow? How a Conversational Listener can Revolutionize Turn-Taking. In Proceedings of the 5th Conference on Conversational User Interfaces.
  10. Michael Calore. 2019. Review: Apple Homepod. Wired (2019).
  11. A survey of the development of quadruped robots: Joint configuration, dynamic locomotion control method and mobile manipulation approach. Biomimetic Intelligence and Robotics 2, 1 (2022).
  12. MuMMER: Socially Intelligent Human-Robot Interaction in Public Spaces. arXiv:1909.06749 [cs.RO]
  13. James Glass. 1999. Challenges for spoken dialogue systems. In Proceedings of the 1999 IEEE ASRU Workshop, Vol. 696. MIT Laboratory fot Computer Science Cambridge, MA, USA.
  14. A Visually-Aware Conversational Robot Receptionist. In Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue.
  15. Linh Hahkio. 2020. Service robots’ feasibility in the hotel industry: A case study of Hotel Presidentti. (2020).
  16. A hybrid framework for ego noise cancellation of a robot. In 2010 IEEE International Conference on Robotics and Automation. IEEE.
  17. Studying drink-serving service robots in the real world. In 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN). IEEE.
  18. Oliver Lemon. 2022. Conversational AI for multi-agent communication in Natural Language. AI Communications Preprint (2022).
  19. Development of microphone-array-embedded UAV for search and rescue task. In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE.
  20. Robotic agents used to help teach social skills to children with autism: the third generation. In 2011 RO-MAN. IEEE.
  21. Voice interfaces in everyday life. In proceedings of the 2018 CHI conference on human factors in computing systems.
  22. Social Robots as Coaches: How Human-Robot Interaction Positively Impacts Motivation in Sports Training Sessions. In 2022 31st IEEE International Conference on Robot and Human Interactive Communication (RO-MAN). IEEE.
  23. A novel ego-noise suppression algorithm for acoustic signal enhancement in autonomous systems. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE.
  24. Raimond Spekking. 2021. Amazon Echo Dot (RS03QR) - LED and microphone board. Wikimedia (2021).
  25. Situated participatory design: A method for in situ design of robotic interaction with older adults. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems.
  26. How does modality matter? investigating the synthesis and effects of multi-modal robot behavior on social intelligence. International Journal of Social Robotics 14, 4 (2022).
  27. Sebastian Thrun. 1998. When robots meet people. IEEE Intelligent Systems and their Applications 13, 3 (1998).
  28. Predictive Models for Robot Ego-Noise Learning and Imitation. In 2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob). IEEE.
  29. Comparing Multi-User Interaction Strategies in Human-Robot Teamwork. In The International Workshop on Spoken Dialogue Systems Technology, IWSDS 2023.
  30. How to Address Humans: System Barge-In in Multi-user HRI. 147–152. https://doi.org/10.1007/978-981-15-9323-9_13
  31. Chris Welch. 2023. Apple’s new HomePod unsurprisingly sounds close to the original. The Verge (2023).
  32. Jason D Williams. 2009. Spoken dialogue systems: Challenges, and opportunities for research.. In ASRU.
  33. Mark Wilson. 2020. Why Amazon radically redesigned the Echo. Fast Company (2020).
  34. Autonomous Ground Navigation in Highly Constrained Spaces: Lessons Learned From the Benchmark Autonomous Robot Navigation Challenge at ICRA 2022 [Competitions]. IEEE Robotics & Automation Magazine 29, 4 (2022).
Citations (3)

Summary

We haven't generated a summary for this paper yet.