2000 character limit reached
EmBARDiment: an Embodied AI Agent for Productivity in XR (2408.08158v2)
Published 15 Aug 2024 in cs.HC and cs.MA
Abstract: XR devices running chat-bots powered by LLMs have the to become always-on agents that enable much better productivity scenarios. Current screen based chat-bots do not take advantage of the the full-suite of natural inputs available in XR, including inward facing sensor data, instead they over-rely on explicit voice or text prompts, sometimes paired with multi-modal data dropped as part of the query. We propose a solution that leverages an attention framework that derives context implicitly from user actions, eye-gaze, and contextual memory within the XR environment. Our work minimizes the need for engineered explicit prompts, fostering grounded and intuitive interactions that glean user insights for the chat-bot.