Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling (2208.06350v1)

Published 12 Aug 2022 in cs.HC and cs.CL

Abstract: We present RealityTalk, a system that augments real-time live presentations with speech-driven interactive virtual elements. Augmented presentations leverage embedded visuals and animation for engaging and expressive storytelling. However, existing tools for live presentations often lack interactivity and improvisation, while creating such effects in video editing tools require significant time and expertise. RealityTalk enables users to create live augmented presentations with real-time speech-driven interactions. The user can interactively prompt, move, and manipulate graphical elements through real-time speech and supporting modalities. Based on our analysis of 177 existing video-edited augmented presentations, we propose a novel set of interaction techniques and then incorporated them into RealityTalk. We evaluate our tool from a presenter's perspective to demonstrate the effectiveness of our system.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Jian Liao (11 papers)
  2. Adnan Karim (2 papers)
  3. Shivesh Jadon (3 papers)
  4. Rubaiat Habib Kazi (9 papers)
  5. Ryo Suzuki (61 papers)
Citations (33)