VirtuWander: Enhancing Multi-modal Interaction for Virtual Tour Guidance through Large Language Models (2401.11923v2)
Abstract: Tour guidance in virtual museums encourages multi-modal interactions to boost user experiences, concerning engagement, immersion, and spatial awareness. Nevertheless, achieving the goal is challenging due to the complexity of comprehending diverse user needs and accommodating personalized user preferences. Informed by a formative study that characterizes guidance-seeking contexts, we establish a multi-modal interaction design framework for virtual tour guidance. We then design VirtuWander, a two-stage innovative system using domain-oriented LLMs to transform user inquiries into diverse guidance-seeking contexts and facilitate multi-modal interactions. The feasibility and versatility of VirtuWander are demonstrated with virtual guiding examples that encompass various touring scenarios and cater to personalized preferences. We further evaluate VirtuWander through a user study within an immersive simulated museum. The results suggest that our system enhances engaging virtual tour experiences through personalized communication and knowledgeable assistance, indicating its potential for expanding into real-world scenarios.
- Virtual museums as a means for promotion and enhancement of cultural heritage. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 42 (2019), 33–40. https://doi.org/10.5194/isprs-archives-XLII-2-W15-33-2019
- Lost in Style: Gaze-driven Adaptive Aid for VR Navigation. In Proc. ACM CHI. Association for Computing Machinery, New York, USA, 1–12. https://doi.org/10.1145/3290605.3300578
- “You, Move There!”: Investigating the Impact of Feedback on Voice Control in Virtual Environments. In Proceedings of the 3rd Conference on Conversational User Interfaces. Association for Computing Machinery, New York, USA, Article 14, 9 pages. https://doi.org/10.1145/3469595.3469609
- A platform for virtual museums with personalized content. Multimedia tools and applications 42 (2009), 139–159. https://doi.org/10.1007/s11042-008-0231-2
- A methodology for the evaluation of travel techniques for immersive virtual environments. Virtual reality 3, 2 (1998), 120–131. https://doi.org/10.1007/BF01417673
- Virginia Braun and Victoria Clarke. 2019. Reflecting on reflexive thematic analysis. Qualitative Research in Sport, Exercise and Health 11, 4 (2019), 589–597. https://doi.org/10.1080/2159676X.2019.1628806
- Museum of Interface: Designing the virtual environment. Proceedings of the Fifth Conference on Computer Aided Architectural Design Research in Asia (2000), 471–480. https://doi.org/10.52842/conf.caadria.2000.471
- Deep Reinforcement Learning from Human Preferences. In NeurIPS, Vol. 30. 430–4310. https://proceedings.neurips.cc/paper_files/paper/2017/file/d5e2c0adad503c91f91df240d0cd4e49-Paper.pdf
- Multisensory Interactive Storytelling to Augment the Visit of a Historical House Museum. In International Conference on Virtual Systems & Multimedia. 1–8. https://doi.org/10.1109/DigitalHeritage.2018.8810099
- Linda Daniela. 2020. Virtual Museums as Learning Agents. Sustainability 12, 7 (2020), 2698. https://doi.org/10.3390/su12072698
- Redefining the digital paradigm for virtual museums: Towards interactive and engaging experiences in the post-pandemic era. In International Conference on Human-Computer Interaction. Springer, 357–373. https://doi.org/10.1007/978-3-030-77411-0_23
- Antonina Dattolo and Flaminia L Luccio. 2008. Visualizing Personalized Views in Virtual Museum Tours. In 2008 Conference on Human System Interactions. IEEE, 109–114. https://doi.org/10.1109/HSI.2008.4581418
- Nicola Davis. 2015. Don’t just look–smell, feel, and hear art. Tate’s new way of experiencing paintings. The Guardian 22 (2015). https://www.theguardian.com/artanddesign/2015/aug/22/tate-sensorium-art-soundscapes-chocolates-invisible-rain
- Lina Eklund. 2020. A Shoe Is a Shoe Is a Shoe: Interpersonalization and Meaning-making in Museums – Research Findings and Design Implications. International Journal of Human-Computer Interaction 36, 16 (2020), 1503–1513. https://doi.org/10.1080/10447318.2020.1767982
- A Conceptual Human–Centered Approach to Immersive Digital Heritage Site/Museum Experiences: The Hidden Waterfall City. In Digital Heritage International Congress (DigitalHERITAGE) held jointly with 2018 24th International Conference on Virtual Systems & Multimedia (VSMM 2018). IEEE, 1–4. https://doi.org/10.1109/DigitalHeritage.2018.8810110
- John H Falk and Lynn D Dierking. 1992. The Museum Experience. Whalesback Books. https://books.google.co.jp/books?id=Hd9l6gt6aJ0C
- John H Falk and Lynn D Dierking. 2000. Learning from Museums: Visitor Experiences and the Making of Meaning. AltaMira Press.
- John H Falk and Lynn D Dierking. 2013. The Museum Experience Revisited (1st edition ed.). Routledge. https://doi.org/10.4324/9781315417851
- Living in a Learning Society: Museums and Free-choice Learning. A Companion to Museum Studies (2006), 323–339. https://doi.org/10.1002/9780470996836.ch19
- Natural Experiences in Museums through Virtual Reality and Voice Commands. In Proc. ACM MM. Association for Computing Machinery, New York, USA, 1233––1234. https://doi.org/10.1145/3123266.3127916
- Tell Me Where To Go: Voice-Controlled Hands-Free Locomotion for Virtual Reality Systems. In Proc. IEEE VR. IEEE, 123–134. https://doi.org/10.1109/VR55154.2023.00028
- Eva Hornecker and Luigina Ciolfi. 2019. Human-computer interactions in museums. Springer Cham. https://doi.org/10.1007/978-3-031-02225-8
- Douleur: Creating Pain Sensation with Chemical Stimulant to Enhance User Experience in Virtual Reality. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 5, 2, Article 66 (2021), 26 pages. https://doi.org/10.1145/3463527
- Development of a virtual museum including a 4D presentation of building history in virtual reality. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 42 (2017), 361–367. https://doi.org/10.5194/isprs-archives-XLII-2-W3-361-2017
- N. Levent and A. Pascual-Leone. 2014. The Multisensory Museum: Cross-Disciplinary Perspectives on Touch, Sound, Smell, Memory, and Space. Rowman & Littlefield Publishers. https://books.google.co.jp/books?id=c0sJAwAAQBAJ
- ChangYuan Li and BaiHui Tang. 2019. Research on Voice Interaction Technology in VR Environment. In International Conference on Electronic Engineering and Informatics (EEI). IEEE, 213–216. https://doi.org/10.1109/EEI48997.2019.00053
- CubeMuseum: An Augmented Reality Prototype of Embodied Virtual Museum. In IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct). IEEE, 13–17. https://doi.org/10.1109/ISMAR-Adjunct54149.2021.00014
- Pre-Train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing. Comput. Surveys 55, 9, Article 195 (2023), 35 pages. https://doi.org/10.1145/3560815
- Wandertroper: Supporting Aesthetic Engagement with Everyday Surroundings through Soundscape Augmentation. In Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia (MUM ’16). Association for Computing Machinery, 129–140. https://doi.org/10.1145/3012709.3012725
- NVIDIA Corporation. 2023. Create XR Experiences Using Natural-Language Voice Commands: Test Project Mellon. https://developer.nvidia.com/blog/creating-xr-experiences-using-natural-language-voice-commands-test-project-mellon/. Accessed: 2023-12-01.
- Heather L O’Brien and Elaine G Toms. 2010. The development and evaluation of a survey to measure user engagement. Journal of the American Society for Information Science and Technology 61, 1 (2010), 50–69. https://doi.org/10.1002/asi.21229
- Marianna Obrist. 2017. Mastering the Senses in HCI: Towards Multisensory Interfaces. In Proceedings of the 12th Biannual Conference on Italian SIGCHI Chapter (CHItaly ’17). Association for Computing Machinery, Article 2, 2 pages. https://doi.org/10.1145/3125571.3125603
- OpenAI. 2022. OpenAI: Introducing ChatGPT. https://openai.com/blog/chatgpt
- OpenAI. 2023. GPT-4 Technical Report. arXiv:2303.08774
- Hunter Osking and John A Doucette. 2019. Enhancing Emotional Effectiveness of Virtual-Reality Experiences with Voice Control Interfaces. In Immersive Learning Research Network. Springer, 199–209. https://doi.org/10.1007/978-3-030-23089-0_15
- Generative Agents: Interactive Simulacra of Human Behavior. arXiv:2304.03442
- Alireza Gholinejad Pirbazari and Sina Kamali Tabrizi. 2022. RecorDIM of Iran’s Cultural Heritage Using an Online Virtual Museum, Considering the Coronavirus Pandemic. ACM Journal on Computing and Cultural Heritage (JOCCH) 15, 2 (2022), 1–14.
- Laia Pujol and Anna Lorente. 2014. The Virtual Museum: A Quest for the Standard Definition. Archaeology in the Digital Era 40 (2014), 40–48. https://doi.org/10.1017/9789048519590.005
- Evaluation of voice commands for mode change in virtual reality implant planning procedure. International Journal of Computer Assisted Radiology and Surgery 17, 11 (2022), 1981–1989. https://doi.org/10.1007/s11548-022-02685-1
- Deborah Richards. 2012. Agent-Based Museum and Tour Guides: Applying the State of the Art. In Proc. Australasian Conference on Interactive Entertainment: Playing the System. Association for Computing Machinery, New York, USA, Article 15, 9 pages. https://doi.org/10.1145/2336727.2336742
- Steps towards prompt-based creation of virtual worlds. arXiv:2211.05875
- Code Llama: Open Foundation Models for Code. arXiv:2308.12950
- Fostering Virtual Guide in Exhibitions. In Proceedings of the 21st International Conference on Human-Computer Interaction with Mobile Devices and Services. Association for Computing Machinery, New York, USA, Article 48, 6 pages. https://doi.org/10.1145/3338286.3344395
- VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View. arXiv:2307.06082
- Werner Schweibenz. 2019. The virtual museum: An overview of its origins, concepts, and terminology. The Museum Review 4, 1 (2019), 1–29.
- LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action. In Conference on Robot Learning. PMLR, 492–504.
- Virtual Artifact: Enhancing museum exhibit using 3D virtual reality. In TRON Symposium (TRONSHOW). IEEE, 1–5. https://doi.org/10.23919/TRONSHOW.2017.8275078
- Virtual museums, a survey and some issues for consideration. Journal of Cultural Heritage 10, 4 (2009), 520–528. https://doi.org/10.1016/j.culher.2009.03.003
- Avatars as storytellers: Affective narratives in virtual museums. Personal and Ubiquitous Computing 24, 6 (2020), 829–841. https://doi.org/10.1007/s00779-019-01358-2
- Exploring the relationship between presence and enjoyment in a virtual museum. International Journal of Human-Computer Studies 68, 5 (2010), 243–253. https://doi.org/10.1016/j.ijhcs.2009.11.002
- Analysis of virtual museums in terms of design and perception of presence. Education and Information Technologies 28, 7 (2023), 8945–8973. https://doi.org/10.1007/s10639-022-11561-z
- Laia Pujol Tost and Maria Economou. 2007. Exploring the suitability of Virtual Reality interactivity for exhibitions through an integrated evaluation: The case of the Ename Museum. 4 (2007), 81–97.
- LLaMA: Open and Efficient Foundation Language Models. arXiv:2302.13971
- Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv:2307.09288
- Virtual museum space as the innovative tool for the student research practice. International Journal of Emerging Technologies in Learning (iJET) 16, 14 (2021), 213–231.
- An Approach to Facilitate Visitors’ Engagement with Contemporary Art in a Virtual Museum. In International Conference on Transdisciplinary Multispectral Modeling and Cooperation for the Preservation of Cultural Heritage. Springer, 207–217. https://doi.org/10.1007/978-3-031-20253-7_17
- RECBOT: Virtual Museum navigation through a Chatbot assistant and personalized Recommendations. In Adjunct Proceedings of the 31st ACM Conference on User Modeling, Adaptation and Personalization. Association for Computing Machinery, 388–396. https://doi.org/10.1145/3563359.3596661
- Not just seeing, but also feeling art: Mid-air haptic experiences integrated in a multisensory art exhibition. International Journal of Human-Computer Studies 108 (2017), 1–14. https://doi.org/10.1016/j.ijhcs.2017.06.004
- Podoportation: Foot-Based Locomotion in Virtual Reality. In Proc. ACM CHI. Association for Computing Machinery, New York, USA, 1–14. https://doi.org/10.1145/3313831.3376626
- Annika Waern and Anders Sundnes Løvlie. 2022. Hybrid Museum Experiences: Theory and Design. Amsterdam University Press. https://doi.org/10.5117/9789463726443
- Enabling Conversational Interaction with Mobile UI Using Large Language Models. In Proc. ACM CHI. Springer, Article 432, 17 pages. https://doi.org/10.1145/3544548.3580895
- Virtual Museum ‘Takeouts’ and DIY Exhibitions-Augmented Reality Apps for Scholarship, Citizen Science and Public Engagement. In Euro-Mediterranean Conference. Springer, 323–333. https://doi.org/10.1007/978-3-030-73043-7_27
- Virtual Agents in Immersive Virtual Reality Environments: Impact of Humanoid Avatars and Output Modalities on Shopping Experience. International Journal of Human–Computer Interaction 0, 0 (2023), 1–23. https://doi.org/10.1080/10447318.2023.2241293
- The Invisible Museum: A User-Centric Platform for Creating Virtual 3D Exhibitions with VR Support. Electronics 10, 3 (2021), 363. https://doi.org/10.3390/electronics10030363
- Value-based model of user interaction design for virtual museum. CCF Transactions on Pervasive Computing and Interaction 3, 2 (2021), 112–128. https://doi.org/10.1007/s42486-021-00061-7