Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality
Abstract: Virtual reality (VR) environments are frequently used in auditory and cognitive research to imitate real-life scenarios, presumably improving on state-of-the-art approaches that use traditional computer screens. However, the effects of different display technologies on audiovisual processing remain underexplored. This study investigated how VR presented via a head-mounted display (HMD) affects serial recall performance compared to a traditional computer monitor, focusing on audiovisual processing in cognitive tasks. To this end, we conducted two experiments, both using an HMD and a computer monitor as display devices, with one type of audiovisual incongruence each: angle (Exp. 1) and voice (Exp. 2). To quantify cognitive performance, we developed an audiovisual verbal serial recall (avVSR) task in which an embodied conversational agent (ECA) was animated to speak the target digit sequence. Although subjective evaluations showed a higher sense of presence in the HMD condition, we found no effect of the display device on the proportion of correctly recalled digits. In the extreme angle-incongruence conditions with the computer monitor, the proportion of correctly recalled digits increased marginally, presumably due to heightened attention, but the effect is likely too small to be meaningful. Response times were not affected by incongruence with either display device in either experiment. These findings suggest that the avVSR task is robust against angular and voice audiovisual incongruences, irrespective of the display device, at least for the conditions studied here. Hence, the study introduces the avVSR task in VR and contributes to the understanding of audiovisual integration.