The role of direct sound spherical harmonics representation in externalization using binaural reproduction (2401.00936v1)
Abstract: The importance of the information in the direct sound to human perception of spatial sound sources is an ongoing research topic. The classification between direct sound and diffuse or reverberant sound forms the basis of numerous studies in the field of spatial audio. In particular, parametric spatial audio representation methods use this classification and employ signal processing in order to enhance the audio quality at reproduction. However, current literature does not provide information concerning the impact of ideal direct sound representation on externalization, in the context of Ambisonics. This paper aims to assess the importance of the spatial information in the direct sound in the externalization of a sound field when using binaural reproduction. This is done in the spherical harmonics (SH) domain, where an ideal direct sound representation within an otherwise Ambisonics signal is simulated, and its perceived externalization is evaluated in a formal listening test. This investigation leads to the conclusion that externalization of a first order Ambisonics signal may be significantly improved by enhancing the direct sound component, up to a level similar to a third order Ambisonics signal.
- Michael A Gerzon. Periphony: With-height sound reproduction. Journal of the Audio Engineering Society, 21(1):2–10, 1973.
- Michael A Gerzon. Ambisonics in multichannel broadcasting and video. Journal of the Audio Engineering Society, 33(11):859–871, 1985.
- Investigation on localisation accuracy for first and higher order ambisonics reproduced sound sources. Acta Acustica united with Acustica, 99(4):642–657, 2013.
- Localization of 3d ambisonic recordings and ambisonic virtual sources. In 1st International Conference on Spatial Audio,(Detmold), 2011.
- Further study of sound field coding with higher order ambisonics. In Audio Engineering Society Convention 116. Audio Engineering Society, 2004.
- Ville Pulkki. Spatial sound reproduction with directional audio coding. Journal of the Audio Engineering Society, 55(6):503–516, 2007.
- High angular resolution planewave expansion. In Proc. of the 2nd International Symposium on Ambisonics and Spherical Acoustics May, pages 6–7, 2010.
- Spatial audio scene coding. In Audio Engineering Society Convention 125. Audio Engineering Society, 2008.
- Sector-based parametric sound field reproduction in the spherical harmonic domain. IEEE Journal of Selected Topics in Signal Processing, 9(5):852–866, 2015.
- Directional audio coding: Virtual microphone-based synthesis and subjective evaluation. Journal of the Audio Engineering Society, 57(9):709–724, 2009.
- A new method for b-format to binaural transcoding. In Audio Engineering Society Conference: 40th International Conference: Spatial Audio: Sense the Sound of Space. Audio Engineering Society, 2010.
- Enhancement of ambisonic binaural reproduction using directional audio coding with optimal adaptive mixing. In Applications of Signal Processing to Audio and Acoustics (WASPAA), 2017 IEEE Workshop on, pages 379–383. IEEE, 2017.
- A precedence effect in sound localization. The Journal of the Acoustical Society of America, 21(4):468–468, 1949.
- The precedence effect. The Journal of the Acoustical Society of America, 106(4):1633–1654, 1999.
- Patrick M Zurek. The precedence effect. In Directional hearing, pages 85–105. Springer, 1987.
- Perceptual limits for detecting interaural-cue manipulations measured in reverberant settings. In Proceedings of Meetings on Acoustics ICA2013, volume 19, page 015004. ASA, 2013.
- Accurate sound localization in reverberant environments is mediated by robust encoding of spatial cues in the auditory midbrain. Neuron, 62(1):123–134, 2009.
- Effect of source spectrum on sound localization in an everyday reverberant room. The Journal of the Acoustical Society of America, 130(1):324–333, 2011.
- The role of spectral detail in the binaural transfer function on perceived externalization in a reverberant environment. The Journal of the Acoustical Society of America, 139(5):2992–3000, 2016.
- The role of reverberation-related binaural cues in the externalization of speech. The Journal of the Acoustical Society of America, 138(2):1154–1167, 2015.
- MH Acoustics. Em32 eigenmike microphone array release notes (v17. 0). 25 Summit Ave, Summit, NJ 07901, USA, 2013.
- Nearfield binaural synthesis and ambisonics. The Journal of the Acoustical Society of America, 121(3):1559–1563, 2007.
- Computing fourier transforms and convolutions on the 2-sphere. Advances in applied mathematics, 15(2):202–250, 1994.
- Sound-field analysis by plane-wave decomposition using spherical microphone array. The Journal of the Acoustical Society of America, 118(5):3094–3103, 2005.
- Boaz Rafaely. Plane-wave decomposition of the sound field on a sphere by spherical convolution. The Journal of the Acoustical Society of America, 116(4):2149–2157, 2004.
- Analyzing head-related transfer function measurements using surface spherical harmonics. The Journal of the Acoustical Society of America, 104(4):2400–2411, 1998.
- The cipic hrtf database. In Applications of Signal Processing to Audio and Acoustics, 2001 IEEE Workshop on the, pages 99–102. IEEE, 2001.
- Benjamin Bernschütz. A spherical far field HRIR/HRTF compilation of the neumann ku 100. In Proceedings of the 40th Italian (AIA) Annual Conference on Acoustics and the 39th German Annual Conference on Acoustics (DAGA) Conference on Acoustics, page 29, 2013.
- Insights into head-related transfer function: Spatial dimensionality and continuous representation. The Journal of the Acoustical Society of America, 127(4):2347–2357, 2010.
- Interaural cross correlation in a sound field represented by spherical harmonics. The Journal of the Acoustical Society of America, 127(2):823–828, 2010.
- Boaz Rafaely. Fundamentals of spherical array processing, volume 8. Springer, 2015.
- Producing 3d audio in ambisonics. In Audio Engineering Society Conference: 57th International Conference: The Future of Audio Entertainment Technology–Cinema, Television and the Internet. Audio Engineering Society, 2015.
- Investigation of the perceived spatial resolution of higher order ambisonics sound fields: A subjective evaluation involving virtual and real 3d microphones. In Audio Engineering Society Conference: 30th International Conference: Intelligent Audio Environments. Audio Engineering Society, 2007.
- Efficient real spherical harmonic representation of head-related transfer functions. IEEE Journal of Selected Topics in Signal Processing, 9(5):921–930, 2015.
- A direct comparison of localization performance when using first, third, and fifth ambisonics order for real loudspeaker and virtual loudspeaker rendering. In Audio Engineering Society Convention 143. Audio Engineering Society, 2017.
- Yang Liu and BS Xie. Analysis on the timbre of ambisonics reproduction using a binaural loudness model. In The 21 International Congress on Sound and Vibration, 2014.
- Image method for efficiently simulating small-room acoustics. The Journal of the Acoustical Society of America, 65(4):943–950, 1979.
- HRTF magnitude modeling using a non-regularized least-squares fit of spherical harmonics coefficients on incomplete data. In Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific, pages 1–5. IEEE, 2012.
- Darpa timit acoustic phonetic continuous speech corpus (vol. ldc93s1). Philadelphia: Linguistic Data Consortium, 1993.
- Spatial perception of sound fields recorded by spherical microphone arrays with varying spatial resolution. The Journal of the Acoustical Society of America, 133(5):2711–2721, 2013.
- Spectral equalization in binaural signals represented by order-truncated spherical harmonics. The Journal of the Acoustical Society of America, 141(6):4087–4096, 2017.
- The soundscape renderer: A unified spatial audio reproduction framework for arbitrary rendering methods. In In 124 th AES Conv. Citeseer, 2008.
- The interaction between head-tracker latency, source duration, and response time in the localization of virtual sound sources. In ICAD 2004: The 10th Meeting of the International Conference on Auditory Display, Sydney, Australia. Georgia Institute of Technology, 2004.
- Ken Farrar. Soundfield microphone. Wireless World, 85(1526):48–50, 1979.
- Or Nadiri and Boaz Rafaely. Localization of multiple speakers under high reverberation using a spherical microphone array and the direct-path dominance test. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(10):1494–1505, 2014.
- Speaker localization using direct path dominance test based on sound field directivity. Signal Processing, 143:42–47, 2018.
- Exploiting structures of temporal causality for robust speaker localization in reverberant environments. In International Conference on Latent Variable Analysis and Signal Separation, pages 228–237. Springer, 2018.