Feasibility of iMagLS-BSM -- ILD Informed Binaural Signal Matching with Arbitrary Microphone Arrays (2408.03611v1)
Abstract: Binaural reproduction for headphone-centric listening has become a focal point in ongoing research, particularly within the realm of advancing technologies such as augmented and virtual reality (AR and VR). The demand for high-quality spatial audio in these applications is essential to uphold a seamless sense of immersion. However, challenges arise from wearable recording devices equipped with only a limited number of microphones and irregular microphone placements due to design constraints. These factors contribute to limited reproduction quality compared to reference signals captured by high-order microphone arrays. This paper introduces a novel optimization loss tailored for a beamforming-based, signal-independent binaural reproduction scheme. This method, named iMagLS-BSM incorporates an interaural level difference (ILD) error term into the previously proposed binaural signal matching (BSM) magnitude least squares (MagLS) rendering loss for lateral plane angles. The method leverages nonlinear programming to minimize the introduced loss. Preliminary results show a substantial reduction in ILD error, while maintaining a binaural magnitude error comparable to that achieved with a MagLS BSM solution. These findings hold promise for enhancing the overall spatial quality of resultant binaural signals.
- B. Rafaely, V. Tourbabin, E. Habets, Z. Ben-Hur, H. Lee, H. Gamper, L. Arbel, L. Birnie, T. Abhayapala, and P. Samarasinghe, “Spatial audio signal processing for binaural reproduction of recorded acoustic scenes–review and challenges,” Acta Acustica, vol. 6, p. 47, 2022.
- L. Madmoni, J. Donley, V. Tourbabin, and B. Rafaely, “Beamforming-based binaural reproduction by matching of binaural signals,” in Audio Engineering Society Conference: 2020 AES International Conference on Audio for Virtual and Augmented Reality. Audio Engineering Society, 2020.
- E. Rasumow, M. Blau, M. Hansen, S. Doclo, S. Van De Par, V. Mellert, and D. Püschel, “Robustness of virtual artificial head topologies with respect to microphone positioning,” in Proceedings of Forum Acusticum, 2011, pp. 397–402.
- L. McCormack, N. Meyer-Kahlen, D. L. Alon, Z. Ben-Hur, S. V. A. Garí, and P. Robinson, “Six-degrees-of-freedom binaural reproduction of head-worn microphone array capture,” Journal of the Audio Engineering Society, vol. 71, no. 10, pp. 638–649, 2023.
- J. Fernandez, L. McCormack, P. Hyvärinen, A. Politis, and V. Pulkki, “Enhancing binaural rendering of head-worn microphone arrays through the use of adaptive spatial covariance matching,” The Journal of the Acoustical Society of America, vol. 151, no. 4, pp. 2624–2635, 2022.
- L. McCormack, A. Politis, R. Gonzalez, T. Lokki, and V. Pulkki, “Parametric ambisonic encoding of arbitrary microphone arrays,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 2062–2075, 2022.
- L. McCormack and S. Delikaris-Manias, “Parametric first-order ambisonic decoding for headphones utilising the cross-pattern coherence algorithm,” in EAA Spatial Audio Signal Processing Symposium, 2019, pp. 173–178.
- O. Berebi, Z. Ben-Hur, D. L. Alon, and B. Rafaely, “imagls: Interaural level difference with magnitude least-squares loss for optimized first-order head-related transfer function,” 10th Convention of the European Acoustics Association Forum Acusticum, pp. 631–634, 2023.
- T. Deppisch, H. Helmholz, and J. Ahrens, “End-to-end magnitude least squares binaural rendering of spherical microphone array signals,” in 2021 Immersive and 3D Audio: from Architecture to Automotive (I3DA). IEEE, 2021, pp. 1–7.
- J. W. Strutt, “On our perception of sound direction,” Philosophical Magazine, vol. 13, no. 74, pp. 214–32, 1907.
- P. W. Kassakian, “Convex approximation and optimization with applications in magnitude filter design and radiation pattern synthesis,” Ph.D. dissertation, University of California, Berkeley Berkeley, CA, 2006.
- M. Burkhard and R. Sachs, “Anthropometric manikin for acoustic research,” JASA, vol. 58, no. 1, pp. 214–222, 1975.
- C. G. Broyden, “The convergence of a class of double-rank minimization algorithms 1. general considerations,” IMA Journal of Applied Mathematics, vol. 6, no. 1, pp. 76–90, 1970.
- W. A. Yost and R. H. Dye Jr, “Discrimination of interaural differences of level as a function of frequency,” JASA, vol. 83, no. 5, pp. 1846–1851, 1988.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Collections
Sign up for free to add this paper to one or more collections.