Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming
Abstract: In binaural audio synthesis, aligning head-related impulse responses (HRIRs) in time has been an important pre-processing step, enabling accurate spatial interpolation and efficient data compression. The maximum correlation time delay between spatially nearby HRIRs has previously been used to get accurate and smooth alignment by solving a matrix equation in which the solution has the minimum Euclidean distance to the time delay. However, the Euclidean criterion could lead to an over-smoothing solution in practice. In this paper, we solve the smoothing issue by formulating the task as solving an integer linear programming problem equivalent to minimising an $L1$-norm. Moreover, we incorporate 1) the cross-correlation of inter-aural HRIRs, and 2) HRIRs with their minimum-phase responses to have more reference measurements for optimisation. We show the proposed method can get more accurate alignments than the Euclidean-based method by comparing the spectral reconstruction loss of time-aligned HRIRs using spherical harmonics representation on seven HRIRs consisting of human and dummy heads. The extra correlation features and the $L1$-norm are also beneficial in extremely noisy conditions. In addition, this method can be applied to phase unwrapping of head-related transfer functions, where the unwrapped phase could be a compact feature for downstream tasks.
- Magnitude-corrected and time-aligned interpolation of head-related transfer functions. IEEE/ACM TASLP, 31:3783–3799, 2023.
- Efficient Representation and Sparse Sampling of Head-Related Transfer Functions Using Phase-Correction Based on Ear Alignment. IEEE/ACM TASLP, 27(12):2249–2262, Dec. 2019.
- F. Brinkmann and S. Weinzierl. Comparison of head-related transfer functions pre-processing techniques for spherical harmonics decomposition. In AES International Conference on Audio for Virtual and Augmented Reality, 2018.
- M. Costantini. A novel phase unwrapping method based on network programming. IEEE Transactions on Geoscience and Remote Sensing, 36(3):813–821, May 1998.
- Interpolation and range extrapolation of hrtfs. In ICASSP, volume 4, pages iv–iv, 2004.
- The SONICOM HRTF Dataset. Journal of the Audio Engineering Society, 71(5):241–253, May 2023.
- Analyzing head-related transfer function measurements using surface spherical harmonics. The Journal of the Acoustical Society of America, 104(4):2400–2411, 1998.
- H. Gamper. Selection and interpolation of head-related transfer functions for rendering moving virtual sound sources. In DAFx, 2013.
- Hrtf measurements of a kemar. The Journal of the Acoustical Society of America, 97(6):3907–3908, 1995.
- K. Itoh. Analysis of the phase unwrapping algorithm. Appl. Opt., 21(14):2470–2470, Jul 1982.
- A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. The Journal of the Acoustical Society of America, 91(3):1637–1647, 1992.
- Time Series Phase Unwrapping Based on Graph Theory and Compressed Sensing. IEEE Transactions on Geoscience and Remote Sensing, 60:1–12, 2022.
- K. P. Murphy. Probabilistic Machine Learning: An introduction. MIT Press, 2022.
- A method for estimating interaural time difference for binaural synthesis. In Audio Engineering Society Convention 125, 2008.
- Cross-correlation-based time-aligned interpolation of head-related impulse responses. In AES International Conference on Spatial and Immersive Audio, 2023.
- Directional equalization of sparse head-related transfer function sets for spatial upsampling. IEEE/ACM TASLP, 27(6):1060–1071, 2019.
- Noise-resistant correlation-based alignment of head-related transfer functions for high-fidelity spherical harmonics representation. In AES International Conference on Spatial and Immersive Audio, 2023.
- A. P. Shanker and H. Zebker. Edgelist phase unwrapping algorithm for time series InSAR analysis. JOSA A, 27(3):605–612, Mar. 2010.
- Head-related transfer function interpolation through multivariate polynomial fitting of principal component weights. Acoustical Science and Technology, 30(6):395–403, 2009.
- J. Zaar. Phase unwrapping for spherical interpolation of head-related transfer functions. M. thesis, IEM, Univ. of Music and Performing Arts Graz, 2011.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.