
Source Localization by Multidimensional Steered Response Power Mapping with Sparse Bayesian Learning (2405.11792v1)

Published 20 May 2024 in eess.AS

Abstract: We propose an advanced Steered Response Power (SRP) method for localizing multiple sources. While conventional SRP performs well in adverse conditions, it still struggles in scenarios with closely neighboring sources, resulting in ambiguous SRP maps. We address this issue by applying sparsity optimization in SRP to obtain high-resolution maps. Our approach represents SRP maps as multidimensional matrices to preserve time-frequency information and further improve performance in unfavorable conditions. We use multi-dictionary Sparse Bayesian Learning to localize sources without needing prior knowledge of their quantity. We validate our method through practical experiments with a 16-channel planar microphone array and compare it against three other SRP and sparsity-based methods. Our multidimensional SRP approach outperforms conventional SRP and the current state-of-the-art sparse SRP methods for localizing closely spaced sources in a reverberant room.
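
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of a conventional SRP map, not the paper's multidimensional or Sparse Bayesian Learning method. It assumes a far-field source, a hypothetical 8-microphone linear array (the paper uses a 16-channel planar array), noise-free synthetic snapshots, and a sound speed of 343 m/s. All function and variable names are illustrative only.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value


def steering_vector(mic_x, angle_deg, freq_hz):
    """Far-field steering vector for a linear array lying on the x axis."""
    s = math.sin(math.radians(angle_deg))
    return [
        cmath.exp(-2j * math.pi * freq_hz * x * s / SPEED_OF_SOUND)
        for x in mic_x
    ]


def srp_map(mic_x, snapshots, freqs_hz, grid_deg):
    """Conventional SRP: beamformer output power summed over frequency bins.

    snapshots[k] holds one complex value per microphone at freqs_hz[k].
    Summing over frequency collapses the map to one value per direction.
    """
    power = []
    for angle in grid_deg:
        total = 0.0
        for freq, x_k in zip(freqs_hz, snapshots):
            a = steering_vector(mic_x, angle, freq)
            # Steer the array towards `angle` and measure the output power.
            y = sum(a_m.conjugate() * x_m for a_m, x_m in zip(a, x_k))
            total += abs(y) ** 2
        power.append(total)
    return power


# Hypothetical setup: 8 microphones spaced 4 cm apart, one source at 20 degrees.
mic_x = [0.04 * m for m in range(8)]
freqs_hz = [1000.0, 2000.0, 3000.0]
true_angle = 20
snapshots = [steering_vector(mic_x, true_angle, f) for f in freqs_hz]

grid_deg = list(range(-90, 91))
power = srp_map(mic_x, snapshots, freqs_hz, grid_deg)
estimate = grid_deg[power.index(max(power))]
print(estimate)
```

With a single noise-free source the map peaks at the true direction. The ambiguity described in the abstract arises when two sources are close enough that their main lobes merge in this summed map.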

