Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
194 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

BRUDEX Database: Binaural Room Impulse Responses with Uniformly Distributed External Microphones (2306.08484v1)

Published 14 Jun 2023 in eess.AS, cs.SD, and eess.SP

Abstract: There is an emerging need for comparable data for multi-microphone processing, particularly in acoustic sensor networks. However, commonly available databases are often limited in the spatial diversity of the microphones or only allow for particular signal processing tasks. In this paper, we present a database of acoustic impulse responses and recordings for a binaural hearing aid setup, 36 spatially distributed microphones spanning a uniform grid of (5x5) m2 and 12 source positions. This database can be used for a variety of signal processing tasks, such as (multi-microphone) noise reduction, source localization, and dereverberation, as the measurements were performed using the same setup for three different reverberation conditions (T_60\approx{310, 510, 1300} ms). The usability of the database is demonstrated for a noise reduction task using a minimum variance distortionless response beamformer based on relative transfer functions, exploiting the availability of spatially distributed microphones.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (33)
  1. S. Markovich-Golan, A. Bertrand, M. Moonen, and S. Gannot, “Optimal distributed minimum-variance beamforming approaches for speech enhancement in wireless acoustic sensor networks,” Signal Processing, vol. 107, pp. 4–20, Feb. 2015.
  2. V. M. Tavakoli, J. R. Jensen, M. G. Christensen, and J. Benesty, “A framework for speech enhancement with ad hoc microphone arrays,” IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol. 24, pp. 1038–1051, Mar. 2016.
  3. M. Cobos, F. Antonacci, A. Alexandridis, A. Mouchtaris, and B. Lee, “A survey of sound source localization methods in wireless acoustic sensor networks,” Wireless Communications and Mobile Computing, vol. 2017, pp. 1–24, Aug. 2017.
  4. A. I. Koutrouvelis, T. W. Sherson, R. Heusdens, and R. C. Hendriks, “A low-cost robust distributed linearly constrained beamformer for wireless acoustic sensor networks with arbitrary topology,” IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol. 26, pp. 1434–1448, Apr. 2018.
  5. J. Zhang, R. Heusdens, and R. C. Hendriks, “Relative acoustic transfer function estimation in wireless acoustic sensor networks,” IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol. 27, pp. 1507–1519, Jun. 2019.
  6. N. Gößling, W. Middelberg, and S. Doclo, “RTF-steered binaural MVDR beamforming incorporating multiple external microphones,” in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), (New Paltz, USA), pp. 368–372, Oct. 2019.
  7. J. B. Allen and D. A. Berkley, “Image method for efficiently simulating small-room acoustics,” The Journal of the Acoustical Society of America, vol. 65, pp. 943–950, Apr. 1979.
  8. D. P. Jarrett, E. A. P. Habets, M. R. P. Thomas, and P. A. Naylor, “Rigid sphere room impulse response simulation: Algorithm and applications,” The Journal of the Acoustical Society of America, vol. 132, pp. 1462–1472, Sep. 2012.
  9. E. Hadad, F. Heese, P. Vary, and S. Gannot, “Multichannel audio database in various acoustic environments,” in Proc. International Workshop on Acoustic Signal Enhancement (IWAENC), (Juan-les-Pins, France), pp. 313–317, Sep. 2014.
  10. R. Stewart and M. Sandler, “Database of omnidirectional and B-format room impulse responses,” in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 165–168, Mar. 2010.
  11. S. Koyama, T. Nishida, K. Kimura, T. Abe, N. Ueno, and J. Brunnström, “Meshrir: A dataset of room impulse responses on meshed grid points for evaluating sound field analysis and synthesis methods,” in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–5, Oct. 2021.
  12. H. Kayser, S. D. Ewert, J. Anemüller, T. Rohdenburg, V. Hohmann, and B. Kollmeier, “Database of multichannel In-Ear and Behind-The-Ear Head-Related and Binaural Room Impulse Responses,” Eurasip Journal on Advances in Signal Processing, vol. 2009, pp. 1–10, Jul. 2009.
  13. F. Denk, S. M. Ernst, S. D. Ewert, and B. Kollmeier, “Adapting hearing devices to the individual ear acoustics: Database and target response correction functions for various device styles,” Trends in hearing, vol. 22, p. 2331216518779313, Jun. 2018.
  14. W. S. Woods, E. Hadad, I. Merks, B. Xu, S. Gannot, and T. Zhang, “A real-world recording database for ad hoc microphone arrays,” in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 1–5, Oct. 2015.
  15. R. M. Corey, M. D. Skarha, and A. C. Singer, “Massive distributed microphone array dataset,” 2019. [Online]. Available: https://doi.org/10.13012/B2IDB-6216881_V1.
  16. T. Dietzen, R. Ali, M. Taseska, and T. van Waterschoot, “MYRiAD: a multi-array room acoustic database,” EURASIP Journal on Audio, Speech, and Music Processing, vol. 2023, pp. 1–14, Apr. 2023.
  17. S. Doclo, S. Gannot, D. Marquardt, and E. Hadad, “Binaural speech processing with application to hearing devices,” in Audio Source Separation and Speech Enhancement (E. Vincent, T. Virtanen, and S. Gannot, eds.), pp. 413–442, Hoboken, NJ, USA: John Wiley & Sons, Ltd., 2018.
  18. M. R. Schroeder, “New method of measuring reverberation time,” The Journal of the Acoustical Society of America, vol. 37, pp. 419–412, Jun. 1965.
  19. P. A. Naylor, E. A. P. Habets, J. Y.-C. Wen, and N. D. Gaubitch, “Models, measurement and evaluation,” in Speech Dereverberation (P. A. Naylor and N. D. Gaubitch, eds.), pp. 21–56, London, Great Britain: Springer, 2010.
  20. R. Schmidt, “Multiple emitter location and signal parameter estimation,” IEEE Trans. on Antennas and Propagation, vol. 34, pp. 276–280, Mar. 1986.
  21. H. Kayser and J. Anemüller, “A discriminative learning approach to probabilistic acoustic source localization,” in Proc. International Workshop on Acoustic Signal Enhancement (IWAENC), (Juan-les-Pins, France), pp. 99–103, Sep. 2014.
  22. D. Fejgin and S. Doclo, “Comparison of binaural RTF-vector-based direction of arrival estimation methods exploiting an external microphone,” in Proc. European Signal Processing Conference (EUSIPCO), (Dublin, Ireland), pp. 241–245, Aug. 2021.
  23. A. Farina, “Simultaneous measurement of impulse response and distortion with a swept-sine technique,” in Proc. Audio Engineering Society (AES), (Paris, France), Feb. 2000.
  24. A. Novák, L. Simon, F. Kadlec, and P. Lotton, “Nonlinear system identification using exponential swept-sine signal,” IEEE Trans. on Instrumentation and Measurement, vol. 59, pp. 2220–2229, Aug 2010.
  25. European Broadcasting Union, “Sound quality assessment material - recordings for subjective tests: User’s handbook for the EBU SQUAM CD,” 2008. [Online]. Available: https://tech.ebu.ch/publications/sqamcd.
  26. C. Veaux, J. Yamagishi, and K. MacDonald, “CSTR VCTK corpus: English multi-speaker corpus for CSTR voice cloning toolkit,” University of Edinburgh. The Centre for Speech Technology Research (CSTR), 2017.
  27. Y. Avargel and I. Cohen, “On multiplicative transfer function approximation in the short-time Fourier transform domain,” IEEE Signal Processing Letters, vol. 14, no. 5, pp. 337–340, 2007.
  28. S. Doclo, W. Kellermann, S. Makino, and S. E. Nordholm, “Multichannel signal enhancement algorithms for assisted listening devices: Exploiting spatial diversity using multiple microphones,” IEEE Signal Processing Magazine, vol. 32, pp. 18–30, Mar. 2015.
  29. S. Gannot, E. Vincent, S. Markovich-Golan, and A. Ozerov, “A consolidated perspective on multi-microphone speech enhancement and source separation,” IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol. 25, pp. 692–730, Apr. 2017.
  30. B. D. Van Veen and K. M. Buckley, “Beamforming: A versatile approach to spatial filtering,” IEEE ASSP Magazine, vol. 5, pp. 4–24, Apr. 1988.
  31. N. Gößling and S. Doclo, “Relative transfer function estimation exploiting spatially separated microphones in a diffuse noise field,” in Proc. International Workshop on Acoustic Signal Enhancement (IWAENC), (Tokyo, Japan), pp. 146–150, Sep. 2018.
  32. T. Gerkmann and R. C. Hendriks, “Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay,” IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, pp. 1383–1393, May 2012.
  33. N. Gößling and S. Doclo, “RTF-Steered Binaural MVDR Beamforming Incorporating an External Microphone for Dynamic Acoustic Scenarios,” in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), (Brighton, UK), pp. 416–420, May 2019.
Citations (5)

Summary

We haven't generated a summary for this paper yet.