Direction of Arrival Estimation Using Microphone Array Processing for Moving Humanoid Robots (2401.02386v1)
Abstract: The auditory system of humanoid robots has gained increased attention in recent years. This system typically acquires the surrounding sound field by means of a microphone array. Signals acquired by the array are then processed using various methods. One of the widely applied methods is direction of arrival estimation. The conventional direction of arrival estimation methods assume that the array is fixed at a given position during the estimation. However, this is not necessarily true for an array installed on a moving humanoid robot. The array motion, if not accounted for appropriately, can introduce a significant error in the estimated direction of arrival. The current paper presents a signal model that takes the motion into account. Based on this model, two processing methods are proposed. The first one compensates for the motion of the robot. The second method is applicable to periodic signals and utilizes the motion in order to enhance the performance to a level beyond that of a stationary array. Numerical simulations and an experimental study are provided, demonstrating that the motion compensation method almost eliminates the motion-related error. It is also demonstrated that by using the motion-based enhancement method it is possible to improve the direction of arrival estimation performance, as compared to that obtained when using a stationary array.
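The abstract's central claim, that unaccounted array motion biases the direction-of-arrival estimate and that compensation removes the bias, can be illustrated with a toy example. The sketch below is not the signal model or either method from the paper. It assumes a noise-free, narrowband, far-field source, a free-field uniform circular array, a simple delay-and-sum estimator, and an array orientation that is known exactly at every snapshot. All function names and parameter values (`steering`, `das_doa`, `simulate`, 8 microphones, 0.1 m radius, 1 kHz) are hypothetical choices made for illustration.

```python
import numpy as np


def steering(theta, mic_angles, kr):
    """Far-field steering vectors of a circular array (free-field assumption).

    theta: candidate azimuth(s) in the array frame, radians.
    Returns an array of shape (len(theta), n_mics).
    """
    theta = np.atleast_1d(theta)[:, None]
    return np.exp(1j * kr * np.cos(theta - mic_angles[None, :]))


def das_doa(snapshots, orientations, mic_angles, kr, grid, compensate):
    """Delay-and-sum azimuth estimate accumulated over all snapshots.

    With compensate=True each candidate world-frame azimuth is mapped to the
    array frame using the known orientation of that snapshot; with
    compensate=False the array is treated as fixed at its initial orientation.
    """
    power = np.zeros(grid.size)
    for x, phi in zip(snapshots, orientations):
        offset = phi if compensate else 0.0
        A = steering(grid - offset, mic_angles, kr)  # shape (n_grid, n_mics)
        power += np.abs(A.conj() @ x) ** 2
    return grid[np.argmax(power)]


def simulate(src_deg=60.0, sweep_deg=30.0, n_snap=50,
             n_mics=8, radius=0.1, freq=1000.0, c=343.0):
    """Rotate the array by sweep_deg during the estimate; return both results in degrees."""
    kr = 2.0 * np.pi * freq / c * radius
    mic_angles = 2.0 * np.pi * np.arange(n_mics) / n_mics
    grid = np.deg2rad(np.arange(360.0))  # 1-degree search grid
    orientations = np.deg2rad(np.linspace(0.0, sweep_deg, n_snap))
    src = np.deg2rad(src_deg)
    # In the array frame the source appears at (world azimuth - array orientation).
    snapshots = [steering(src - phi, mic_angles, kr)[0] for phi in orientations]
    raw = das_doa(snapshots, orientations, mic_angles, kr, grid, compensate=False)
    comp = das_doa(snapshots, orientations, mic_angles, kr, grid, compensate=True)
    return np.rad2deg(raw), np.rad2deg(comp)


if __name__ == "__main__":
    raw_deg, comp_deg = simulate()
    print(f"uncompensated estimate: {raw_deg:.1f} deg")
    print(f"compensated estimate:   {comp_deg:.1f} deg")
```

In this idealized setting the compensated estimate lands on the true azimuth, while the uncompensated one is pulled toward the middle of the directions the source swept through in the array frame. A real robot would add noise, reverberation, scattering from the head and torso, and uncertainty in the measured orientation, none of which this sketch models.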