Active binaural localization of multiple sound sources

被引:20
|
作者
Zhong, Xuan [1 ]
Sun, Liang [2 ]
Yost, William [1 ]
机构
[1] Arizona State Univ, Dept Speech & Hearing Sci, Tempe, AZ 85287 USA
[2] New Mexico State Univ, Dept Mech & Aerosp Engn, Las Cruces, NM 88003 USA
关键词
Sound source localization; Binaural localization; Spatial hearing; Extended Kalman filter (EKF); PLANE; MODEL;
D O I
10.1016/j.robot.2016.07.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sound source localization serves as a significant capability of autonomous robots that conduct missions such as search and rescue, and target tracking in challenging environments. However, localization of multiple sound sources and static sound source tracking in self-motion are both difficult tasks, especially when the number of sound sources or reflections increase. This study presents two robotic hearing approaches based on a human perception model (Wallach, 1939) that combines interaural time difference (ITD) and head turn motion data to locate sound sources. The first method uses a fitting-based approach to recognize the changing trends of the cross-correlation function of binaural inputs. The effectiveness of this method was validated using data collected from a two-microphone array rotating in a non-anechoic environment, and the experiments reveal its ability to separate and localize up to three sound sources of the same spectral content (white noise) at different azimuth and elevation angles. The second method uses an extended Kalman filter (EKF) that estimates the orientation of a sound source by fusing the robot's self motion and ITD data to reduce the localization errors recursively. This method requires limited memory resources and is able to keep tracking the relative position change of a number of static sources when the robot moves. In the experiments, up to three sources can be tracked simultaneously with a two microphone array. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:83 / 92
页数:10
相关论文
共 50 条
  • [41] Binaural sound source localization in real and virtual rooms
    Laboratory of Acoustics and Thermal Physics, Katholieke Universiteit Leuven, 3001 Heverlee, Belgium
    不详
    不详
    AES J Audio Eng Soc, 2009, 4 (205-220):
  • [42] Efficient Binaural Rendering of Spatially Extended Sound Sources
    Anemueller, Carlotta
    Adami, Alexander
    Herre, Juergen
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2023, 71 (05): : 281 - 292
  • [43] Binaural Sound Localization based on Sparse Coding and SOM
    Kim, Hong Shik
    Choi, Jongsuk
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 2557 - 2562
  • [44] Model and application of a binaural 360° sound localization system
    Schauer, C
    Gross, HM
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1132 - 1137
  • [45] Effects of visual images on sound localization with binaural reproduction
    Kuramochi, Toshikatsu
    Ayama, Miyoshi
    Takahashi, Kazuhiro
    Hasegawa, Hiroshi
    Mekada, Yoshito
    Kasuga, Masao
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2000, 54 (09): : 1350 - 1355
  • [46] Modeling the utility of binaural cues for underwater sound localization
    Schneider, Jennifer N.
    Lloyd, David R.
    Banks, Patchouly N.
    Mercado, Eduardo, III
    HEARING RESEARCH, 2014, 312 : 103 - 113
  • [47] Active binaural distance estimation for dynamic sources
    Lu, Yan-Chen
    Cooke, Martin
    Christensen, Heidi
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 933 - 936
  • [48] MONTE CARLO EXPLORATION FOR ACTIVE BINAURAL LOCALIZATION
    Schymura, Christopher
    Grajales, Juan Diego Rios
    Kolossa, Dorothea
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 491 - 495
  • [49] Localization of multiple sound sources based on a CSP analysis with a microphone array
    Nishiura, T
    Yamada, T
    Nakamura, S
    Shikano, K
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1053 - 1056
  • [50] A Modified Frequency Weighted MUSIC Algorithm for Multiple Sound Sources Localization
    Gao, Shan
    Huang, Yankun
    Zhang, Tao
    Wu, Xihong
    Qu, Tianshu
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,