DeepEar: Sound Localization With Binaural Microphones

Cited by: 7
Authors
Yang, Qiang [1]
Zheng, Yuanqing [1]
Affiliations
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
Keywords
Binaural localization; multi-source localization; earable computing; neural networks; head movements; noise; difference; features; search
DOI
10.1109/TMC.2022.3222821
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The binaural microphone, a pair of microphones mounted in artificial human-shaped ears, is widely used in hearing aids and spatial audio recording to improve sound quality. Many applications of such devices, such as binaural sound enhancement, depend on finding the direction of a voice. However, sound localization with two microphones remains challenging, especially in multi-source scenarios. Most previous work relies on microphone arrays to handle multi-source localization, but extra microphones face space constraints in many deployments (e.g., hearing aids). Inspired by the fact that humans have evolved to locate multiple sound sources with only two ears, we propose DeepEar, a binaural microphone-based sound localization system. To this end, we design a multisector-based neural network to locate multiple sound sources simultaneously, where each sector is a discretized region of space covering a range of angles of arrival. DeepEar fuses explicit hand-crafted features and implicit latent sound representations to facilitate sound localization. More importantly, the trained DeepEar model can adapt to new environments with a minimal amount of extra training data. Experimental results show that DeepEar substantially outperforms the state-of-the-art binaural deep learning approach in terms of sound detection accuracy and azimuth estimation error.
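The sector idea in the abstract can be illustrated with a minimal sketch: the azimuth circle is split into equal sectors, and a set of source directions becomes a multi-hot label with one entry per sector. This is only an illustration under stated assumptions, not the authors' implementation; the sector count of 8, the equal-width layout, and the function names are all hypothetical.

```python
# Hypothetical illustration of sector-based labels for multi-source localization.
# NUM_SECTORS = 8 and equal-width sectors are assumptions, not taken from the paper.
NUM_SECTORS = 8


def azimuths_to_sector_labels(azimuths_deg, num_sectors=NUM_SECTORS):
    """Return a multi-hot list: entry k is 1 if any source lies in sector k."""
    width = 360.0 / num_sectors
    labels = [0] * num_sectors
    for az in azimuths_deg:
        # Wrap the angle into [0, 360) and find which sector contains it.
        idx = int((az % 360.0) // width)
        # Guard against floating-point wrap-around landing exactly on 360.0.
        idx = min(idx, num_sectors - 1)
        labels[idx] = 1
    return labels


def sector_to_angle_range(idx, num_sectors=NUM_SECTORS):
    """Return the (start, end) azimuth range in degrees covered by sector idx."""
    width = 360.0 / num_sectors
    return idx * width, (idx + 1) * width


if __name__ == "__main__":
    # Three simultaneous sources at 10, 100, and 350 degrees.
    print(azimuths_to_sector_labels([10, 100, 350]))
    print(sector_to_angle_range(2))
```

With a representation like this, detecting several simultaneous sources becomes a multi-label decision over sectors, and a finer azimuth estimate can then be made within each active sector.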
Pages: 359-375
Number of pages: 17