Localizing concurrent sound sources with binaural microphones: A simulation study

被引:0
|
作者
Orr, Jakeh [1 ]
Ebel, William [1 ]
Gai, Yan [1 ]
机构
[1] St Louis Univ, Sch Sci & Engn, St Louis, MO 63105 USA
基金
美国国家科学基金会;
关键词
Sound localization; HRTF; ITD; ILD; Robotics; Sparseness; Reverberations; SOURCE LOCALIZATION; HEAD; IDENTIFICATION; FREQUENCY; PINNAE; ILD;
D O I
10.1016/j.heares.2023.108884
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The human auditory system can localize multiple sound sources using time, intensity, and frequency cues in the sound received by the two ears. Being able to spatially segregate the sources helps perception in a challenging condition when multiple sounds coexist. This study used model simulations to explore an algorithm for localizing multiple sources in azimuth with binaural (i.e., two) microphones. The algorithm relies on the "sparseness" property of daily signals in the time-frequency domain, and sound coming from different locations carrying unique spatial features will form clusters. Based on an interaural normalization procedure, the model generated spiral patterns for sound sources in the frontal hemifield. The model itself was created using broadband noise for better accuracy, because speech typically has sporadic energy at high frequencies. The model at an arbitrary frequency can be used to predict locations of speech and music that occurred alone or concurrently, and a classification algorithm was applied to measure the localization error. Under anechoic conditions, averaged errors in azimuth increased from 4.5 degrees to 19 degrees with RMS errors ranging from 6.4 degrees to 26.7 degrees as model frequency increased from 300 to 3000 Hz. The low-frequency model performance using short speech sound was notably better than the generalized cross-correlation model. Two types of room reverberations were then introduced to simulate difficult listening conditions. Model performance under reverberation was more resilient at low fre-quencies than at high frequencies. Overall, our study presented a spiral model for rapidly predicting horizontal locations of concurrent sound that is suitable for real-world scenarios.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] The simulation of binaural hearing caused by a moving sound source
    Lee, P. L.
    Wang, J. H.
    COMPUTERS & STRUCTURES, 2009, 87 (17-18) : 1102 - 1110
  • [22] Theory and experiment of dual sound sources localization with five proximate microphones
    Kurihara, M
    Ono, N
    Ando, S
    SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 1100 - 1101
  • [23] An improved acoustic deconvolution method for localizing correlated sound sources
    Wei L.
    Qin Z.
    Ren F.
    Zhang Z.
    Li M.
    Liu X.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2019, 40 (11):
  • [24] Head-tracking and subject positioning using binaural headset microphones and common modulation anchor sources
    Karjalainen, M
    Tikander, M
    Härmä, A
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: AUDIO AND ELECTROACOUSTICS SIGNAL PROCESSING FOR COMMUNICATIONS, 2004, : 101 - 104
  • [25] MINIMUM AUDITORY MOVEMENT ANGLE - BINAURAL LOCALIZATION OF MOVING SOUND SOURCES
    PERROTT, DR
    MUSICANT, AD
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 (06): : 1463 - 1466
  • [26] SIMULATION OF MOVING SOUND SOURCES
    CHOWNING, JM
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1971, 19 (01): : 2 - &
  • [27] Binaural Bearing Only Tracking of Stationary Sound Sources in Reverberant Environment
    Kossyk, Ingo
    Neumann, Michael
    Marton, Zoltan-Csaba
    2015 IEEE-RAS 15TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2015, : 53 - 60
  • [28] SIMULATION OF MOVING SOUND SOURCES
    CHOWNING, JM
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1970, 18 (06): : 692 - &
  • [29] Efficient Binaural Rendering of Moving Sound Sources Using HRTF Interpolation
    Queiroz, Marcelo
    Montesiao de Sousa, Gustavo Henrique
    JOURNAL OF NEW MUSIC RESEARCH, 2011, 40 (03) : 239 - 252
  • [30] Real Time Binaural Synthesis of Moving Sound Sources over Loudspeakers
    Bruschi, Valeria
    Nobili, Stefano
    Cecchi, Stefania
    2021 IMMERSIVE AND 3D AUDIO: FROM ARCHITECTURE TO AUTOMOTIVE (I3DA), 2021,