Source Localization Using Distributed Microphones in Reverberant Environments Based on Deep Learning and Ray Space Transform

被引:19
|
作者
Comanducci, Luca [1 ]
Borra, Federico [1 ]
Bestagini, Paolo [1 ]
Antonacci, Fabio [1 ]
Tubaro, Stefano [1 ]
Sarti, Augusto [1 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy
关键词
Transforms; Training; Arrays; Microphone arrays; Reverberation; Acoustic source localization; deep learning; generalized cross correlation; ray space transform (RST); SOUND SOURCE LOCALIZATION; TIME; NETWORKS; NOISY;
D O I
10.1109/TASLP.2020.3011256
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this article we present a methodology for source localization in reverberant environments from Generalized Cross Correlations (GCCs) computed between spatially distributed individual microphones. Reverberation tends to negatively affect localization based on Time Differences of Arrival (TDOAs), which become inaccurate due to the presence of spurious peaks in the GCC. We therefore adopt a data-driven approach based on a convolutional neural network, which, using the GCCs as input, estimates the source location in two steps. It first computes the Ray Space Transform (RST) from multiple arrays. The RST is a convenient representation of the acoustic rays impinging on the array in a parametric space, called Ray Space. Rays produced by a source are visualized in the RST as patterns, whose position is uniquely related to the source location. The second step consists of estimating the source location through a nonlinear fitting, which estimates the coordinates that best approximate the RST pattern obtained through the first step. It is worth noting that training can be accomplished on simulated data only, thus relaxing the need of actually deploying microphone arrays in the acoustic scene. The localization accuracy of the proposed techniques is similar to the one of SRP-PHAT, however our method demonstrates an increased robustness regarding different distributed microphones configurations. Moreover, the use of the RST as an intermediate representation makes it possible for the network to generalize to data unseen during training.
引用
收藏
页码:2238 / 2251
页数:14
相关论文
共 50 条
  • [1] Multiple Sound Source Localization and Counting Using One Pair of Microphones in Noisy and Reverberant Environments
    Fang, Yuzhuo
    Xu, Zhiyong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
  • [2] Sound Source Localization in Reverberant Environments Based on Structural Sparse Bayesian Learning
    Liu, Yanshan
    Wang, Lu
    Zeng, Xiangyang
    Wang, Haitao
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2018, 104 (03) : 528 - 541
  • [3] SOURCE LOCALIZATION IN REVERBERANT ENVIRONMENTS USING SPARSE OPTIMIZATION
    Le Roux, Jonathan
    Boufounos, Petros T.
    Kang, Kang
    Hershey, John R.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 4310 - 4314
  • [4] Semi-Supervised Source Localization in Reverberant Environments With Deep Generative Modeling
    Bianco, Michael J.
    Gannot, Sharon
    Fernandez-Grande, Efren
    Gerstoft, Peter
    IEEE ACCESS, 2021, 9 : 84956 - 84970
  • [5] Semi-Supervised Source Localization in Reverberant Environments with Deep Generative Modeling
    Bianco, Michael J.
    Gannot, Sharon
    Fernandez-Grande, Efren
    Gerstoft, Peter
    IEEE Access, 2021, 9 : 84956 - 84970
  • [6] Efficient source localization and tracking in reverberant environments using microphone arrays
    Antonacci, F
    Lonoce, D
    Motta, M
    Sarti, A
    Tubaro, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1061 - 1064
  • [7] Sound source localization in reverberant environments using an outlier elimination algorithm
    Jan, EE
    Flanagan, J
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1321 - 1324
  • [8] Improved Speech Source Localization in Reverberant Environments Based on Correlation Dimension
    Wan, Xinwang
    Wu, Zhenyang
    2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 1540 - 1543
  • [9] Speech Source Tracking Based on Distributed Particle Filter in Reverberant Environments
    Wang, Ruifang
    Lan, Xiaoyu
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT II, 2019, 302 : 330 - 342
  • [10] Robust Source Localization in Reverberant Environments Based on Weighted Fuzzy Clustering
    Kuehne, Marco
    Togneri, Roberto
    Nordholm, Sven
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 85 - 88