Robust TDOA Estimation Based on Time-Frequency Masking and Deep Neural Networks

被引:27
|
作者
Wang, Zhong-Qiu [1 ]
Zhang, Xueliang [3 ]
Wang, DeLiang [1 ,2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA
[3] Inner Mongolia Univ, Dept Comp Sci, Hohhot, Peoples R China
关键词
GCC-PHAT; time-frequency masking; robust TDOA estimation; deep neural networks; NOISE; LOCALIZATION; RECOGNITION; SEPARATION; ALGORITHM;
D O I
10.21437/Interspeech.2018-1652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning based time-frequency (T-F) masking has dramatically advanced monaural speech separation and enhancement. This study investigates its potential for robust time difference of arrival (TDOA) estimation in noisy and reverberant environments. Three novel algorithms are proposed to improve the robustness of conventional cross-correlation-, beam-forming- and subspace-based algorithms for speaker localization. The key idea is to leverage the power of deep neural networks (DNN) to accurately identify T-F units that are relatively clean for TDOA estimation. All of the proposed algorithms exhibit strong robustness for TDOA estimation in environments with low input SNR, high reverberation and low direction-to-reverberant energy ratio.
引用
收藏
页码:322 / 326
页数:5
相关论文
共 50 条
  • [41] AF Identification From Time-Frequency Analysis of ECG Signal Using Deep Neural Networks
    Anbalagan, Thivya
    Nath, Malaya Kumar
    IEEE SENSORS LETTERS, 2024, 8 (09)
  • [42] Manufacturing process monitoring using time-frequency representation and transfer learning of deep neural networks
    Liao, Yabin
    Ragai, Ihab
    Huang, Ziyun
    Kerner, Scott
    JOURNAL OF MANUFACTURING PROCESSES, 2021, 68 (68) : 231 - 248
  • [43] Environmental Sound Classification via Time-Frequency Attention and Framewise Self-Attention-Based Deep Neural Networks
    Wu, Bo
    Zhang, Xiao-Ping
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (05) : 3416 - 3428
  • [44] On Using Time-Frequency Binary Masking For Dereverberation
    Mischie, Septimiu
    2013 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2013,
  • [45] Robust time-frequency distributions
    Katkovnik, W
    Djurovic, I
    Stankovic, LJ
    ISSPA 2001: SIXTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2001, : 156 - 157
  • [46] A novel approach for tool condition monitoring based on transfer learning of deep neural networks using time-frequency images
    Li, Yao
    Zhao, Zhengcai
    Fu, Yucan
    Chen, Qingliang
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (03) : 1159 - 1171
  • [47] TIME-FREQUENCY MASKING STRATEGIES FOR SINGLE-CHANNEL LOW-LATENCY SPEECH ENHANCEMENT USING NEURAL NETWORKS
    Parviainen, Mikko
    Pertila, Pasi
    Virtanen, Tuomas
    Grosche, Peter
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 51 - 55
  • [48] Robust Watermarking Algorithm Based on Time-Frequency CHIRPLET
    Deng Minghui
    Zeng Qingshuang
    Yu Song
    MATERIALS SCIENCE AND ENGINEERING, PTS 1-2, 2011, 179-180 : 881 - +
  • [49] Constructing Time-Frequency Dictionaries for Source Separation via Time-Frequency Masking and Source Localisation
    de Frein, Ruairi
    Rickard, Scott T.
    Pearlmutter, Barak A.
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 573 - +
  • [50] Approximation of Time-Frequency Shift Equivariant Maps by Neural Networks
    Lee, Dae Gwan
    MATHEMATICS, 2024, 12 (23)