Multi-channel time-frequency data fusion

被引:0
|
作者
Aarabi, P [1 ]
Shi, G [1 ]
机构
[1] Univ Toronto, Edward S Rogers Sr Dept Elect & Comp Engn, Toronto, ON, Canada
关键词
adaptive signal processing; adaptive beam-forming; time-delay of arrival estimation; speech phase; sound localization; time-frequency analysis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an efficient mechanism for the fusion of two noisy speech signals obtained by an array of two microphones using single-tap time-frequency filters and by taking into account the correct time delay of arrival (TDOA) of the speech source. Speech signals obtained by the microphones are transformed into a set Of two complex time-frequency (TF) images. By knowing the correct TDOA, and therefore the associated phase difference between the signals at each frequency, it is possible to non-linearly filter both the real and the imaginary parts of the TF images. This will consist of a TF reward-punish filter that adjusts the amplitude of the TF blocks based upon the variation of their phase-difference with the ideal phase-difference defined by the TDOA. Simulation results show that the proposed technique can achieve a Signal-to-Noise Ratio (SNR) improvement of 15dB when there is strong Gaussian noise present (-20dB initial SNR). When the original SNR is 0dB, the simulated improvement is approximately 8dB. It is also shown that although the proposed technique is a more general case of the adaptive beamformer (where the adaptive beamformer has a specific reward-punish characteristic), other reward-punish characteristics that are proposed in this paper can often surpass the performance of the ideal adaptive beamformer.
引用
收藏
页码:404 / 411
页数:4
相关论文
共 50 条
  • [21] Multi-Channel Attentive Feature Fusion for Radio Frequency Fingerprinting
    Zeng, Yuan
    Gong, Yi
    Liu, Jiawei
    Lin, Shangao
    Han, Zidong
    Cao, Ruoxiao
    Huang, Kaibin
    Letaief, Khaled B.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (05) : 4243 - 4254
  • [22] HYBRID APPROACH FOR MULTICHANNEL SOURCE SEPARATION COMBINING TIME-FREQUENCY MASK WITH MULTI-CHANNEL WIENER FILTER
    Araki, Shoko
    Nakatani, Tomohiro
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 225 - 228
  • [23] SIMULTANEOUS OPTIMIZATION OF FORGETTING FACTOR AND TIME-FREQUENCY MASK FOR BLOCK ONLINE MULTI-CHANNEL SPEECH ENHANCEMENT
    Togami, Masahito
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2702 - 2706
  • [24] Time-Frequency Masking Based Online Multi-Channel Speech Enhancement With Convolutional Recurrent Neural Networks
    Chakrabarty, Soumitro
    Habets, Emanuel A. P.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (04) : 787 - 799
  • [25] Adaptive time-frequency data fusion for speech enhancement
    Shi, G
    Aarabi, P
    Lazic, N
    FUSION 2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE OF INFORMATION FUSION, VOLS 1 AND 2, 2003, : 394 - 399
  • [26] Multi-Channel Fusion Attacks
    Yang, Wei
    Zhou, Yongbin
    Cao, Yuchen
    Zhang, Hailong
    Zhang, Qian
    Wang, Huan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (08) : 1757 - 1771
  • [27] Modeling and analysis of fatigue detection with multi-channel data fusion
    Wenbo Huang
    Changyuan Wang
    Hong-bo Jia
    Pengxiang Xue
    Li Wang
    The International Journal of Advanced Manufacturing Technology, 2022, 122 : 291 - 301
  • [28] Modeling and analysis of fatigue detection with multi-channel data fusion
    Huang, Wenbo
    Wang, Changyuan
    Jia, Hong-bo
    Xue, Pengxiang
    Wang, Li
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 122 (01): : 291 - 301
  • [29] Color image superresolution using multi-channel data fusion
    Zhao, SB
    Han, H
    Peng, SL
    THIRD INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING AND PATTERN RECOGNITION, PTS 1 AND 2, 2003, 5286 : 39 - 44
  • [30] A Unified Bayesian Model of Time-frequency Clustering and Low-rank Approximation for Multi-channel Source Separation
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2280 - 2284