Multi-channel time-frequency data fusion

被引：0

作者：

Aarabi, P ^{[1
]}

Shi, G ^{[1
]}

机构：

[1] Univ Toronto, Edward S Rogers Sr Dept Elect & Comp Engn, Toronto, ON, Canada

来源：

PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL I | 2002年

关键词：

adaptive signal processing; adaptive beam-forming; time-delay of arrival estimation; speech phase; sound localization; time-frequency analysis;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes an efficient mechanism for the fusion of two noisy speech signals obtained by an array of two microphones using single-tap time-frequency filters and by taking into account the correct time delay of arrival (TDOA) of the speech source. Speech signals obtained by the microphones are transformed into a set Of two complex time-frequency (TF) images. By knowing the correct TDOA, and therefore the associated phase difference between the signals at each frequency, it is possible to non-linearly filter both the real and the imaginary parts of the TF images. This will consist of a TF reward-punish filter that adjusts the amplitude of the TF blocks based upon the variation of their phase-difference with the ideal phase-difference defined by the TDOA. Simulation results show that the proposed technique can achieve a Signal-to-Noise Ratio (SNR) improvement of 15dB when there is strong Gaussian noise present (-20dB initial SNR). When the original SNR is 0dB, the simulated improvement is approximately 8dB. It is also shown that although the proposed technique is a more general case of the adaptive beamformer (where the adaptive beamformer has a specific reward-punish characteristic), other reward-punish characteristics that are proposed in this paper can often surpass the performance of the ideal adaptive beamformer.

引用

页码：404 / 411

页数：4

共 50 条

[21] Multi-Channel Attentive Feature Fusion for Radio Frequency Fingerprinting
Zeng, Yuan
Gong, Yi
Liu, Jiawei
Lin, Shangao
Han, Zidong
Cao, Ruoxiao
Huang, Kaibin
Letaief, Khaled B.
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (05) : 4243 - 4254
[22] HYBRID APPROACH FOR MULTICHANNEL SOURCE SEPARATION COMBINING TIME-FREQUENCY MASK WITH MULTI-CHANNEL WIENER FILTER
Araki, Shoko
Nakatani, Tomohiro
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 225 - 228
[23] SIMULTANEOUS OPTIMIZATION OF FORGETTING FACTOR AND TIME-FREQUENCY MASK FOR BLOCK ONLINE MULTI-CHANNEL SPEECH ENHANCEMENT
Togami, Masahito
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2702 - 2706
[24] Time-Frequency Masking Based Online Multi-Channel Speech Enhancement With Convolutional Recurrent Neural Networks
Chakrabarty, Soumitro
Habets, Emanuel A. P.
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (04) : 787 - 799
[25] Adaptive time-frequency data fusion for speech enhancement
Shi, G
Aarabi, P
Lazic, N
FUSION 2003: PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE OF INFORMATION FUSION, VOLS 1 AND 2, 2003, : 394 - 399
[26] Multi-Channel Fusion Attacks
Yang, Wei
Zhou, Yongbin
Cao, Yuchen
Zhang, Hailong
Zhang, Qian
Wang, Huan
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (08) : 1757 - 1771
[27] Modeling and analysis of fatigue detection with multi-channel data fusion
Wenbo Huang
Changyuan Wang
Hong-bo Jia
Pengxiang Xue
Li Wang
The International Journal of Advanced Manufacturing Technology, 2022, 122 : 291 - 301
[28] Modeling and analysis of fatigue detection with multi-channel data fusion
Huang, Wenbo
Wang, Changyuan
Jia, Hong-bo
Xue, Pengxiang
Wang, Li
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 122 (01): : 291 - 301
[29] Color image superresolution using multi-channel data fusion
Zhao, SB
Han, H
Peng, SL
THIRD INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING AND PATTERN RECOGNITION, PTS 1 AND 2, 2003, 5286 : 39 - 44
[30] A Unified Bayesian Model of Time-frequency Clustering and Low-rank Approximation for Multi-channel Source Separation
Itakura, Kousuke
Bando, Yoshiaki
Nakamura, Eita
Itoyama, Katsutoshi
Yoshii, Kazuyoshi
2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2280 - 2284

← 1 2 3 4 5 →