GLMSNET: SINGLE CHANNEL SPEECH SEPARATION FRAMEWORK IN NOISY AND REVERBERANT ENVIRONMENTS

Cited by: 1
Authors
Shi, Huiyu [1]
Chen, Xi [2]
Kong, Tianlong [1]
Yin, Shouyi [1]
Ouyang, Peng [2]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
[2] AI Lab, Lenovo Res, Beijing, Peoples R China
Keywords
Speech separation; speech enhancement; cocktail party problem; reverberation
DOI
10.1109/ASRU51503.2021.9688217
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
In real noisy and reverberant environments, the performance of current single-channel speech separation algorithms degrades significantly. To address this, this paper proposes a novel speech separation framework called the Graph convolution and Leading global Multi-scale separation network (GLMSnet). A graph convolution network (GCN) is applied to high-level features to model global context and incorporate long-range information, and it can be inserted at any desired position in the network. In addition, global multi-scale convolution is proposed to aggregate features from different levels and improve the audio quality of the separated speech. A leading factor is applied to increase the valid information of the target speech. We evaluate our method on the WHAMR! dataset. The results show that the proposed method achieves state-of-the-art speech separation performance in the presence of noise and reverberation, improving on the previous most advanced model by 22.7%.
Pages: 663-670
Page count: 8