GLMSNET: SINGLE CHANNEL SPEECH SEPARATION FRAMEWORK IN NOISY AND REVERBERANT ENVIRONMENTS

被引:1
|
作者
Shi, Huiyu [1 ]
Chen, Xi [2 ]
Kong, Tianlong [1 ]
Yin, Shouyi [1 ]
Ouyang, Peng [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] AI Lab, Lenovo Res, Beijing, Peoples R China
关键词
Speech separation; speech enhancement; cock-tail party problem; reverberation;
D O I
10.1109/ASRU51503.2021.9688217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real noisy and reverberant environments, the performance of current single channel speech separation algorithms decreases significantly. Given this situation, this paper proposes a novel speech separation framework, called Graph convolution and Leading global Multi-scale separation network (GLMSnet). The graph convolution network (GCN) is introduced on high-level features for modeling global context and incorporating long-range information, and it can be arbitrarily inserted into the desired position. Furthermore, Global multi-scale convolution is proposed to aggregate different levels features and improve the audio quality of separation. The leading factor is applied to increase valid information of target speech. We evaluate our method on WHAMR! Database. The results show that our proposed method can obtain state-of-the-art speech separation effect in the presence of noise and reverberation. Compared with the most advanced model before, the performance is improved by 22.7%.
引用
收藏
页码:663 / 670
页数:8
相关论文
共 50 条
  • [21] ON END-TO-END MULTI-CHANNEL TIME DOMAIN SPEECH SEPARATION IN REVERBERANT ENVIRONMENTS
    Zhang, Jisi
    Zorila, Catalin
    Doddipatla, Rama
    Barker, Jon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6389 - 6393
  • [22] Multi-branch Learning for Noisy and Reverberant Monaural Speech Separation
    Ma, Chao
    Li, Dongmei
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1247 - 1251
  • [23] WHAM!: Extending Speech Separation to Noisy Environments
    Wichern, Gordon
    Antognini, Joe
    Flynn, Michael
    Zhu, Licheng Richard
    McQuinn, Emmett
    Crow, Dwight
    Manilow, Ethan
    Le Roux, Jonathan
    INTERSPEECH 2019, 2019, : 1368 - 1372
  • [24] Blind speech separation of moving spearers in real reverberant environments
    Koutras, A
    Dermatas, E
    Kokkinakis, G
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1133 - 1136
  • [25] Deep Learning Based Binaural Speech Separation in Reverberant Environments
    Zhang, Xueliang
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1075 - 1084
  • [26] RECURRENT NEURAL NETWORKS FOR COCHANNEL SPEECH SEPARATION IN REVERBERANT ENVIRONMENTS
    Delfarah, Masood
    Wang, DeLiang
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5404 - 5408
  • [27] Experimental study of robust acoustic beamforming for speech acquisition in reverberant and noisy environments
    Zhao, Yingke
    Jensen, Jesper Rindom
    Jensen, Tobias Lindstrom
    Chen, Jingdong
    Christensen, Mads Graesboll
    APPLIED ACOUSTICS, 2020, 170
  • [28] Speech separation based on reliable binaural cues with two-stage neural network in noisy-reverberant environments
    Li, Ruwei
    Li, Tao
    Sun, Xiaoyue
    Sun, Xingwu
    Zhao, Fengnian
    APPLIED ACOUSTICS, 2020, 168
  • [29] A Blind Source Separation Based Approach for Speech Enhancement in Noisy and Reverberant Environment
    Pignotti, Alessio
    Marcozzi, Daniele
    Cifani, Simone
    Squartini, Stefano
    Piazza, Francesco
    CROSS-MODAL ANALYSIS OF SPEECH, GESTURES, GAZE AND FACIAL EXPRESSIONS, 2009, 5641 : 356 - 367
  • [30] Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation
    Wang, Zhong-Qiu
    Wichern, Gordon
    Le Roux, Jonathan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3476 - 3490