Monaural Source Separation in Complex Domain With Long Short-Term Memory Neural Network

Cited by: 29
Authors
Sun, Yang [1]
Xian, Yang [1]
Wang, Wenwu [2]
Naqvi, Syed Mohsen [1]
Affiliations
[1] Newcastle Univ, Sch Engn, Intelligent Sensing & Commun Res Grp, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[2] Univ Surrey, Ctr Vis Speech & Signal Proc, Dept Elect & Elect Engn, Surrey GU2 7XH, England
Keywords
Deep neural networks; monaural speech separation; long short-term memory; complex signal approximation; speech dereverberation; masking; recognition; features; noise
DOI
10.1109/JSTSP.2019.2908760
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification codes
0808; 0809
Abstract
In recent research, deep neural networks (DNNs) have been used to solve the monaural source separation problem. According to their training objectives, DNN-based monaural speech separation methods fall into three categories: masking, mapping, and signal approximation based techniques. However, the performance of the traditional methods is not robust due to variations in real-world environments. Besides, in the vanilla DNN-based methods, the temporal information cannot be fully utilized. Therefore, in this paper, the long short-term memory (LSTM) neural network is applied to exploit the long-term speech contexts. Then, we propose the complex signal approximation (cSA), which operates in the complex domain to utilize the phase information of the desired speech signal to improve the separation performance. The IEEE and the TIMIT corpora are used to generate mixtures with noise and speech interferences to evaluate the efficacy of the proposed method. The experimental results demonstrate the advantages of the proposed cSA-based LSTM recurrent neural network method in terms of different objective performance measures.
Pages: 359-369
Page count: 11