DNN-based phase estimation for online speech enhancement

被引:0
|
作者
Nguyen, Binh Thien [1 ]
Wakabayashi, Yukoh [2 ]
Geng, Yuting [1 ]
Iwai, Kenta [1 ]
Nishiura, Takanobu [1 ]
机构
[1] Ritsumeikan Univ, 1-1-1 Noji Higashi, Kusatsu 5258577, Japan
[2] Toyohashi Univ Technol, 1-1 Hibarigaoka,Tempaku Ku, Toyohashi 4418580, Japan
关键词
Speech enhancement; Phase estimation; DNN; Convolutional recurrent network; SIGNAL ESTIMATION;
D O I
10.1250/ast.e24.102
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a DNN-based phase reconstruction algorithm for online speech enhancement. Although various online phase reconstruction algorithms have been proposed, many of them rely on the structure of the clean amplitude. This restricts their performance in speech enhancement applications, where only noisy observations are available. In contrast, our proposed method directly estimates the clean phase from the noisy observation. Several aspects of phase reconstruction and their effects on speech enhancement are also investigated and discussed. Experimental results confirm that our method performs better than conventional online phase reconstruction methods for speech enhancement in all experimental settings.
引用
收藏
页码:186 / 190
页数:5
相关论文
共 50 条
  • [1] Online Multichannel Speech Enhancement Based on Recursive EM and DNN-Based Speech Presence Estimation
    Martin-Donas, Juan Manuel
    Jensen, Jesper
    Tan, Zheng-Hua
    Gomez, Angel M.
    Peinado, Antonio M.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 3080 - 3094
  • [2] DNN-BASED ENHANCEMENT OF NOISY AND REVERBERANT SPEECH
    Zhao, Yan
    Wang, DeLiang
    Merks, Ivo
    Zhang, Tao
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6525 - 6529
  • [3] Online Phase Reconstruction via DNN-based Phase Differences Estimation
    Masuyama, Yoshiki
    Yatabe, Kohei
    Nagatomo, Kento
    Oikawa, Yasuhiro
    arXiv, 2022,
  • [4] Online Phase Reconstruction via DNN-Based Phase Differences Estimation
    Masuyama, Yoshiki
    Yatabe, Kohei
    Nagatomo, Kento
    Oikawa, Yasuhiro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 163 - 176
  • [5] DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
    Furnon, Nicolas
    Serizel, Romain
    Illina, Irina
    Essid, Slim
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4672 - 4676
  • [6] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
    Huang, Qizheng
    Bao, Changchun
    Wang, Xianyun
    Xiang, Yang
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
  • [7] DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
    Elshamy, Samy
    Fingscheidt, Tim
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1803 - 1814
  • [8] DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays
    Furnon, Nicolas
    Serizel, Romain
    Essid, Slim
    Illina, Irina
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2310 - 2323
  • [9] DNN-BASED SPEECH MASK ESTIMATION FOR EIGENVECTOR BEAMFORMING
    Pfeifenberger, Lukas
    Zoehrer, Matthias
    Pernkopf, Franz
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 66 - 70
  • [10] DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation
    Feng, Xinyang
    Li, Nuo
    He, Zunwen
    Zhang, Yan
    Zhang, Wancheng
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 541 - 545