DNN-based phase estimation for online speech enhancement

Cited by: 0
Authors
Nguyen, Binh Thien [1]
Wakabayashi, Yukoh [2]
Geng, Yuting [1]
Iwai, Kenta [1]
Nishiura, Takanobu [1]
Affiliations
[1] Ritsumeikan Univ, 1-1-1 Noji Higashi, Kusatsu 5258577, Japan
[2] Toyohashi Univ Technol, 1-1 Hibarigaoka, Tempaku Ku, Toyohashi 4418580, Japan
Keywords
Speech enhancement; Phase estimation; DNN; Convolutional recurrent network; SIGNAL ESTIMATION;
DOI
10.1250/ast.e24.102
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper presents a DNN-based phase reconstruction algorithm for online speech enhancement. Although various online phase reconstruction algorithms have been proposed, many of them rely on the structure of the clean amplitude. This restricts their performance in speech enhancement applications, where only noisy observations are available. In contrast, our proposed method directly estimates the clean phase from the noisy observation. Several aspects of phase reconstruction and their effects on speech enhancement are also investigated and discussed. Experimental results confirm that our method performs better than conventional online phase reconstruction methods for speech enhancement in all experimental settings.
Pages: 186-190
Number of pages: 5
Related papers
50 in total
  • [21] SYNTHETIC DATA FOR DNN-BASED DOA ESTIMATION OF INDOOR SPEECH
    Gelderblom, Femke B.
    Liu, Yi
    Kvam, Johannes
    Myrvoll, Tor Andre
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4390 - 4394
  • [22] DNN-based monaural speech enhancement with temporal and spectral variations equalization
    Kang, Tae Gyoon
    Shin, Jong Won
    Kim, Nam Soo
    DIGITAL SIGNAL PROCESSING, 2018, 74 : 102 - 110
  • [23] DNN-based speech enhancement with self-attention on feature dimension
    Cheng, Jiaming
    Liang, Ruiyu
    Zhao, Li
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32449 - 32470
  • [24] DNN-based speech enhancement with self-attention on feature dimension
    Jiaming Cheng
    Ruiyu Liang
    Li Zhao
    Multimedia Tools and Applications, 2020, 79 : 32449 - 32470
  • [25] An Adaptation Method in Noise Mismatch Conditions for DNN-based Speech Enhancement
    Xu Si-Ying
    Niu Tong
    Qu Dan
    Long Xing-Yan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (10): : 4930 - 4951
  • [26] DNN-Based Arabic Speech Synthesis
    Amrouche, Aissa
    Bentrcia, Youssouf
    Boubakeur, Khadidja Nesrine
    Abed, Ahcene
    2022 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2022), 2022, : 378 - 382
  • [27] DNN-Based Feature Extraction for Conflict Intensity Estimation From Speech
    Gosztolya, Gabor
    Toth, Laszlo
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (12) : 1837 - 1841
  • [28] A NEW COST FUNCTION FOR DNN-BASED SPEECH ENHANCEMENT COMBINING NMF AND CASA
    Yan, Bofang
    Bao, Changchun
    Bai, Zhigang
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 255 - 259
  • [29] Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement
    Cui, Zihao
    Bao, Changchun
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 618 - 622
  • [30] INVERTIBLE DNN-BASED NONLINEAR TIME-FREQUENCY TRANSFORM FOR SPEECH ENHANCEMENT
    Takeuchi, Daiki
    Yatabe, Kohei
    Koizumi, Yuma
    Oikawa, Yasuhiro
    Harada, Noboru
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6644 - 6648