DNN-based phase estimation for online speech enhancement

被引：0

作者：

Nguyen, Binh Thien ^{[1
]}

Wakabayashi, Yukoh ^{[2
]}

Geng, Yuting ^{[1
]}

Iwai, Kenta ^{[1
]}

Nishiura, Takanobu ^{[1
]}

机构：

[1] Ritsumeikan Univ, 1-1-1 Noji Higashi, Kusatsu 5258577, Japan

[2] Toyohashi Univ Technol, 1-1 Hibarigaoka,Tempaku Ku, Toyohashi 4418580, Japan

来源：

ACOUSTICAL SCIENCE AND TECHNOLOGY | 2025年 / 46卷 / 02期

关键词：

Speech enhancement; Phase estimation; DNN; Convolutional recurrent network; SIGNAL ESTIMATION;

D O I：

10.1250/ast.e24.102

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a DNN-based phase reconstruction algorithm for online speech enhancement. Although various online phase reconstruction algorithms have been proposed, many of them rely on the structure of the clean amplitude. This restricts their performance in speech enhancement applications, where only noisy observations are available. In contrast, our proposed method directly estimates the clean phase from the noisy observation. Several aspects of phase reconstruction and their effects on speech enhancement are also investigated and discussed. Experimental results confirm that our method performs better than conventional online phase reconstruction methods for speech enhancement in all experimental settings.

引用

页码：186 / 190

页数：5

共 50 条

[1] Online Multichannel Speech Enhancement Based on Recursive EM and DNN-Based Speech Presence Estimation
Martin-Donas, Juan Manuel
Jensen, Jesper
Tan, Zheng-Hua
Gomez, Angel M.
Peinado, Antonio M.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 3080 - 3094
[2] DNN-BASED ENHANCEMENT OF NOISY AND REVERBERANT SPEECH
Zhao, Yan
Wang, DeLiang
Merks, Ivo
Zhang, Tao
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6525 - 6529
[3] Online Phase Reconstruction via DNN-based Phase Differences Estimation
Masuyama, Yoshiki
Yatabe, Kohei
Nagatomo, Kento
Oikawa, Yasuhiro
arXiv, 2022,
[4] Online Phase Reconstruction via DNN-Based Phase Differences Estimation
Masuyama, Yoshiki
Yatabe, Kohei
Nagatomo, Kento
Oikawa, Yasuhiro
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 163 - 176
[5] DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
Furnon, Nicolas
Serizel, Romain
Illina, Irina
Essid, Slim
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4672 - 4676
[6] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
Huang, Qizheng
Bao, Changchun
Wang, Xianyun
Xiang, Yang
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
[7] DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
Elshamy, Samy
Fingscheidt, Tim
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1803 - 1814
[8] DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays
Furnon, Nicolas
Serizel, Romain
Essid, Slim
Illina, Irina
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2310 - 2323
[9] DNN-BASED SPEECH MASK ESTIMATION FOR EIGENVECTOR BEAMFORMING
Pfeifenberger, Lukas
Zoehrer, Matthias
Pernkopf, Franz
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 66 - 70
[10] DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation
Feng, Xinyang
Li, Nuo
He, Zunwen
Zhang, Yan
Zhang, Wancheng
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 541 - 545

← 1 2 3 4 5 →