DNN-based phase estimation for online speech enhancement

被引：0

作者：

Nguyen, Binh Thien ^{[1
]}

Wakabayashi, Yukoh ^{[2
]}

Geng, Yuting ^{[1
]}

Iwai, Kenta ^{[1
]}

Nishiura, Takanobu ^{[1
]}

机构：

[1] Ritsumeikan Univ, 1-1-1 Noji Higashi, Kusatsu 5258577, Japan

[2] Toyohashi Univ Technol, 1-1 Hibarigaoka,Tempaku Ku, Toyohashi 4418580, Japan

来源：

ACOUSTICAL SCIENCE AND TECHNOLOGY | 2025年 / 46卷 / 02期

关键词：

Speech enhancement; Phase estimation; DNN; Convolutional recurrent network; SIGNAL ESTIMATION;

D O I：

10.1250/ast.e24.102

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a DNN-based phase reconstruction algorithm for online speech enhancement. Although various online phase reconstruction algorithms have been proposed, many of them rely on the structure of the clean amplitude. This restricts their performance in speech enhancement applications, where only noisy observations are available. In contrast, our proposed method directly estimates the clean phase from the noisy observation. Several aspects of phase reconstruction and their effects on speech enhancement are also investigated and discussed. Experimental results confirm that our method performs better than conventional online phase reconstruction methods for speech enhancement in all experimental settings.

引用

页码：186 / 190

页数：5

共 50 条

[21] SYNTHETIC DATA FOR DNN-BASED DOA ESTIMATION OF INDOOR SPEECH
Gelderblom, Femke B.
Liu, Yi
Kvam, Johannes
Myrvoll, Tor Andre
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4390 - 4394
[22] DNN-based monaural speech enhancement with temporal and spectral variations equalization
Kang, Tae Gyoon
Shin, Jong Won
Kim, Nam Soo
DIGITAL SIGNAL PROCESSING, 2018, 74 : 102 - 110
[23] DNN-based speech enhancement with self-attention on feature dimension
Cheng, Jiaming
Liang, Ruiyu
Zhao, Li
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32449 - 32470
[24] DNN-based speech enhancement with self-attention on feature dimension
Jiaming Cheng
Ruiyu Liang
Li Zhao
Multimedia Tools and Applications, 2020, 79 : 32449 - 32470
[25] An Adaptation Method in Noise Mismatch Conditions for DNN-based Speech Enhancement
Xu Si-Ying
Niu Tong
Qu Dan
Long Xing-Yan
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (10): : 4930 - 4951
[26] DNN-Based Arabic Speech Synthesis
Amrouche, Aissa
Bentrcia, Youssouf
Boubakeur, Khadidja Nesrine
Abed, Ahcene
2022 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2022), 2022, : 378 - 382
[27] DNN-Based Feature Extraction for Conflict Intensity Estimation From Speech
Gosztolya, Gabor
Toth, Laszlo
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (12) : 1837 - 1841
[28] A NEW COST FUNCTION FOR DNN-BASED SPEECH ENHANCEMENT COMBINING NMF AND CASA
Yan, Bofang
Bao, Changchun
Bai, Zhigang
PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 255 - 259
[29] Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement
Cui, Zihao
Bao, Changchun
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 618 - 622
[30] INVERTIBLE DNN-BASED NONLINEAR TIME-FREQUENCY TRANSFORM FOR SPEECH ENHANCEMENT
Lakeuchi, Daiki
Yatabe, Kohei
Koizumi, Yuma
Oikawa, Yasuhiro
Harada, Noboru
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6644 - 6648

← 1 2 3 4 5 →