A MAXIMUM LIKELIHOOD APPROACH TO MULTI-OBJECTIVE LEARNING USING GENERALIZED GAUSSIAN DISTRIBUTIONS FOR DNN-BASED SPEECH ENHANCEMENT

被引：0

作者：

Niu, Shu-Tong ^{[1
]}

Du, Jun ^{[1
]}

Chai, Li ^{[1
]}

Lee, Chin-Hui ^{[2
]}

机构：

[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

[2] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

multi-objective learning; maximum likelihood; deep neural network; shape factors update; generalized Gaussian distribution; CONVOLUTIONAL NEURAL-NETWORK; FEATURES;

D O I：

10.1109/icassp40776.2020.9053995

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The multi-objective learning using minimum mean squared error criterion for DNN-based speech enhancement (MMSE-MOL-DNN) has been demonstrated to achieve better performance than single output DNN. However, one problem of MMSE-MOL-DNN is that the prediction error values on different targets have a very broad dynamic range, causing difficulty in DNN training. In this paper, we extend the maximum likelihood approach proposed in our previous work [1] to the multi-objective learning for DNN-based speech enhancement (ML-MOL-DNN) to achieve the automatic adjustment of the dynamic range of prediction error values on different targets. The conditional likelihood function to be maximized is derived under the generalized Gaussian distribution (GGD) error model. Moreover, the control of the dynamic range of the prediction error values on different targets is achieved by the scale factors in GGD. Furthermore, we propose a method to update the shape factors automatically utilizing the one-to-one mapping between the kurtosis and shape factor in GGD instead of manual adjustment. The experimental results show that our ML-MOL-DNN can achieve better performance than MMSE-MOL-DNN in terms of different objective measures.

引用

页码：6229 / 6233

页数：5

共 50 条

[31] A multi-objective robust optimization approach based on Gaussian process model
Qi Zhou
Ping Jiang
Xiang Huang
Feng Zhang
Taotao Zhou
Structural and Multidisciplinary Optimization, 2018, 57 : 213 - 233
[32] On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement
Tesch, Kristina
Mohrmann, Nils-Hendrik
Gerkmann, Timo
INTERSPEECH 2022, 2022, : 2908 - 2912
[33] Robust Speech Recognition based on Multi-Objective Learning with GRU Network
Liu, Ming
Wang, Yujun
Yan, Zhaoyu
Wang, Jing
Xie, Xiang
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 181 - 185
[34] Multi-objective Learning and Mask-based Post-processing for Deep Neural Network based Speech Enhancement
Xu, Yong
Du, Jun
Huang, Zhen
Dai, Li-Rong
Lee, Chin-Hui
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1508 - 1512
[35] DNN-BASED SOURCE ENHANCEMENT SELF-OPTIMIZED BY REINFORCEMENT LEARNING USING SOUND QUALITY MEASUREMENTS
Koizumi, Yuma
Niwa, Kenta
Hioka, Yusuke
Kobayashi, Kazunori
Haneda, Yoichi
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 81 - 85
[36] ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning
Yu, Xincheng
Guo, Dongyue
Zhang, Jianwei
Lin, Yi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3365 - 3378
[37] A Multi-objective Generalized Teacher-Learning-Based-Optimization Algorithm
Ram S.D.K.
Srivastava S.
Mishra K.K.
Journal of The Institution of Engineers (India): Series B, 2022, 103 (5) : 1415 - 1430
[38] A Multi-objective Generalized Teacher-Learning-Based-Optimization Algorithm
Ram, Satya Deo Kumar
Srivastava, Shashank
Mishra, K.K.
Journal of The Institution of Engineers (India): Series B, 2022, 103 (05) : 1415 - 1430
[39] Water cycle algorithm based multi-objective contrast enhancement approach
Kaushal, Manisha
Khehra, Baljit Singh
Sharma, Akashdeep
OPTIK, 2017, 140 : 762 - 775
[40] An example-based learning approach to multi-objective programming
Amano, M
Okano, H
MULTI-OBJECTIVE PROGRAMMING AND GOAL PROGRAMMING, 2003, : 87 - 92

← 1 2 3 4 5 →