A MAXIMUM LIKELIHOOD APPROACH TO MULTI-OBJECTIVE LEARNING USING GENERALIZED GAUSSIAN DISTRIBUTIONS FOR DNN-BASED SPEECH ENHANCEMENT

Cited by: 0
Authors
Niu, Shu-Tong [1]
Du, Jun [1]
Chai, Li [1]
Lee, Chin-Hui [2]
Affiliations
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
multi-objective learning; maximum likelihood; deep neural network; shape factors update; generalized Gaussian distribution; CONVOLUTIONAL NEURAL-NETWORK; FEATURES;
DOI
10.1109/icassp40776.2020.9053995
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
The multi-objective learning using minimum mean squared error criterion for DNN-based speech enhancement (MMSE-MOL-DNN) has been demonstrated to achieve better performance than single output DNN. However, one problem of MMSE-MOL-DNN is that the prediction error values on different targets have a very broad dynamic range, causing difficulty in DNN training. In this paper, we extend the maximum likelihood approach proposed in our previous work [1] to the multi-objective learning for DNN-based speech enhancement (ML-MOL-DNN) to achieve the automatic adjustment of the dynamic range of prediction error values on different targets. The conditional likelihood function to be maximized is derived under the generalized Gaussian distribution (GGD) error model. Moreover, the control of the dynamic range of the prediction error values on different targets is achieved by the scale factors in GGD. Furthermore, we propose a method to update the shape factors automatically utilizing the one-to-one mapping between the kurtosis and shape factor in GGD instead of manual adjustment. The experimental results show that our ML-MOL-DNN can achieve better performance than MMSE-MOL-DNN in terms of different objective measures.
Pages: 6229-6233
Number of pages: 5
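
Note: the abstract relies on two standard properties of the generalized Gaussian distribution: a likelihood governed by a scale factor and a shape factor, and a one-to-one mapping between kurtosis and the shape factor. The sketch below illustrates both under stated assumptions. It is not the authors' implementation. The function names, the zero-mean error model, the bisection search, and its bounds are all hypothetical choices made for illustration; the abstract does not say how the mapping is inverted.

```python
import math


def ggd_kurtosis(beta):
    """Kurtosis (not excess kurtosis) of a GGD with shape factor beta.

    Standard closed form: Gamma(5/beta) * Gamma(1/beta) / Gamma(3/beta)^2.
    """
    return math.gamma(5.0 / beta) * math.gamma(1.0 / beta) / math.gamma(3.0 / beta) ** 2


def shape_from_kurtosis(kurt, lo=0.3, hi=10.0, tol=1e-10):
    """Invert ggd_kurtosis by bisection (assumed search method and bounds).

    GGD kurtosis decreases monotonically as beta grows, so the root is unique.
    """
    if not ggd_kurtosis(hi) < kurt < ggd_kurtosis(lo):
        raise ValueError("kurtosis outside the range covered by [lo, hi]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if ggd_kurtosis(mid) > kurt:
            lo = mid  # kurtosis still too high, so beta must be larger
        else:
            hi = mid
    return 0.5 * (lo + hi)


def sample_kurtosis(errors):
    """Empirical kurtosis m4 / m2^2 of a list of prediction errors."""
    n = len(errors)
    mean = sum(errors) / n
    m2 = sum((e - mean) ** 2 for e in errors) / n
    m4 = sum((e - mean) ** 4 for e in errors) / n
    return m4 / (m2 * m2)


def ggd_nll(errors, alpha, beta):
    """Average negative log-likelihood of zero-mean GGD errors.

    Density: beta / (2 * alpha * Gamma(1/beta)) * exp(-(|e| / alpha) ** beta),
    where alpha is the scale factor and beta is the shape factor.
    """
    log_norm = math.log(2.0 * alpha) + math.lgamma(1.0 / beta) - math.log(beta)
    return log_norm + sum((abs(e) / alpha) ** beta for e in errors) / len(errors)
```

In this form, beta = 2 recovers the Gaussian case (kurtosis 3) and beta = 1 the Laplacian case (kurtosis 6). A per-target update could compute `sample_kurtosis` on that target's prediction errors and pass the result to `shape_from_kurtosis`, while the scale factor alpha normalizes each target's error range inside `ggd_nll`.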