A MAXIMUM LIKELIHOOD APPROACH TO MULTI-OBJECTIVE LEARNING USING GENERALIZED GAUSSIAN DISTRIBUTIONS FOR DNN-BASED SPEECH ENHANCEMENT

被引:0
|
作者
Niu, Shu-Tong [1 ]
Du, Jun [1 ]
Chai, Li [1 ]
Lee, Chin-Hui [2 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
multi-objective learning; maximum likelihood; deep neural network; shape factors update; generalized Gaussian distribution; CONVOLUTIONAL NEURAL-NETWORK; FEATURES;
D O I
10.1109/icassp40776.2020.9053995
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The multi-objective learning using minimum mean squared error criterion for DNN-based speech enhancement (MMSE-MOL-DNN) has been demonstrated to achieve better performance than single output DNN. However, one problem of MMSE-MOL-DNN is that the prediction error values on different targets have a very broad dynamic range, causing difficulty in DNN training. In this paper, we extend the maximum likelihood approach proposed in our previous work [1] to the multi-objective learning for DNN-based speech enhancement (ML-MOL-DNN) to achieve the automatic adjustment of the dynamic range of prediction error values on different targets. The conditional likelihood function to be maximized is derived under the generalized Gaussian distribution (GGD) error model. Moreover, the control of the dynamic range of the prediction error values on different targets is achieved by the scale factors in GGD. Furthermore, we propose a method to update the shape factors automatically utilizing the one-to-one mapping between the kurtosis and shape factor in GGD instead of manual adjustment. The experimental results show that our ML-MOL-DNN can achieve better performance than MMSE-MOL-DNN in terms of different objective measures.
引用
收藏
页码:6229 / 6233
页数:5
相关论文
共 50 条
  • [21] Multi-objective based multi-channel speech enhancement with BiLSTM network
    Cui, Xingyue
    Chen, Zhe
    Yin, Fuliang
    APPLIED ACOUSTICS, 2021, 177
  • [22] Speech Enhancement Using Joint DNN-NMF Model Learned with Multi-Objective Frequency Differential Spectrum Loss Function
    Pashaian, Matin
    Seyedin, Sanaz
    IET SIGNAL PROCESSING, 2024, 2024
  • [23] A Multi-Task Scheme for Supervised DNN-Based Single-Channel Speech Enhancement by Using Speech Presence Probability as the Secondary Training Target
    Wang, Lei
    Zhu, Jie
    Sun, Kangbo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (11) : 1963 - 1970
  • [24] Multi-objective Noisy based Deep Feature Loss for Speech Enhancement
    Pilarczyk, Rafa L.
    Skarbek, Wladyslaw
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2019, 2019, 11176
  • [25] Multi-objective Approach to Speech Enhancement Using Tunable Q-Factor-based Wavelet Transform and ANN Techniques
    Tusar Kanti Dash
    Sandeep Singh Solanki
    Ganapati Panda
    Circuits, Systems, and Signal Processing, 2021, 40 : 6067 - 6097
  • [26] Multi-objective Approach to Speech Enhancement Using Tunable Q-Factor-based Wavelet Transform and ANN Techniques
    Dash, Tusar Kanti
    Solanki, Sandeep Singh
    Panda, Ganapati
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (12) : 6067 - 6097
  • [27] Generative approach using the noise generation models for DNN-based speech synthesis trained from noisy speech
    Une, Masakazu
    Saito, Yuki
    Takamichi, Shinnosuke
    Kitamura, Daichi
    Miyazaki, Ryoichi
    Saruwatari, Hiroshi
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 340 - 344
  • [28] Multi-objective learning based speech enhancement method to increase speech quality and intelligibility for hearing aid device users
    Lai, Ying-Hui
    Zheng, Wei-Zhong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2019, 48 : 35 - 45
  • [29] Multi-objective constrained differential evolution using generalized opposition-based learning
    Wei W.
    Wang J.
    Tao M.
    Yuan H.
    1600, Science Press (53): : 1410 - 1421
  • [30] A multi-objective robust optimization approach based on Gaussian process model
    Zhou, Qi
    Jiang, Ping
    Huang, Xiang
    Zhang, Feng
    Zhou, Taotao
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2018, 57 (01) : 213 - 233