Speech enhancement using modified IMCRA and OMLSA methods

被引:0
|
作者
Tien Dung Tran [1 ]
Quoc Cuong Nguyen [1 ]
Dang Khoa Nguyen [1 ]
机构
[1] Hanoi Univ Technol, Int Res Ctr MICA, Hanoi, Vietnam
关键词
speech enhancement; Mean-Square Error Log-Spectral Amplitude; Improved Minimal Controlled Recursive Averaging; SPECTRAL AMPLITUDE ESTIMATOR;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we present a speech enhancement method in highly non-stationary noise environments based on modified Improved Minimal Controlled Recursive Averaging (IMCRA) method and Optimal Modified Minimum Mean-Square Error Log-Spectral Amplitude (OMLSA) method. The original OMLSA method, the spectral gain function, which minimizes the mean-square error of the log-spectral amplitude, is obtained as a weighted geometric mean of the hypothetical gain associated with the presence uncertainty. Whereas in IMCRA method, noise estimation is given by averaging past spectral value of noisy speech using a smoothing parameter that is adjusted by speech presence probability in frequency domain. A new method is proposed, in which the minimum spectral power value of noisy speech is adjusted by past speech presence probability. In addition, a noise estimation algorithm is proposed for highly non-stationary noise environment. The noise estimate is updated by averaging the noise spectral power estimate of IMCRA method with the past noise spectral power. Evaluations under various environment conditions, especially highly non-stationary noise environment, confirm that the modification of IMCRA and OMLSA method improved the speech quality.
引用
收藏
页码:195 / 200
页数:6
相关论文
共 50 条
  • [11] Speech Enhancement using Spatial Processing and Modified Excitation Source for Underwater Speech Communication
    Ju, Hyung-jun
    Kim, Se-young
    Han, Jung-woo
    Kim, Ki-man
    Kang, Seok-yeb
    EUC 2008: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING, VOL 2, WORKSHOPS, 2008, : 647 - 650
  • [12] Enhancement methods for reverberant speech
    Cole, D
    Moody, M
    Sridharan, S
    ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 383 - 386
  • [13] Classical Tamil Speech Enhancement with Modified Threshold Function using Wavelets
    Indra, J.
    Kasthuri, N.
    Krishnan, Navaneetha S.
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2016, 11 (06) : 1793 - 1801
  • [14] A modified A priori SNR for speech enhancement using spectral subtraction rules
    Hasan, MK
    Salahuddin, S
    Khan, MR
    IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (04) : 450 - 453
  • [15] Multi-sensor speech enhancement using modified LP residual for underwater speech communication
    Ju, Hyung-jun
    Park, Chan-sub
    Bae, Jong-tae
    Choi, Seok-soon
    Kim, Ki-man
    2007 OCEANS, VOLS 1-5, 2007, : 1871 - 1874
  • [16] A Modified Oesophageal Speech Enhancement Using Ephraim-Malah Filter For Robust Speech Recognition
    Babu, C. Ganesh
    Vanathi, P. T.
    Dcruz, Jibby Peter
    RECENT ADVANCES IN NETWORKING, VLSI AND SIGNAL PROCESSING, 2010, : 129 - +
  • [17] Speech Enhancement Using Modified MMSE-LSA and Phase Reconstruction in Voiced and Unvoiced Speech
    Jia, Hairong
    Wang, Weimei
    Wang, Dong
    Zhang, Xueying
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (02)
  • [18] Joint Speech Enhancement and Speaker Identification Using Monte Carlo Methods
    Maina, Ciira Wa
    Walsh, John MacLaren
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1359 - 1362
  • [19] ENVELOPE EXPANSION METHODS FOR SPEECH ENHANCEMENT
    CLARKSON, PM
    BAHGAT, SF
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 89 (03): : 1378 - 1382
  • [20] Comparative Analysis of Speech Enhancement Methods
    Goel, Pankaj
    Saxena, Prateek
    Chandra, Mahesh
    Gupta, V. K.
    2013 TENTH INTERNATIONAL CONFERENCE ON WIRELESS AND OPTICAL COMMUNICATIONS NETWORKS (WOCN), 2013,