A better decomposition of speech obtained using modified Empirical Mode Decomposition

被引:27
|
作者
Sharma, Rajib [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
EMD; IPs; Mode mixing; Dyadic filterbank; LP; Formants; NOISE; SEPARATION; EXTRACTION; FREQUENCY; DATABASE; DESIGN;
D O I
10.1016/j.dsp.2016.07.012
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The objective of this work is to obtain meaningful time domain components, or Intrinsic Mode Functions (IMFs), of the speech signal, using Empirical Mode Decomposition (EMD), with reduced mode mixing, and in a time-efficient manner. This work focuses on two aspects - firstly, extracting IMFs of the speech signal which can better reflect its higher frequency spectrum; and secondly, to get a better representation and distribution of the vocal tract resonances of the speech signal in its IMFs, compared to that obtained from standard EMD. To this effect, modifications are proposed to the EMD algorithm for processing speech signals, based on the critical nature of the interpolation points (IPs) used for cubic spline interpolation in EMD. The effect of using different sets of IPs, other than the extrema of the residue - as used in standard EMD - is analyzed. It is found that having more IPs is beneficial only upto a certain limit, after which the characteristic dyadic filterbank nature of EMD breaks down. For certain sets of IPs, these modified EMD processes perform better than EMD, giving better frequency separability between the IMFs, and an enhanced representation of the higher frequency content of the signal. A detailed study of the distribution of the formants, in the IMFs of the speech signal, is done using Linear Prediction (LP) analysis of the IMFs. It is found that the IMFs of the EMD variants have a far better distribution of the formants structure within them, with reduced overlapping amongst their filter spectrums, compared to that of standard EMD. Henceforth, when subjected to the task of formants estimation of voiced speech, using LP analysis, the IMFs of the modified EMD processes cumulatively exhibit a superior performance than that of standard EMD, or the speech signal itself, under both dean and noisy conditions. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:26 / 39
页数:14
相关论文
共 50 条
  • [1] SPEECH ENHANCEMENT USING ADAPTIVE EMPIRICAL MODE DECOMPOSITION
    Chatlani, Navin
    Soraghan, John J.
    2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 417 - 422
  • [2] Empirical Mode Decomposition for Speech Enhancement
    Bouchair, Asma
    Amrouche, Abderrahmane
    Selouani, Sid-Ahmed
    Hamidia, Mahfoud
    PROCEEDINGS 2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2018, : 653 - 656
  • [3] Speech Enhancement of Color Noise Using Empirical Mode Decomposition
    Koh, Min-sung
    Rodriguez-Marek, Esteban
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1688 - 1692
  • [4] Speech vs Music Discrimination using Empirical Mode Decomposition
    Khonglah, Banriskhem K.
    Sharma, Rajib
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [5] The Modified Bidimensional Empirical Mode Decomposition for Color Image Decomposition
    Ben Arfia, Faten
    Sabri, Abdelouahed
    Ben Messaoud, Mohamed
    Abid, Mohamed
    WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL II, 2011, : 1610 - 1613
  • [6] Image decomposition based on modified Bidimensional Empirical Mode Decomposition
    Ben Arfia, Faten
    Ben Messaoud, Mohamed
    Abid, Mohamed
    THIRD INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2011), 2011, 8009
  • [7] Empirical mode decomposition of voiced speech signal
    Bouzid, A
    Ellouze, N
    ISCCSP : 2004 FIRST INTERNATIONAL SYMPOSIUM ON CONTROL, COMMUNICATIONS AND SIGNAL PROCESSING, 2004, : 603 - 606
  • [8] Empirical Mode Decomposition for Usable Speech Detection
    Ghezaiel, Wajdi
    Ben Slimane, Amel
    Ben Braiek, Ezzedine
    2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 316 - 320
  • [9] Voiced speech analysis by empirical mode decomposition
    Bouzid, Aicha
    Ellouze, Noureddine
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 213 - +
  • [10] Accent Extraction of Emotional Speech based on Modified Ensemble Empirical Mode Decomposition
    Shen, Zhiyuan
    Wang, Qiang
    Shen, Yi
    Jin, Jing
    Lin, Yurong
    2010 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE I2MTC 2010, PROCEEDINGS, 2010,