A better decomposition of speech obtained using modified Empirical Mode Decomposition

Cited by: 27
Authors
Sharma, Rajib [1 ]
Prasanna, S. R. Mahadeva [1 ]
Affiliation
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
Keywords
EMD; IPs; Mode mixing; Dyadic filterbank; LP; Formants; NOISE; SEPARATION; EXTRACTION; FREQUENCY; DATABASE; DESIGN;
DOI
10.1016/j.dsp.2016.07.012
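Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract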
The objective of this work is to obtain meaningful time domain components, or Intrinsic Mode Functions (IMFs), of the speech signal, using Empirical Mode Decomposition (EMD), with reduced mode mixing, and in a time-efficient manner. This work focuses on two aspects: first, extracting IMFs of the speech signal which better reflect its higher frequency spectrum; and second, obtaining a better representation and distribution of the vocal tract resonances of the speech signal in its IMFs, compared to that obtained from standard EMD. To this effect, modifications are proposed to the EMD algorithm for processing speech signals, based on the critical nature of the interpolation points (IPs) used for cubic spline interpolation in EMD. The effect of using different sets of IPs, other than the extrema of the residue (as used in standard EMD), is analyzed. It is found that having more IPs is beneficial only up to a certain limit, after which the characteristic dyadic filterbank nature of EMD breaks down. For certain sets of IPs, these modified EMD processes perform better than EMD, giving better frequency separability between the IMFs, and an enhanced representation of the higher frequency content of the signal. A detailed study of the distribution of the formants in the IMFs of the speech signal is done using Linear Prediction (LP) analysis of the IMFs. It is found that the IMFs of the EMD variants have a far better distribution of the formant structure within them, with reduced overlap amongst their filter spectra, compared to that of standard EMD. Consequently, when subjected to the task of formant estimation of voiced speech using LP analysis, the IMFs of the modified EMD processes cumulatively exhibit performance superior to that of standard EMD, or of the speech signal itself, under both clean and noisy conditions. © 2016 Elsevier Inc. All rights reserved.
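The role of the interpolation points can be made concrete with a minimal sketch of one sifting iteration of standard EMD, in which the IPs are the local extrema of the signal and the envelopes are cubic splines through them. This is an illustration under stated assumptions, not the authors' implementation: the function name `sift_once` is hypothetical, the choice to anchor both envelopes at the first and last samples is one simple boundary treatment among several, and the alternative IP sets studied in the paper are not specified in the abstract and so are not reproduced here.

```python
import numpy as np
from scipy.interpolate import CubicSpline
from scipy.signal import argrelextrema


def sift_once(x):
    """One sifting iteration of standard EMD on a 1-D array x (hypothetical helper).

    Returns x minus the mean of its upper and lower cubic-spline envelopes,
    or None when there are too few extrema to build the envelopes.
    """
    n = np.arange(len(x))
    # Standard EMD: the interpolation points (IPs) are the local extrema.
    max_idx = argrelextrema(x, np.greater)[0]
    min_idx = argrelextrema(x, np.less)[0]
    if len(max_idx) < 2 or len(min_idx) < 2:
        return None
    # Assumption: anchor both envelopes at the first and last samples so the
    # splines never extrapolate. Other boundary treatments are possible.
    ends = [0, len(x) - 1]
    max_ip = np.unique(np.concatenate((ends, max_idx)))
    min_ip = np.unique(np.concatenate((ends, min_idx)))
    upper = CubicSpline(max_ip, x[max_ip])(n)
    lower = CubicSpline(min_ip, x[min_ip])(n)
    # Subtracting the envelope mean removes the slowly varying local trend.
    return x - 0.5 * (upper + lower)
```

A modified EMD of the kind the abstract describes would, presumably, differ mainly in how `max_ip` and `min_ip` are chosen; full EMD repeats this step until a stopping criterion is met and then iterates on the residue.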
Pages: 26-39
Page count: 14
Related papers
50 in total
  • [21] Telephone-quality Pathological Speech Classification using Empirical Mode Decomposition
    Kaleem, M. F.
    Ghoraani, B.
    Guergachi, A.
    Krishnan, S.
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 7095 - 7098
  • [22] Suppression of Residual Noise From Speech Signals Using Empirical Mode Decomposition
    Hasan, Taufiq
    Hasan, Md. Kamrul
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 2 - 5
  • [23] Bidimensional empirical mode decomposition modified for texture analysis
    Nunes, JC
    Niang, O
    Bouaoune, Y
    Delechelle, E
    Bunel, P
    IMAGE ANALYSIS, PROCEEDINGS, 2003, 2749 : 171 - 177
  • [24] The modified bidimensional empirical mode decomposition for image denoising
    Shen, Minfen
    Tang, Hongrong
    Li, Bin
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 3257 - +
  • [25] SIGNIFICANCE OF MODIFIED EMPIRICAL MODE DECOMPOSITION FOR ECG DENOISING
    Singh, Pratik
    Shahnawazuddin, S.
    Pradhan, Gayadhar
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 2956 - 2959
  • [26] Recognizing human iris by modified empirical mode decomposition
    Lee, Jen-Chun
    Huang, Ping S.
    Tu, Te-Ming
    Chang, Chien-Ping
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2007, 4872 : 298 - +
  • [27] Dysfluent Speech Classification Using Variational Mode Decomposition and Complete Ensemble Empirical Mode Decomposition Techniques With NGCU-Based RNN
    Vinay, N. A.
    Vidyasagar, K. N.
    Rohith, S.
    Supreeth, S.
    Prasad, S. N.
    Kumar, S. Pramod
    Bharathi, S. H.
    IEEE ACCESS, 2024, 12 : 174934 - 174953
  • [28] Adaptive Empirical Mode Decomposition for Signal Enhancement with application to speech
    Chatlani, Navin
    Soraghan, John J.
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 101 - 104
  • [29] Reconstruction Of Speech Signal Using Empirical Mode Decomposition Based Glottal Source Extraction
    Goswami, Nisha
    Sarma, Mousmita
    Sarma, Kandarpa Kumar
    2013 1ST INTERNATIONAL CONFERENCE ON EMERGING TRENDS AND APPLICATIONS IN COMPUTER SCIENCE (ICETACS), 2013, : 27 - 32
  • [30] Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network
    Mohammed Sidi Yakoub
    Sid-ahmed Selouani
    Brahim-Fares Zaidi
    Asma Bouchair
    EURASIP Journal on Audio, Speech, and Music Processing, 2020