A better decomposition of speech obtained using modified Empirical Mode Decomposition

被引:27
|
作者
Sharma, Rajib [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
EMD; IPs; Mode mixing; Dyadic filterbank; LP; Formants; NOISE; SEPARATION; EXTRACTION; FREQUENCY; DATABASE; DESIGN;
D O I
10.1016/j.dsp.2016.07.012
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The objective of this work is to obtain meaningful time domain components, or Intrinsic Mode Functions (IMFs), of the speech signal, using Empirical Mode Decomposition (EMD), with reduced mode mixing, and in a time-efficient manner. This work focuses on two aspects - firstly, extracting IMFs of the speech signal which can better reflect its higher frequency spectrum; and secondly, to get a better representation and distribution of the vocal tract resonances of the speech signal in its IMFs, compared to that obtained from standard EMD. To this effect, modifications are proposed to the EMD algorithm for processing speech signals, based on the critical nature of the interpolation points (IPs) used for cubic spline interpolation in EMD. The effect of using different sets of IPs, other than the extrema of the residue - as used in standard EMD - is analyzed. It is found that having more IPs is beneficial only upto a certain limit, after which the characteristic dyadic filterbank nature of EMD breaks down. For certain sets of IPs, these modified EMD processes perform better than EMD, giving better frequency separability between the IMFs, and an enhanced representation of the higher frequency content of the signal. A detailed study of the distribution of the formants, in the IMFs of the speech signal, is done using Linear Prediction (LP) analysis of the IMFs. It is found that the IMFs of the EMD variants have a far better distribution of the formants structure within them, with reduced overlapping amongst their filter spectrums, compared to that of standard EMD. Henceforth, when subjected to the task of formants estimation of voiced speech, using LP analysis, the IMFs of the modified EMD processes cumulatively exhibit a superior performance than that of standard EMD, or the speech signal itself, under both dean and noisy conditions. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:26 / 39
页数:14
相关论文
共 50 条
  • [41] Frequency mode identification using modified masking signal-based empirical mode decomposition
    Ray, Papia
    Lenka, Rajesh Kumar
    Biswal, Monalisa
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2019, 13 (08) : 1266 - 1276
  • [42] Comparison of performances of variational mode decomposition and empirical mode decomposition
    Yue, Yingjuan
    Sun, Gang
    Cai, Yanping
    Chen, Ru
    Wang, Xu
    Zhang, Shixiong
    ENERGY SCIENCE AND APPLIED TECHNOLOGY (ESAT 2016), 2016, : 469 - 476
  • [43] Modified Empirical Mode Decomposition Process for Improved Fault Diagnosis
    Parey, Anand
    Pachori, Ram Biles
    8TH IFTOMM INTERNATIONAL CONFERENCE ON ROTOR DYNAMICS (IFTOMM ROTORDYNAMICS 2010), 2010, : 261 - 265
  • [44] A Modified Empirical Mode Decomposition Algorithm in TDLAS for Gas Detection
    Meng, Yunxia
    Liu, Tiegen
    Liu, Kun
    Jiang, Junfeng
    Wang, Ranran
    Wang, Tao
    Hu, Haofeng
    IEEE PHOTONICS JOURNAL, 2014, 6 (06):
  • [45] Bivariate Empirical Mode Decomposition of Speech Signals for Disordered Voices Assessment
    Boubekiria, Kawther
    Kacha, Abdellah
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,
  • [46] Empirical Mode Decomposition: A way for finding Pitch (Stuttered speech signal)
    Raju, N.
    Neelamegam, P.
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (06): : 1030 - 1036
  • [47] Empirical Mode Decomposition For Noise-Robust Automatic Speech Recognition
    Wu, Kuo-Hao
    Chen, Chia-Ping
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2074 - 2077
  • [48] Empirical Mode Decomposition Based Reconstruction of Speech Signal in Noisy Environment
    Goswami, Nisha
    Sarma, Mousmita
    Sarma, Kandarpa Kumar
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2014, : 760 - 765
  • [49] Speech Stream Detection for Noisy Environments Based on Empirical Mode Decomposition
    Tang Qiang
    Zhang Dexiang
    Yan Qing
    ADVANCED DESIGN AND MANUFACTURING TECHNOLOGY III, PTS 1-4, 2013, 397-400 : 2239 - +
  • [50] Noise-robust speech feature processing with empirical mode decomposition
    Wu, Kuo-Hau
    Chen, Chia-Ping
    Yeh, Bing-Feng
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9