Speech vs Music Discrimination using Empirical Mode Decomposition

被引:0
|
作者
Khonglah, Banriskhem K. [1 ]
Sharma, Rajib [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
EMD; IMF; speech; music;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This work explores the use of Empirical Mode Decomposition (EMD) for discriminating speech regions from music in audio recordings. The different frequency scales or Intrinsic Mode Functions (IMFs) obtained from EMD of the audio signal are found to contain discriminatory evidence for distinguishing the speech regions from the music regions of the audio signal. Different statistical measures like mean, absolute mean, variance, skewness and kurtosis are computed from the various IMFs and investigated for speech vs music discrimination. These features on being used for classification using classifiers like Support Vector Machines (SVMs) and k-Nearest Neighbour (k-NN) on the Scheirer and Slaney database gives the best overall classification accuracy of 90.83% for the SVMs and 85.33% for the k-NN.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Improved Empirical Mode Decomposition Using Optimal Recursive Averaging Noise Estimation for Speech Enhancement
    Bouchair, Asma
    Selouani, Sid Ahmed
    Amrouche, Abderrahmane
    Sidi Yakoub, Mohammed
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (01) : 196 - 223
  • [32] Robust Voiced/Unvoiced Speech Classification using Empirical Mode Decomposition and Periodic Correlation Model
    Molla, Md. Khademul Islam
    Hirose, Keikichi
    Minematsu, Nobuaki
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2530 - +
  • [33] Speech Enhancement Using Sliding Window Empirical Mode Decomposition and Hurst-based Technique
    Poovarasan, Selvaraj
    Chandra, Eswaran
    ARCHIVES OF ACOUSTICS, 2019, 44 (03) : 429 - 437
  • [34] Improved Empirical Mode Decomposition Using Optimal Recursive Averaging Noise Estimation for Speech Enhancement
    Asma Bouchair
    Sid Ahmed Selouani
    Abderrahmane Amrouche
    Mohammed Sidi Yakoub
    Circuits, Systems, and Signal Processing, 2022, 41 : 196 - 223
  • [35] Empirical Mode Decomposition vs. Variational Mode Decomposition on ECG Signal Processing: A Comparative Study
    Maji, Uday
    Pal, Saurabh
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1129 - 1134
  • [36] Pitch Estimation of Noisy Speech using Ensemble Empirical Mode Decomposition and Dominant Harmonic Modification
    Roy, Sujan Kumar
    Zhu, Wei-Ping
    2014 IEEE 27TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2014,
  • [37] Rumor Situation Discrimination Based on Empirical Mode Decomposition Correlation Dimension
    Xin, Yanwen
    Liu, Fengming
    COMPLEXITY, 2021, 2021
  • [38] Bivariate Empirical Mode Decomposition of Speech Signals for Disordered Voices Assessment
    Boubekiria, Kawther
    Kacha, Abdellah
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,
  • [39] Empirical Mode Decomposition: A way for finding Pitch (Stuttered speech signal)
    Raju, N.
    Neelamegam, P.
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (06): : 1030 - 1036
  • [40] Empirical Mode Decomposition For Noise-Robust Automatic Speech Recognition
    Wu, Kuo-Hao
    Chen, Chia-Ping
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2074 - 2077