Speech vs Music Discrimination using Empirical Mode Decomposition

Cited by: 0
Authors
Khonglah, Banriskhem K. [1]
Sharma, Rajib [1]
Prasanna, S. R. Mahadeva [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
Keywords
EMD; IMF; speech; music
DOI
Not available
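Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809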
Abstract
This work explores the use of Empirical Mode Decomposition (EMD) for discriminating speech regions from music regions in audio recordings. The Intrinsic Mode Functions (IMFs) obtained by applying EMD to the audio signal, which represent its different frequency scales, are found to contain discriminatory evidence for distinguishing the speech regions from the music regions. Statistical measures such as the mean, absolute mean, variance, skewness and kurtosis are computed from the various IMFs and investigated for speech vs music discrimination. When these features are used with classifiers such as Support Vector Machines (SVMs) and k-Nearest Neighbour (k-NN) on the Scheirer and Slaney database, the best overall classification accuracy is 90.83% for the SVM and 85.33% for the k-NN.
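The per-IMF statistics named in the abstract can be illustrated with a minimal sketch. It assumes the IMFs have already been produced by some EMD implementation and are passed in as plain lists of samples. The function names `imf_statistics` and `feature_vector` are hypothetical, and the abstract does not say whether the authors use population or sample variance, or excess or plain kurtosis; this sketch assumes population moments and plain (non-excess) kurtosis.

```python
def imf_statistics(imf):
    """Return [mean, absolute mean, variance, skewness, kurtosis] of one IMF.

    Assumption: population (divide-by-n) moments and non-excess kurtosis.
    The abstract does not specify the exact definitions used.
    """
    n = len(imf)
    mean = sum(imf) / n
    abs_mean = sum(abs(x) for x in imf) / n
    # Central moments of order 2, 3 and 4.
    m2 = sum((x - mean) ** 2 for x in imf) / n
    m3 = sum((x - mean) ** 3 for x in imf) / n
    m4 = sum((x - mean) ** 4 for x in imf) / n
    if m2 == 0.0:
        # A constant signal has no defined skewness or kurtosis; use 0.0.
        skewness = 0.0
        kurtosis = 0.0
    else:
        skewness = m3 / m2 ** 1.5
        kurtosis = m4 / m2 ** 2
    return [mean, abs_mean, m2, skewness, kurtosis]


def feature_vector(imfs):
    """Concatenate the five statistics of every IMF into one flat list."""
    features = []
    for imf in imfs:
        features.extend(imf_statistics(imf))
    return features


# Toy stand-ins for IMFs (hypothetical data, not real EMD output).
toy_imfs = [
    [1.0, -1.0, 1.0, -1.0],
    [0.0, 2.0, 0.0, 2.0],
]
print(feature_vector(toy_imfs))
```

A vector of this kind, computed per audio segment, is what would then be passed to a classifier such as an SVM or k-NN.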
Pages: 6