Speech vs Music Discrimination using Empirical Mode Decomposition

被引:0
|
作者
Khonglah, Banriskhem K. [1 ]
Sharma, Rajib [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
EMD; IMF; speech; music;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This work explores the use of Empirical Mode Decomposition (EMD) for discriminating speech regions from music in audio recordings. The different frequency scales or Intrinsic Mode Functions (IMFs) obtained from EMD of the audio signal are found to contain discriminatory evidence for distinguishing the speech regions from the music regions of the audio signal. Different statistical measures like mean, absolute mean, variance, skewness and kurtosis are computed from the various IMFs and investigated for speech vs music discrimination. These features on being used for classification using classifiers like Support Vector Machines (SVMs) and k-Nearest Neighbour (k-NN) on the Scheirer and Slaney database gives the best overall classification accuracy of 90.83% for the SVMs and 85.33% for the k-NN.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network
    Mohammed Sidi Yakoub
    Sid-ahmed Selouani
    Brahim-Fares Zaidi
    Asma Bouchair
    EURASIP Journal on Audio, Speech, and Music Processing, 2020
  • [22] Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network
    Yakoub, Mohammed
    Selouani, Sid-ahmed
    Zaidi, Brahim-Fares
    Bouchair, Asma
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2020, 2020 (01)
  • [23] Reconstruction Of Speech Signal Using Empirical Mode Decomposition Based Glottal Source Extraction
    Goswami, Nisha
    Sarma, Mousmita
    Sarma, Kandarpa Kumar
    2013 1ST INTERNATIONAL CONFERENCE ON EMERGING TRENDS AND APPLICATIONS IN COMPUTER SCIENCE (ICETACS), 2013, : 27 - 32
  • [24] Speech enhancement using empirical mode decomposition and the Teager-Kaiser energy operator
    Khaldi, Kais
    Boudraa, Abdel-Ouahab
    Komaty, Ali
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (01): : 451 - 459
  • [25] Tempo Induction from Music Recordings Using Ensemble Empirical Mode Decomposition Analysis
    Trohidis, Konstantinos
    Hadjileontiadis, Leontios
    COMPUTER MUSIC JOURNAL, 2011, 35 (04) : 83 - 97
  • [26] Adaptive Empirical Mode Decomposition for Signal Enhancement with application to speech
    Chatlani, Navin
    Soraghan, John J.
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 101 - 104
  • [27] Discrimination between Ictal and Seizure-Free EEG Signals Using Empirical Mode Decomposition
    Pachori, Ram Bilas
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2008, 2008
  • [28] Dysfluent Speech Classification Using Variational Mode Decomposition and Complete Ensemble Empirical Mode Decomposition Techniques With NGCU-Based RNN
    Vinay, N. A.
    Vidyasagar, K. N.
    Rohith, S.
    Supreeth, S.
    Prasad, S. N.
    Kumar, S. Pramod
    Bharathi, S. H.
    IEEE ACCESS, 2024, 12 : 174934 - 174953
  • [29] Speech and Music Discrimination Using Spectral Transition Rate
    Yang, Kyong-Chul
    Bang, Yong-Chan
    Cho, Sun-Ho
    Yook, Dongsuk
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (03): : 273 - 278
  • [30] On the Discrimination of Speech/Music using a Time Series Regularity
    Swe, Ei Mon Mon
    Pwint, Moe
    Sattar, Farook
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 53 - +