Speech vs Music Discrimination using Empirical Mode Decomposition

被引：0

作者：

Khonglah, Banriskhem K. ^{[1
]}

Sharma, Rajib ^{[1
]}

Prasanna, S. R. Mahadeva ^{[1
]}

机构：

[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India

来源：

2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC) | 2015年

关键词：

EMD; IMF; speech; music;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This work explores the use of Empirical Mode Decomposition (EMD) for discriminating speech regions from music in audio recordings. The different frequency scales or Intrinsic Mode Functions (IMFs) obtained from EMD of the audio signal are found to contain discriminatory evidence for distinguishing the speech regions from the music regions of the audio signal. Different statistical measures like mean, absolute mean, variance, skewness and kurtosis are computed from the various IMFs and investigated for speech vs music discrimination. These features on being used for classification using classifiers like Support Vector Machines (SVMs) and k-Nearest Neighbour (k-NN) on the Scheirer and Slaney database gives the best overall classification accuracy of 90.83% for the SVMs and 85.33% for the k-NN.

引用

页数：6

共 50 条

[1] Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal
Kumar, Arvind
Chandra, Mahesh
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 33 - 58
[2] Empirical mode decomposition based statistical features for discrimination of speech and low frequency music signal
Arvind Kumar
Mahesh Chandra
Multimedia Tools and Applications, 2023, 82 : 33 - 58
[3] SPEECH ENHANCEMENT USING ADAPTIVE EMPIRICAL MODE DECOMPOSITION
Chatlani, Navin
Soraghan, John J.
2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 417 - 422
[4] A better decomposition of speech obtained using modified Empirical Mode Decomposition
Sharma, Rajib
Prasanna, S. R. Mahadeva
DIGITAL SIGNAL PROCESSING, 2016, 58 : 26 - 39
[5] Empirical Mode Decomposition for Speech Enhancement
Bouchair, Asma
Amrouche, Abderrahmane
Selouani, Sid-Ahmed
Hamidia, Mahfoud
PROCEEDINGS 2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2018, : 653 - 656
[6] Speech Enhancement of Color Noise Using Empirical Mode Decomposition
Koh, Min-sung
Rodriguez-Marek, Esteban
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1688 - 1692
[7] Pitch Estimation of Noisy Speech Signals using Empirical Mode Decomposition
Molla, Md. Khademul Islam
Hirose, Keikichi
Minematsu, Nobuaki
Hasan, Md. Kamrul
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2177 - +
[8] Characterizing Glottal Activity from Speech using Empirical Mode Decomposition
Sharma, Rajib
Prasanna, S. R. Mahadeva
2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
[9] Speaker recognition in an emotionalized spontaneous speech using empirical mode decomposition
Chou, Fu-Hua
Liu, Yu-Shuo
Chiou, Che-Wun
IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 387 - +
[10] Pathological speech signal analysis and classification using empirical mode decomposition
Kaleem, Muhammad
Ghoraani, Behnaz
Guergachi, Aziz
Krishnan, Sridhar
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2013, 51 (07) : 811 - 821

← 1 2 3 4 5 →