Formant-based Feature Extraction for Emotion Classification from Speech

被引：0

作者：

Kim, Jonathan C. ^{[1
]}

Clements, Mark A. ^{[1
]}

机构：

[1] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2015年

关键词：

affective computing; speech analysis; formant; Gaussian mixture model;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In a previous study, a robust formant-tracking algorithm was introduced to model formant and spectral properties of speech. The algorithm utilizes Gaussian mixtures to estimate spectral parameters, and refines the estimates by using a maximum a posteriori adaptation (MAP) algorithm. In this paper, the formant-tracking algorithm was used to extract the formant-based features for emotion classification. The classification results were compared to a linear predictive coding (LPC) based algorithm for evaluation. On average, the formant features extracted using the algorithm improved the unweighted accuracy by 2.1 percentage points when compared to a LPC-based algorithm. The combination of formant features and other acoustic features statistically significantly improved the unweighted accuracy by 2.7 percentage points, whereas the LPC-based features barely improved it by 1 percentage point. The results clearly indicate that an improved formant-tracking method improved emotion classification accuracy. The effect of formant-based features in emotion classification is also discussed.

引用

页码：477 / 481

页数：5

共 50 条

[41] Formant-based audio synthesis using nonlinear distortion
IRCAM, Paris, France
J Audio Eng Soc, 1-2 (40-47):
[42] Formant-Based English Vowel Assessment For Chinese in Taiwan
Chen, Jiang-Chun
Hsu, Wei-Tang
Jang, J. -S. Roger
Lyu, Ren-Yuan
Chiang, Yuang-Chin
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1375 - +
[43] Audio feature extraction for effective emotion classification
Han E.
Cha H.
IEIE Transactions on Smart Processing and Computing, 2019, 8 (02): : 100 - 107
[44] Pitch- and Formant-Based Order Adaptation of the Fractional Fourier Transform and Its Application to Speech Recognition
Hui Yin
Climent Nadeu
Volker Hohmann
EURASIP Journal on Audio, Speech, and Music Processing, 2009
[45] Speech based emotion classification
Nwe, TL
Wei, FS
De Silva, LC
IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 297 - 301
[46] A Salient Feature Extraction Algorithm for Speech Emotion Recognition
Liang, Ruiyu
Tao, Huawei
Tang, Guichen
Wang, Qingyun
Zhao, Li
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (09): : 1715 - 1718
[47] AUTOMATIC EXTRACTION OF FORMANT FREQUENCIES FROM CONTINUOUS SPEECH
FLANAGAN, JL
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1955, 27 (01): : 207 - 207
[48] A Feature Extraction Scheme Based on Enhanced Wavelet Coefficients for Speech Emotion Recognition
Shahnaz, C.
Sultana, S.
2014 IEEE 57TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2014, : 1093 - 1096
[49] Robust feature extraction for mobile-based speech emotion recognition system
Lee, Kang-Kue
Cho, Youn-Ho
Park, Kyu-Sik
INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 470 - 477
[50] AUTOMATIC EXTRACTION OF FORMANT FREQUENCIES FROM CONTINUOUS SPEECH
FLANAGAN, JL
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1956, 28 (01): : 110 - 118

← 1 2 3 4 5 →