Formant-based Feature Extraction for Emotion Classification from Speech

被引：0

作者：

Kim, Jonathan C. ^{[1
]}

Clements, Mark A. ^{[1
]}

机构：

[1] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2015年

关键词：

affective computing; speech analysis; formant; Gaussian mixture model;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In a previous study, a robust formant-tracking algorithm was introduced to model formant and spectral properties of speech. The algorithm utilizes Gaussian mixtures to estimate spectral parameters, and refines the estimates by using a maximum a posteriori adaptation (MAP) algorithm. In this paper, the formant-tracking algorithm was used to extract the formant-based features for emotion classification. The classification results were compared to a linear predictive coding (LPC) based algorithm for evaluation. On average, the formant features extracted using the algorithm improved the unweighted accuracy by 2.1 percentage points when compared to a LPC-based algorithm. The combination of formant features and other acoustic features statistically significantly improved the unweighted accuracy by 2.7 percentage points, whereas the LPC-based features barely improved it by 1 percentage point. The results clearly indicate that an improved formant-tracking method improved emotion classification accuracy. The effect of formant-based features in emotion classification is also discussed.

引用

页码：477 / 481

页数：5

共 50 条

[1] Speech emotion recognition based on formant characteristics feature extraction and phoneme type convergence
Liu, Zhen-Tao
Rehman, Abdul
Wu, Min
Cao, Wei-Hua
Hao, Man
Information Sciences, 2021, 563 : 309 - 325
[2] AN ON-CHIP FORMANT-BASED SPEECH SYNTHESIZER
VANESSEN, HA
TEULING, DJA
NTZ ARCHIV, 1982, 4 (03): : 75 - 77
[3] Speech emotion recognition based on formant characteristics feature extraction and phoneme type convergence q
Liu, Zhen-Tao
Rehman, Abdul
Wu, Min
Cao, Wei-Hua
Hao, Man
INFORMATION SCIENCES, 2021, 563 : 309 - 325
[4] Synthesis of unlimited speech in Indian languages using formant-based rules
Furtado, XA
Sen, A
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1996, 21 : 345 - 362
[5] Evaluation of a formant-based speech-driven lip motion generation
Ishi, Carlos T.
Liu, Chaoran
Ishiguro, Hiroshi
Hagita, Norihiro
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 114 - 117
[6] JOINING OF VOWEL AND SEMIVOWEL MODELS IN LITHUANIAN SPEECH FORMANT-BASED SYNTHESIZER
Pyz, Grazina
Simonyte, Virginija
Slivinskas, Vytautas
ELECTRICAL AND CONTROL TECHNOLOGIES, 2011, : 114 - +
[7] Synthesis of unlimited speech in Indian languages using formant-based rules
Tata Inst of Fundamental Research, Mumbai, India
Sadhana, pt 3 (345-362):
[8] Dynamic Feature Extraction for Speech Signal Based on Formant Curve and MUSIC
Han Zhiyan
Wang Jian
2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 403 - 407
[9] Nonlinear Dynamic Feature Extraction Based on Phase Space Reconstruction for the Classification of Speech and Emotion
Sun, Ying
Zhang, Xue-Ying
Ma, Jiang-He
Song, Chun-Xiao
Lv, Hui-Fen
MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
[10] IG-based feature extraction and compensation for emotion recognition from speech
Chuang, ZJ
Wu, CH
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 358 - 365

← 1 2 3 4 5 →