Formant-based Feature Extraction for Emotion Classification from Speech

被引:0
|
作者
Kim, Jonathan C. [1 ]
Clements, Mark A. [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
affective computing; speech analysis; formant; Gaussian mixture model;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In a previous study, a robust formant-tracking algorithm was introduced to model formant and spectral properties of speech. The algorithm utilizes Gaussian mixtures to estimate spectral parameters, and refines the estimates by using a maximum a posteriori adaptation (MAP) algorithm. In this paper, the formant-tracking algorithm was used to extract the formant-based features for emotion classification. The classification results were compared to a linear predictive coding (LPC) based algorithm for evaluation. On average, the formant features extracted using the algorithm improved the unweighted accuracy by 2.1 percentage points when compared to a LPC-based algorithm. The combination of formant features and other acoustic features statistically significantly improved the unweighted accuracy by 2.7 percentage points, whereas the LPC-based features barely improved it by 1 percentage point. The results clearly indicate that an improved formant-tracking method improved emotion classification accuracy. The effect of formant-based features in emotion classification is also discussed.
引用
收藏
页码:477 / 481
页数:5
相关论文
共 50 条
  • [1] Speech emotion recognition based on formant characteristics feature extraction and phoneme type convergence
    Liu, Zhen-Tao
    Rehman, Abdul
    Wu, Min
    Cao, Wei-Hua
    Hao, Man
    Information Sciences, 2021, 563 : 309 - 325
  • [2] AN ON-CHIP FORMANT-BASED SPEECH SYNTHESIZER
    VANESSEN, HA
    TEULING, DJA
    NTZ ARCHIV, 1982, 4 (03): : 75 - 77
  • [3] Speech emotion recognition based on formant characteristics feature extraction and phoneme type convergence q
    Liu, Zhen-Tao
    Rehman, Abdul
    Wu, Min
    Cao, Wei-Hua
    Hao, Man
    INFORMATION SCIENCES, 2021, 563 : 309 - 325
  • [4] Synthesis of unlimited speech in Indian languages using formant-based rules
    Furtado, XA
    Sen, A
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1996, 21 : 345 - 362
  • [5] Evaluation of a formant-based speech-driven lip motion generation
    Ishi, Carlos T.
    Liu, Chaoran
    Ishiguro, Hiroshi
    Hagita, Norihiro
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 114 - 117
  • [6] JOINING OF VOWEL AND SEMIVOWEL MODELS IN LITHUANIAN SPEECH FORMANT-BASED SYNTHESIZER
    Pyz, Grazina
    Simonyte, Virginija
    Slivinskas, Vytautas
    ELECTRICAL AND CONTROL TECHNOLOGIES, 2011, : 114 - +
  • [7] Synthesis of unlimited speech in Indian languages using formant-based rules
    Tata Inst of Fundamental Research, Mumbai, India
    Sadhana, pt 3 (345-362):
  • [8] Dynamic Feature Extraction for Speech Signal Based on Formant Curve and MUSIC
    Han Zhiyan
    Wang Jian
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 403 - 407
  • [9] Nonlinear Dynamic Feature Extraction Based on Phase Space Reconstruction for the Classification of Speech and Emotion
    Sun, Ying
    Zhang, Xue-Ying
    Ma, Jiang-He
    Song, Chun-Xiao
    Lv, Hui-Fen
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [10] IG-based feature extraction and compensation for emotion recognition from speech
    Chuang, ZJ
    Wu, CH
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 358 - 365