A systematic comparison of different HMM designs for emotion recognition from acted and spontaneous speech

被引:0
|
作者
Wagner, Johannes [1 ]
Vogt, Thurid [1 ]
Andre, Elisabeth [1 ]
机构
[1] Univ Augsburg, D-8900 Augsburg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we elaborate the use of hidden Markov models (HMMs) for speech emotion recognition as a dynamic alternative to static modelling approaches. Since previous work on this field does not yet define a clear line which HMM design should be prioritised for this task, we run a systematic analysis of different HMM configurations. Furthermore, experiments are carried out on an acted and a spontaneous emotions corpus, since little is known about the suitability of HMMs for spontaneous speech. Additionally, we consider two different segmentation levels, namely words and utterances. Results are compared with the outcome of a support vector machine classifier trained on global statistics features. While for both databases similar performance was observed on utterance level, the HMM-based approach outperformed static classification on word level. However, setting up general guidelines which kind of models are best suited appeared to be rather difficult.
引用
收藏
页码:114 / +
页数:2
相关论文
共 50 条
  • [41] Comparison between two hybrid HMM/MLP approaches in speech recognition
    Fontaine, V
    Ris, C
    Leich, H
    Vantieghem, J
    Accaino, S
    VanCompernolle, D
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3362 - 3365
  • [42] Attentive Convolutional Neural Network based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech
    Neumann, Michael
    Ngoc Thang Vu
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1263 - 1267
  • [43] Speaking Style Adaptation for Spontaneous Speech Recognition Using Multiple-Regression HMM
    Ijima, Yusuke
    Matsubara, Takeshi
    Nose, Takashi
    Kobayashi, Takao
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 548 - 551
  • [44] Exploring Transfer Learning between Scripted and Spontaneous Speech for Emotion Recognition
    Li, Qingqing
    Chaspari, Theodora
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 435 - 439
  • [45] Knowledge-based framework for intelligent emotion recognition in spontaneous speech
    Chakraborty, Rupayan
    Pandharipande, Meghna
    Kopparapu, Sunil Kumar
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 587 - 596
  • [46] Spontaneous Speech Emotion Recognition Using Multiscale Deep Convolutional LSTM
    Zhang, Shiqing
    Zhao, Xiaoming
    Tian, Qi
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (02) : 680 - 688
  • [47] Learning deep multimodal affective features for spontaneous speech emotion recognition
    Zhang, Shiqing
    Tao, Xin
    Chuang, Yuelong
    Zhao, Xiaoming
    SPEECH COMMUNICATION, 2021, 127 : 73 - 81
  • [48] EMOTION RECOGNITION FROM SPONTANEOUS SPEECH USING HIDDEN MARKOV MODELS WITH DEEP BELIEF NETWORKS
    Le, Duc
    Provost, Emily Mower
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 216 - 221
  • [49] Speech emotion recognition using machine learning - A systematic review
    Madanian, Samaneh
    Chen, Talen
    Adeleye, Olayinka
    Templeton, John Michael
    Poellabauer, Christian
    Parry, Dave
    Schneidere, Sandra L.
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20
  • [50] Biologically inspired emotion recognition from speech
    Laura Caponetti
    Cosimo Alessandro Buscicchio
    Giovanna Castellano
    EURASIP Journal on Advances in Signal Processing, 2011