Speech Emotion Recognition Based on Dynamic Models

被引:0
|
作者
Lv, Guoyun [1 ]
Hu, Shuixian [2 ]
Lu, Xipan [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian, Peoples R China
[2] Chinese Aeronaut Radio Elect Res Inst, Shanghai, Peoples R China
关键词
emotion recognition; dynamic bayesian network; dynamic model; hidden markov model;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper introduced the semi-continuous Hidden Markov Model (HMM) and proposed a novel Dynamic Bayesian Network (DBN) model for dynamic speech emotion recognition. The former reduces the training complexity caused by mixture Gaussians by sharing the Condition Probability Densities (CPDs) of Gaussians among the states, and the latter adds a sub-state layer between state and observation layer based on traditional DBN framework and describes the dynamic process of speech emotion in detail. Experiments results show that average emotion recognition rate of semi-continuous HMM is 4% and 10% higher than those of classical HMM and Mixture Gaussian HMM respectively, and average emotion recognition rate of the three-layer DBN model is 11% and 8% higher than those of traditional DBN model and semi-continuous HMM.
引用
收藏
页码:480 / 484
页数:5
相关论文
共 50 条
  • [41] Value leadership of large language models based on speech recognition and emotion classification algorithms
    Liu, Huifang
    Ye, Yunfeng
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2024, 24 (04) : 201 - 211
  • [42] Ensemble Classifier based on Decision-Fusion of Multiple Models for Speech Emotion Recognition
    Noh, Kyoungju
    Lim, Jiyoun
    Chung, Seungeun
    Kim, Gague
    Jeong, Hyuntae
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1246 - 1248
  • [43] Speech Emotion Recognition
    Lalitha, S.
    Madhavan, Abhishek
    Bhushan, Bharath
    Saketh, Srinivas
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2014,
  • [44] COMBINING SPEECH RECOGNITION AND ACOUSTIC WORD EMOTION MODELS FOR ROBUST TEXT-INDEPENDENT EMOTION RECOGNITION
    Schuller, Bjoern
    Vlasenko, Bogdan
    Arsic, Dejan
    Rigoll, Gerhard
    Wendemuth, Andreas
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1333 - +
  • [45] ATDA: Attentional temporal dynamic activation for speech emotion recognition
    Liu, Lu-Yao
    Liu, Wen-Zhe
    Zhou, Jian
    Deng, Hui-Yuan
    Feng, Lin
    KNOWLEDGE-BASED SYSTEMS, 2022, 243
  • [46] Speech Emotion Recognition Using Hybrid Generative and Discriminative Models
    Huang, Yongming
    Zhang, Guobao
    Dong, Fei
    Li, Yue
    Da, Feipeng
    PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (3B): : 105 - 108
  • [47] Benchmarking Pretrained Models for Speech Emotion Recognition: A Focus on Xception
    Hassan, Ahmed
    Masood, Tehreem
    Ahmed, Hassan A.
    Shahzad, H. M.
    Khushi, Hafiz Muhammad Tayyab
    COMPUTERS, 2024, 13 (12)
  • [48] Emotion Prompting for Speech Emotion Recognition
    Zhou, Xingfa
    Li, Min
    Yang, Lan
    Sun, Rui
    Wang, Xin
    Zhan, Huayi
    INTERSPEECH 2023, 2023, : 3108 - 3112
  • [49] Speech-based Emotion Recognition and Speaker Identification: Static vs. Dynamic Mode of Speech Representation
    Sidorov, Maxim
    Minker, Wolfgang
    Semenkin, Eugene S.
    JOURNAL OF SIBERIAN FEDERAL UNIVERSITY-MATHEMATICS & PHYSICS, 2016, 9 (04): : 518 - 523
  • [50] Speech recognition using linear dynamic models
    Frankel, Joe
    King, Simon
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 246 - 256