PREDICTION-BASED LEARNING FOR CONTINUOUS EMOTION RECOGNITION IN SPEECH

Citations: 0
Authors
Han, Jing [1]
Zhang, Zixing [1]
Ringeval, Fabien [2]
Schuller, Bjorn [1, 3]
Affiliations
[1] Univ Passau, Chair Complex Intelligent Syst, Passau, Germany
[2] Univ Grenoble Alpes, Lab Informat Grenoble, Grenoble, France
[3] Imperial Coll London, Dept Comp, London, England
Funding
European Union Seventh Framework Programme
Keywords
Affective computing; hierarchical regression models; support vector regression; long short-term memory; ALGORITHM;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, a prediction-based learning framework is proposed for continuous emotion recognition from speech, which is one of the key components of affective computing in multimedia. The main goal of this framework is to exploit the individual advantages of different regression models as fully as possible by having them cooperate. To this end, we take two widely used regression models as examples, namely support vector regression and the bidirectional long short-term memory recurrent neural network. We connect the two models in a tandem structure in different ways, forming a unified cascaded framework. The outputs predicted by the first model are combined with the original features and used as the input of the following model, which produces the final predictions. Experimental results on a time- and value-continuous spontaneous emotion database (RECOLA) show that the prediction-based learning framework significantly outperforms the individual models for both the arousal and valence dimensions, and provides significantly better results than other state-of-the-art methodologies on this corpus.
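The tandem structure described in the abstract can be illustrated with a minimal sketch. It only shows the data flow (first-stage predictions appended to the original features, then passed to a second model) and is not the authors' implementation. The names `augment`, `tandem_predict`, `toy_first` and `toy_second` are hypothetical, and the two toy functions are arbitrary stand-ins for trained regressors such as support vector regression and a bidirectional LSTM network.

```python
from typing import Callable, List, Sequence

Vector = List[float]
# A "model" here is any callable mapping a sequence of frames to one value per frame.
Model = Callable[[Sequence[Vector]], List[float]]


def augment(features: Sequence[Vector], predictions: Sequence[float]) -> List[Vector]:
    """Append each frame's first-stage prediction to its original feature vector."""
    if len(features) != len(predictions):
        raise ValueError("need exactly one prediction per frame")
    return [list(x) + [p] for x, p in zip(features, predictions)]


def tandem_predict(first: Model, second: Model, features: Sequence[Vector]) -> List[float]:
    """Run the first model, then feed original features plus its outputs to the second."""
    stage1 = first(features)
    return second(augment(features, stage1))


# Hypothetical stand-ins for trained regressors (assumptions, not from the paper).
def toy_first(frames: Sequence[Vector]) -> List[float]:
    return [sum(x) / len(x) for x in frames]  # mean of each frame's features


def toy_second(frames: Sequence[Vector]) -> List[float]:
    return [2.0 * x[-1] for x in frames]  # uses only the appended prediction


final = tandem_predict(toy_first, toy_second, [[1.0, 3.0], [2.0, 2.0]])
```

In practice the second model must be trained on augmented features as well, so the first model's predictions on the training set are needed before the second stage can be fitted.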
Pages: 5005-5009
Number of pages: 5