Ensemble Learning of Hybrid Acoustic Features for Speech Emotion Recognition

Cited by: 42
Authors
Zvarevashe, Kudakwashe [1]
Olugbara, Oludayo [1]
Affiliations
[1] Durban Univ Technol, South Africa Luban Workshop, ICT & Soc Res Grp, ZA-4001 Durban, South Africa
Keywords
emotion recognition; ensemble algorithm; feature extraction; hybrid feature; machine learning; supervised learning; CLASSIFICATION; PERFORMANCE; SELECTION;
DOI
10.3390/a13030070
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic recognition of emotion is important for facilitating seamless interactivity between a human being and an intelligent robot towards the full realization of a smart society. The methods of signal processing and machine learning are widely applied to recognize human emotions based on features extracted from facial images, video files or speech signals. However, these features were not able to recognize the fear emotion with the same level of precision as other emotions. The authors propose the agglutination of prosodic and spectral features from a group of carefully selected features to realize hybrid acoustic features for improving the task of emotion recognition. Experiments were performed to test the effectiveness of the proposed features extracted from speech files of two public databases and used to train five popular ensemble learning algorithms. Results show that random decision forest ensemble learning of the proposed hybrid acoustic features is highly effective for speech emotion recognition.
Pages: 24
Related papers
50 items in total
  • [41] Learning deep multimodal affective features for spontaneous speech emotion recognition
    Zhang, Shiqing
    Tao, Xin
    Chuang, Yuelong
    Zhao, Xiaoming
    SPEECH COMMUNICATION, 2021, 127 : 73 - 81
  • [42] Acoustic features extraction for emotion recognition
    Rong, Jia
    Chen, Yi-Ping Phoebe
    Chowdhury, Morshed
    Li, Gang
    6TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2007, : 419 - +
  • [43] A hybrid meta-heuristic ensemble based classification technique speech emotion recognition
    Darekar, R. V.
    Chavand, Meena Suhas
    Sharanyaa, S.
    Ranjan, Nihar M.
    ADVANCES IN ENGINEERING SOFTWARE, 2023, 180
  • [44] A computationally efficient speech emotion recognition system employing machine learning classifiers and ensemble learning
    Aishwarya N.
    Kaur K.
    Seemakurthy K.
    International Journal of Speech Technology, 2024, 27 (1) : 239 - 254
  • [45] Pattern recognition and features selection for speech emotion recognition model using deep learning
    Jermsittiparsert, Kittisak
    Abdurrahman, Abdurrahman
    Siriattakul, Parinya
    Sundeeva, Ludmila A.
    Hashim, Wahidah
    Rahim, Robbi
    Maseleno, Andino
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (04) : 799 - 806
  • [46] Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine - Belief Network architecture
    Schuller, B
    Rigoll, G
    Lang, M
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 577 - 580
  • [47] Pattern recognition and features selection for speech emotion recognition model using deep learning
    Kittisak Jermsittiparsert
    Abdurrahman Abdurrahman
    Parinya Siriattakul
    Ludmila A. Sundeeva
    Wahidah Hashim
    Robbi Rahim
    Andino Maseleno
    International Journal of Speech Technology, 2020, 23 : 799 - 806
  • [48] Hybrid deep learning models based emotion recognition with speech signals
    Chowdary, M. Kalpana
    Priya, E. Anu
    Danciulescu, Daniela
    Anitha, J.
    Hemanth, D. Jude
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (04) : 1435 - 1453
  • [49] Acoustic-Prosodic Recognition of Emotion in Speech
    Montenegro, Chuchi S.
    Maravillas, Elmer A.
    2015 INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY,COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2015, : 527 - +
  • [50] SPEECH EMOTION RECOGNITION WITH COMPLEMENTARY ACOUSTIC REPRESENTATIONS
    Zhang, Xiaoming
    Zhang, Fan
    Cui, Xiaodong
    Zhang, Wei
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 846 - 852