Multimodal emotion recognition based on feature selection and extreme learning machine in video clips

被引:0
|
作者
Bei Pan
Kaoru Hirota
Zhiyang Jia
Linhui Zhao
Xiaoming Jin
Yaping Dai
机构
[1] Beijing Institute of Technology,School of Automation
[2] Beijing Union University,College of Robotics
[3] Beijing Engineering Research Center of Smart Mechanical Innovation Design Service,undefined
关键词
Emotion recognition; Multimodal fusion; Evolutionary optimization; Feature selection; Extreme learning machine;
D O I
暂无
中图分类号
学科分类号
摘要
Multimodal fusion-based emotion recognition has attracted increasing attention in affective computing because different modalities can achieve information complementation. One of the main challenges for reliable and effective model design is to define and extract appropriate emotional features from different modalities. In this paper, we present a novel multimodal emotion recognition framework to estimate categorical emotions, where visual and audio signals are utilized as multimodal input. The model learns neural appearance and key emotion frame using a statistical geometric method, which acts as a pre-processer for saving computation power. Discriminative emotion features expressed from visual and audio modalities are extracted through evolutionary optimization, and then fed to the optimized extreme learning machine (ELM) classifiers for unimodal emotion recognition. Finally, a decision-level fusion strategy is applied to integrate the results of predicted emotions by the different classifiers to enhance the overall performance. The effectiveness of the proposed method is demonstrated through three public datasets, i.e., the acted CK+ dataset, the acted Enterface05 dataset, and the spontaneous BAUM-1s dataset. An average recognition rate of 93.53%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on CK+, 91.62%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on Enterface05, and 60.77%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on BAUM-1s are obtained. The emotion recognition results acquired by fusing visual and audio predicted emotions are superior to both recognition of unimodality and concatenation of individual features.
引用
收藏
页码:1903 / 1917
页数:14
相关论文
共 50 条
  • [31] Facial Region Segmentation Based Emotion Recognition Using Extreme Learning Machine
    Islam, Bayezid
    Mahmud, Firoz
    Hossain, Arfat
    2018 INTERNATIONAL CONFERENCE ON ADVANCEMENT IN ELECTRICAL AND ELECTRONIC ENGINEERING (ICAEEE), 2018,
  • [32] Audio-Video Based Multimodal Emotion Recognition Using SVMs and Deep Learning
    Sun, Bo
    Xu, Qihua
    He, Jun
    Yu, Lejun
    Li, Liandong
    Wei, Qinglan
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 621 - 631
  • [33] Emotion recognition based on EEG features in movie clips with channel selection
    Özerdem M.S.
    Polat H.
    Brain Informatics, 2017, 4 (04) : 241 - 252
  • [34] Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition
    Xu, Xinzhou
    Deng, Jun
    Coutinho, Eduardo
    Wu, Chen
    Zhao, Li
    Schuller, Bjoern W.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (03) : 795 - 808
  • [35] Emotion recognition based on sparse learning feature selection method for social communication
    Yixin Yan
    Chenyang Li
    Shaoliang Meng
    Signal, Image and Video Processing, 2019, 13 : 1253 - 1257
  • [36] Emotion recognition based on sparse learning feature selection method for social communication
    Yan, Yixin
    Li, Chenyang
    Meng, Shaoliang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2019, 13 (07) : 1253 - 1257
  • [37] Audio-Visual Emotion Recognition in Video Clips
    Noroozi, Fatemeh
    Marjanovic, Marina
    Njegus, Angelina
    Escalera, Sergio
    Anbarjafari, Gholamreza
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (01) : 60 - 75
  • [38] A Framework of Human Emotion Recognition Using Extreme Learning Machine
    Utama, Prasetia
    Widodo
    Ajie, Hamidillah
    2014 INTERNATIONAL CONFERENCE OF ADVANCED INFORMATICS: CONCEPT, THEORY AND APPLICATION (ICAICTA), 2014, : 315 - 320
  • [39] Graph Classification Based on Sparse Graph Feature Selection and Extreme Learning Machine
    Yu, Yajun
    Pan, Zhisong
    Hu, Guyu
    PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 179 - 191
  • [40] A fast feature selection approach based on extreme learning machine and coefficient of variation
    Ertugrul, Omer Faruk
    Tagluk, Mehmet Emin
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (04) : 3409 - 3420