Multimodal emotion recognition based on feature selection and extreme learning machine in video clips

Cited by: 0
Authors
Bei Pan
Kaoru Hirota
Zhiyang Jia
Linhui Zhao
Xiaoming Jin
Yaping Dai
Affiliations
[1] Beijing Institute of Technology, School of Automation
[2] Beijing Union University, College of Robotics
[3] Beijing Engineering Research Center of Smart Mechanical Innovation Design Service
Keywords
Emotion recognition; Multimodal fusion; Evolutionary optimization; Feature selection; Extreme learning machine;
DOI
Not available
Abstract
Multimodal fusion-based emotion recognition has attracted increasing attention in affective computing because different modalities provide complementary information. One of the main challenges in designing a reliable and effective model is defining and extracting appropriate emotional features from the different modalities. In this paper, we present a novel multimodal emotion recognition framework for estimating categorical emotions, in which visual and audio signals are used as multimodal input. The model learns neural appearance and key emotion frames using a statistical geometric method, which acts as a pre-processor to save computation power. Discriminative emotion features from the visual and audio modalities are extracted through evolutionary optimization and then fed to optimized extreme learning machine (ELM) classifiers for unimodal emotion recognition. Finally, a decision-level fusion strategy integrates the emotions predicted by the different classifiers to enhance overall performance. The effectiveness of the proposed method is demonstrated on three public datasets: the acted CK+ dataset, the acted Enterface05 dataset, and the spontaneous BAUM-1s dataset.
Average recognition rates of 93.53% on CK+, 91.62% on Enterface05, and 60.77% on BAUM-1s are obtained. The emotion recognition results obtained by fusing the visual and audio predictions are superior to both unimodal recognition and the concatenation of individual features.
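The abstract names two building blocks that can be illustrated compactly: an ELM classifier per modality and decision-level fusion of their outputs. The following is a minimal sketch under stated assumptions, not the authors' implementation. It uses a basic ridge-regularized ELM, an equal-weight sum of softmax scores as the fusion rule, and synthetic toy data in place of real visual and audio features. The evolutionary feature selection and key-frame steps are omitted. The names `ELM`, `softmax`, `fuse_decisions`, and `make_toy_data`, as well as every hyperparameter, are hypothetical.

```python
import numpy as np


class ELM:
    """Basic extreme learning machine: fixed random hidden layer,
    output weights solved by ridge-regularized least squares."""

    def __init__(self, n_hidden=50, reg=1e-3, seed=0):
        self.n_hidden = n_hidden
        self.reg = reg
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Hidden activations; W and b are never trained.
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y, n_classes):
        n_features = X.shape[1]
        self.W = self.rng.normal(size=(n_features, self.n_hidden)) / np.sqrt(n_features)
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        T = np.eye(n_classes)[y]  # one-hot targets
        # beta = (H^T H + reg * I)^{-1} H^T T
        A = H.T @ H + self.reg * np.eye(self.n_hidden)
        self.beta = np.linalg.solve(A, H.T @ T)
        return self

    def scores(self, X):
        # Raw per-class scores, one row per sample.
        return self._hidden(X) @ self.beta


def softmax(Z):
    Z = Z - Z.max(axis=1, keepdims=True)
    E = np.exp(Z)
    return E / E.sum(axis=1, keepdims=True)


def fuse_decisions(score_list, weights):
    """Decision-level fusion (assumed rule): weighted sum of
    per-modality class probabilities, then argmax."""
    fused = sum(w * softmax(s) for w, s in zip(weights, score_list))
    return fused.argmax(axis=1)


def make_toy_data(n=300, n_classes=3, seed=1):
    """Synthetic stand-in for visual (8-D) and audio (5-D) features."""
    rng = np.random.default_rng(seed)
    y = rng.integers(0, n_classes, size=n)
    centers_v = rng.normal(scale=3.0, size=(n_classes, 8))
    centers_a = rng.normal(scale=3.0, size=(n_classes, 5))
    X_visual = centers_v[y] + rng.normal(size=(n, 8))
    X_audio = centers_a[y] + rng.normal(size=(n, 5))
    return X_visual, X_audio, y


# One ELM per modality, then fuse their decisions.
Xv, Xa, y = make_toy_data()
visual_elm = ELM(seed=0).fit(Xv, y, n_classes=3)
audio_elm = ELM(seed=1).fit(Xa, y, n_classes=3)
pred = fuse_decisions(
    [visual_elm.scores(Xv), audio_elm.scores(Xa)], weights=[0.5, 0.5]
)
print("fused training accuracy:", (pred == y).mean())
```

Fusing at the decision level keeps each unimodal classifier independent, so the modalities can have different feature dimensions and one can be retrained without touching the other. Feature concatenation, by contrast, ties both modalities into a single input vector.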
Pages: 1903-1917
Page count: 14