Multimodal emotion recognition based on feature selection and extreme learning machine in video clips

被引：0

作者：

Bei Pan

Kaoru Hirota

Zhiyang Jia

Linhui Zhao

Xiaoming Jin

Yaping Dai

机构：

[1] Beijing Institute of Technology,School of Automation

[2] Beijing Union University,College of Robotics

[3] Beijing Engineering Research Center of Smart Mechanical Innovation Design Service,undefined

来源：

Journal of Ambient Intelligence and Humanized Computing | 2023年 / 14卷

关键词：

Emotion recognition; Multimodal fusion; Evolutionary optimization; Feature selection; Extreme learning machine;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Multimodal fusion-based emotion recognition has attracted increasing attention in affective computing because different modalities can achieve information complementation. One of the main challenges for reliable and effective model design is to define and extract appropriate emotional features from different modalities. In this paper, we present a novel multimodal emotion recognition framework to estimate categorical emotions, where visual and audio signals are utilized as multimodal input. The model learns neural appearance and key emotion frame using a statistical geometric method, which acts as a pre-processer for saving computation power. Discriminative emotion features expressed from visual and audio modalities are extracted through evolutionary optimization, and then fed to the optimized extreme learning machine (ELM) classifiers for unimodal emotion recognition. Finally, a decision-level fusion strategy is applied to integrate the results of predicted emotions by the different classifiers to enhance the overall performance. The effectiveness of the proposed method is demonstrated through three public datasets, i.e., the acted CK+ dataset, the acted Enterface05 dataset, and the spontaneous BAUM-1s dataset. An average recognition rate of 93.53%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on CK+, 91.62%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on Enterface05, and 60.77%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} on BAUM-1s are obtained. The emotion recognition results acquired by fusing visual and audio predicted emotions are superior to both recognition of unimodality and concatenation of individual features.

引用

页码：1903 / 1917

页数：14

共 50 条

[41] An enhanced Harris hawk optimizer based on extreme learning machine for feature selection
Abdullah Alzaqebah
Omar Al-Kadi
Ibrahim Aljarah
Progress in Artificial Intelligence, 2023, 12 : 77 - 97
[42] Graph classification based on sparse graph feature selection and extreme learning machine
Yu, Yajun
Pan, Zhisong
Hu, Guyu
Ren, Huifeng
NEUROCOMPUTING, 2017, 261 : 20 - 27
[43] An enhanced Harris hawk optimizer based on extreme learning machine for feature selection
Alzaqebah, Abdullah
Al-Kadi, Omar
Aljarah, Ibrahim
PROGRESS IN ARTIFICIAL INTELLIGENCE, 2023, 12 (01) : 77 - 97
[44] Analyze EEG signals with extreme learning machine based on PMIS feature selection
Huanyu Zhao
Xueyan Guo
Mingwei Wang
Tongliang Li
Chaoyi Pang
Dimitrios Georgakopoulos
International Journal of Machine Learning and Cybernetics, 2018, 9 : 243 - 249
[45] Analyze EEG signals with extreme learning machine based on PMIS feature selection
Zhao, Huanyu
Guo, Xueyan
Wang, Mingwei
Li, Tongliang
Pang, Chaoyi
Georgakopoulos, Dimitrios
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (02) : 243 - 249
[46] Scene Recognition Based on Extreme Learning Machine for Digital Video Archive Management
Cheng, DongSheng
Yu, Wenjing
He, Xiaoling
Ni, Shilong
Lv, Junyu
Zeng, Weibo
Yu Yuanlong
2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 1619 - 1624
[47] Speech emotion recognition based on multimodal and multiscale feature fusion
Hu, Huangshui
Wei, Jie
Sun, Hongyu
Wang, Chuhang
Tao, Shuo
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
[48] Genetic Algorithm Application to Feature Selection in sEMG Movement Recognition with Regularized Extreme Learning Machine
Tosin, Mauricio C.
Bagesteiro, Leia B.
Balbinot, Alexandre
42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 666 - 669
[49] Video-Audio Emotion Recognition Based on Feature Fusion Deep Learning Method
Song, Yanan
Cai, Yuanyang
Tan, Lizhe
2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 611 - 616
[50] A Deep Feature based Multi-kernel Learning Approach for Video Emotion Recognition
Li, Wei
Abtahi, Farnaz
Zhu, Zhigang
ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 482 - 489

← 1 2 3 4 5 →