Sparse Kernel Reduced-Rank Regression for Bimodal Emotion Recognition From Facial Expression and Speech

被引：74

作者：

Yan, Jingjie ^{[1
]}

Zheng, Wenming ^{[3
]}

Xu, Qinyu ^{[1
]}

Lu, Guanming ^{[1
]}

Li, Haibo ^{[1
,2
]}

Wang, Bei ^{[3
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Jiangsu Prov Key Lab Image Proc & Image Commun, Coll Telecomm & Informat Engn, Nanjing 210003, Peoples R China

[2] Royal Inst Technol, Sch Comp Sci & Commun, S-11428 Stockholm, Sweden

[3] Southeast Univ, Key Lab Child Dev & Learning Sci, Minist Educ, Res Ctr Learning Sci, Nanjing 210096, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2016年 / 18卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Bimodal emotion recognition; facial expression; feature fusion; sparse kernel reduced-rank regression (SKRRR); speech; PHENOTYPES; FRAMEWORK; FUSION; FACE;

D O I：

10.1109/TMM.2016.2557721

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A novel bimodal emotion recognition approach from facial expression and speech based on the sparse kernel reduced-rank regression (SKRRR) fusion method is proposed in this paper. In this method, we use the openSMILE feature extractor and the scale invariant feature transform feature descriptor to respectively extract effective features from speech modality and facial expression modality, and then propose the SKRRR fusion approach to fuse the emotion features of two modalities. The proposed SKRRR method is a nonlinear extension of the traditional reduced-rank regression (RRR), where both predictor and response feature vectors in RRR are kernelized by being mapped onto two high-dimensional feature space via two nonlinear mappings, respectively. To solve the SKRRR problem, we propose a sparse representation (SR)-based approach to find the optimal solution of the coefficient matrices of SKRRR, where the introduction of the SR technique aims to fully consider the different contributions of training data samples to the derivation of optimal solution of SKRRR. Finally, we utilize the eNTERFACE '05 and AFEW4.0 bimodal emotion database to conduct the experiments of monomodal emotion recognition and bimodal emotion recognition, and the results indicate that our presented approach acquires the highest or comparable bimodal emotion recognition rate among some state-of-the-art approaches.

引用

页码：1319 / 1329

页数：11

共 50 条

[21] Correction: Sparse reduced-rank regression for simultaneous rank and variable selection via manifold optimization
Kohei Yoshikawa
Shuichi Kawano
Computational Statistics, 2023, 38 : 77 - 78
[22] Envelope-based sparse reduced-rank regression for multivariate linear model
Guo, Wenxing
Balakrishnan, Narayanaswamy
He, Mu
JOURNAL OF MULTIVARIATE ANALYSIS, 2023, 195
[23] The implementation of the emotion recognition from speech and facial expression system
Park, CH
Byun, KS
Sim, KB
ADVANCES IN NATURAL COMPUTATION, PT 2, PROCEEDINGS, 2005, 3611 : 85 - 88
[24] Automatic Emotion Recognition for Facial Expression Animation from Speech
Bozkurt, Elif
Erzin, Engin
Erdem, Cigdem Eroglu
Erdem, A. Tanju
2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 271 - +
[25] Srrr-cluster: Using Sparse Reduced-Rank Regression to Optimize iCluster
Ge, Shu-Guang
Xia, Jun-Feng
Wei, Pi-Jing
Zheng, Chun-Hou
INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 99 - 106
[26] Wavelet-Based Sparse Reduced-Rank Regression for Hyperspectral Image Restoration
Rasti, Behnood
Sveinsson, Johannes R.
Ulfarsson, Magnus Orn
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (10): : 6688 - 6698
[27] Bimodal Emotion Recognition from Speech and Text
Ye, Weilin
Fan, Xinghua
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (02) : 26 - 29
[28] Speech Emotion Recognition Based on Robust Discriminative Sparse Regression
Song, Peng
Zheng, Wenming
Yu, Yanwei
Ou, Shifeng
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (02) : 343 - 353
[29] Robust Face Recognition Based on Kernel Reduced Rank Regression
Chen, Ying
Zhang, Longyuan
2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 1316 - 1319
[30] Speech emotion recognition using kernel sparse representation based classifier
Sharma, Pulkit
Abrol, Vinayak
Sachdev, Abhijeet
Dileep, A. D.
Sao, Anil Kumar
2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 374 - 377

← 1 2 3 4 5 →