Kernel Fusion of Audio and Visual Information for Emotion Recognition

Cited by: 0

Authors:

Wang, Yongjin [1]

Zhang, Rui [1]

Guan, Ling [1]

Venetsanopoulos, A. N. [1]

Affiliations:

[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON, Canada

Source:

IMAGE ANALYSIS AND RECOGNITION: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, PT II | 2011 / Vol. 6754

Keywords:

Audiovisual emotion recognition; kernel methods; multimodal information fusion; DISCRIMINANT;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Effective analysis and recognition of human emotional behavior are important for achieving efficient and intelligent human-computer interaction. This paper presents an approach for audiovisual-based multimodal emotion recognition. The proposed solution integrates the audio and visual information by fusing the kernel matrices of the respective channels through algebraic operations, followed by dimensionality reduction techniques that map the original disparate features to a nonlinearly transformed joint subspace. A hidden Markov model is employed to characterize the statistical dependence across successive frames and to identify the inherent temporal structure of the features. We examine the kernel fusion method at both the feature and score levels. The effectiveness of the proposed method is demonstrated through extensive experimentation.
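
The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the RBF kernel, the weighted-sum and element-wise-product fusion rules, kernel PCA as the dimensionality reduction step, and all names and parameter values (`rbf_kernel`, `fuse_kernels`, `kernel_pca`, `gamma`, `weight`) are assumptions chosen for illustration, and the hidden Markov model stage is omitted.

```python
import numpy as np


def rbf_kernel(X, gamma):
    """Gram matrix of an RBF kernel over the rows of X (assumed kernel choice)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(d2, 0.0))


def fuse_kernels(K_audio, K_visual, mode="sum", weight=0.5):
    """Fuse two per-channel kernel matrices with a simple algebraic operation."""
    if mode == "sum":
        # Convex combination of valid kernels is itself a valid kernel.
        return weight * K_audio + (1.0 - weight) * K_visual
    if mode == "product":
        # Element-wise (Schur) product of valid kernels is also a valid kernel.
        return K_audio * K_visual
    raise ValueError(f"unknown fusion mode: {mode}")


def kernel_pca(K, n_components):
    """Project the training samples onto the leading kernel principal components."""
    n = K.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    Kc = J @ K @ J                           # kernel centered in feature space
    eigvals, eigvecs = np.linalg.eigh(Kc)    # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    lam = np.maximum(eigvals[order], 1e-12)  # guard against tiny negative values
    return eigvecs[:, order] * np.sqrt(lam)  # shape: (n_samples, n_components)


# Synthetic stand-ins for per-frame audio and visual feature vectors.
rng = np.random.default_rng(0)
audio = rng.normal(size=(20, 12))    # 20 frames, 12 hypothetical audio features
visual = rng.normal(size=(20, 30))   # 20 frames, 30 hypothetical visual features

K = fuse_kernels(rbf_kernel(audio, 0.05), rbf_kernel(visual, 0.02), mode="sum")
Z = kernel_pca(K, n_components=3)    # joint low-dimensional representation
```

Because both channels are reduced to kernel matrices of the same size (samples by samples), features of different dimensionality can be combined without concatenating them directly; the rows of `Z` could then serve as the observation sequence for a temporal model.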

Pages: 140-150

Number of pages: 11
