Kernel Fusion of Audio and Visual Information for Emotion Recognition

被引:0
|
作者
Wang, Yongjin [1 ]
Zhang, Rui [1 ]
Guan, Ling [1 ]
Venetsanopoulos, A. N. [1 ]
机构
[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON, Canada
关键词
Audiovisual emotion recognition; kernel methods; multimodal information fusion; DISCRIMINANT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective analysis and recognition of human emotional behavior are important for achieving efficient and intelligent human computer interaction. This paper presents an approach for audiovisual based multimodal emotion recognition. The proposed solution integrates the audio and visual information by fusing the kernel matrices of respective channels through algebraic operations, followed by dimensionality reduction techniques to map the original disparate features to a nonlinearly transformed joint subspace. A hidden Markov model is employed for characterizing the statistical dependence across successive frames, and identifying the inherent temporal structure of the features. We examine the kernel fusion method at both feature and score levels. The effectiveness of the proposed method is demonstrated through extensive experimentation.
引用
收藏
页码:140 / 150
页数:11
相关论文
共 50 条
  • [1] Multimodal Information Fusion of Audio Emotion Recognition Based on Kernel Entropy Component Analysis
    Xie, Zhibing
    Guan, Ling
    2012 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2012, : 1 - 8
  • [2] MULTIMODAL INFORMATION FUSION OF AUDIO EMOTION RECOGNITION BASED ON KERNEL ENTROPY COMPONENT ANALYSIS
    Xie, Zhibing
    Guan, Ling
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2013, 7 (01) : 25 - 42
  • [3] Fusion of Classifier Predictions for Audio-Visual Emotion Recognition
    Noroozi, Fatemeh
    Marjanovic, Marina
    Njegus, Angelina
    Escalera, Sergio
    Anbarjafari, Gholamreza
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 61 - 66
  • [4] INFORMATION FUSION OF AUDIO EMOTION RECOGNITION BASED ON KERNEL ENTROPY COMPONENT ANALYSIS IN CANONICAL CORRELATION SPACE
    Gao, Lei
    Qi, Lin
    Guan, Ling
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 241 - 244
  • [5] An Infant Emotion Recognition System Using Visual and Audio Information
    Fang, Chiung-Yao
    Ma, Chung-Wen
    Chiang, Meng-Lin
    Chen, Sei-Wang
    2017 4TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS (ICIEA), 2017, : 284 - 291
  • [6] Semantic audio-visual data fusion for automatic emotion recognition
    Datcu, Dragos
    Rothkrantz, Leon J. M.
    EUROMEDIA '2008, 2008, : 58 - 65
  • [7] A new information fusion method for SVM-Based robotic audio-visual emotion recognition
    Han, Meng-Ju
    Hsu, Jing-Huai
    Song, Kai-Tai
    Chang, Fuh-Yu
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 2464 - +
  • [8] Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
    Praveen, R. Gnana
    Granger, Eric
    Cardinal, Patrick
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [9] Information Fusion VIA Optimized KECA with Application to Audio Emotion Recognition
    Gao, Lei
    Guan, Ling
    2018 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC 2018), 2018, : 255 - 264
  • [10] Multistage information fusion for audio-visual speech recognition
    Chu, SM
    Libal, V
    Marcheret, E
    Neti, C
    Potamianos, G
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1651 - 1654