Video Emotion Recognition with Transferred Deep Feature Encodings

被引:39
|
作者
Xu, Baohan [1 ]
Fu, Yanwei [2 ,3 ]
Jiang, Yu-Gang [1 ]
Li, Boyang [3 ]
Sigal, Leonid [3 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Yangpu Qu, Shanghai Shi, Peoples R China
[2] Fudan Univ, Sch Data Sci, Yangpu Qu, Shanghai Shi, Peoples R China
[3] Disney Res, Orlando, FL 32830 USA
关键词
D O I
10.1145/2911996.2912006
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Despite growing research interest, emotion understanding for user-generated videos remains a challenging problem. Major obstacles include the diversity and complexity of video content, as well as the sparsity of expressed emotions. For the first time, we systematically study large-scale video emotion recognition by transferring deep feature encodings. In addition to the traditional, supervised recognition, we study the problem of zero-shot emotion recognition, where emotions in the test set are unseen during training. To cope with this task, we utilize knowledge transferred from auxiliary image and text corpora. A novel auxiliary Image Transfer Encoding (ITE) process is proposed to efficiently encode and generate video representation. We also thoroughly investigate different configurations of convolutional neural networks. Comprehensive experiments on multiple datasets demonstrate the effectiveness of our framework.
引用
收藏
页码:15 / 22
页数:8
相关论文
共 50 条
  • [31] Feature Encodings and Poolings for Action and Event Recognition: A Comprehensive Survey
    Liu, Changyu
    Zhang, Qian
    Lu, Bin
    Li, Cong
    INFORMATION, 2017, 8 (04)
  • [32] AUTOMATIC EMOTION RECOGNITION IN VIDEO
    KalaiSelvi, R.
    Kavitha, P.
    Shunmuganathan, K. L.
    2014 INTERNATIONAL CONFERENCE ON GREEN COMPUTING COMMUNICATION AND ELECTRICAL ENGINEERING (ICGCCEE), 2014,
  • [33] Emotion recognition based on EEG feature maps through deep learning network
    Topic, Ante
    Russo, Mladen
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2021, 24 (06): : 1442 - 1454
  • [34] Speech emotion recognition using feature fusion: a hybrid approach to deep learning
    Khan, Waleed Akram
    ul Qudous, Hamad
    Farhan, Asma Ahmad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 75557 - 75584
  • [35] Feature Learning via Deep Belief Network for Chinese Speech Emotion Recognition
    Zhang, Shiqing
    Zhao, Xiaoming
    Chuang, Yuelong
    Guo, Wenping
    Chen, Ying
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 645 - 651
  • [36] Utterance Level Feature Aggregation with Deep Metric Learning for Speech Emotion Recognition
    Mocanu, Bogdan
    Tapu, Ruxandra
    Zaharia, Titus
    SENSORS, 2021, 21 (12)
  • [37] ADFF: Attention Based Deep Feature Fusion Approach for Music Emotion Recognition
    Huang, Zi
    Ji, Shulei
    Hu, Zhilan
    Cai, Chuangjian
    Luo, Jing
    Yang, Xinyu
    INTERSPEECH 2022, 2022, : 4152 - 4156
  • [38] Enhancing speech emotion recognition through deep learning and handcrafted feature fusion
    Eris, Fatma Gunes
    Akbal, Erhan
    APPLIED ACOUSTICS, 2024, 222
  • [39] Feature Fusion for Multimodal Emotion Recognition Based on Deep Canonical Correlation Analysis
    Zhang, Ke
    Li, Yuanqing
    Wang, Jingyu
    Wang, Zhen
    Li, Xuelong
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1898 - 1902
  • [40] A hybrid deep feature selection framework for emotion recognition from human speeches
    Marik, Aritra
    Chattopadhyay, Soumitri
    Singh, Pawan Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (08) : 11461 - 11487