Video Emotion Recognition with Transferred Deep Feature Encodings

Cited by: 39
Authors
Xu, Baohan [1]
Fu, Yanwei [2, 3]
Jiang, Yu-Gang [1]
Li, Boyang [3]
Sigal, Leonid [3]
Affiliations
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Yangpu Qu, Shanghai Shi, Peoples R China
[2] Fudan Univ, Sch Data Sci, Yangpu Qu, Shanghai Shi, Peoples R China
[3] Disney Res, Orlando, FL 32830 USA
DOI
10.1145/2911996.2912006
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Despite growing research interest, emotion understanding for user-generated videos remains a challenging problem. Major obstacles include the diversity and complexity of video content, as well as the sparsity of expressed emotions. For the first time, we systematically study large-scale video emotion recognition by transferring deep feature encodings. In addition to the traditional, supervised recognition, we study the problem of zero-shot emotion recognition, where emotions in the test set are unseen during training. To cope with this task, we utilize knowledge transferred from auxiliary image and text corpora. A novel auxiliary Image Transfer Encoding (ITE) process is proposed to efficiently encode and generate video representation. We also thoroughly investigate different configurations of convolutional neural networks. Comprehensive experiments on multiple datasets demonstrate the effectiveness of our framework.
Pages: 15-22
Page count: 8