Video Emotion Recognition with Transferred Deep Feature Encodings

被引:39
|
作者
Xu, Baohan [1 ]
Fu, Yanwei [2 ,3 ]
Jiang, Yu-Gang [1 ]
Li, Boyang [3 ]
Sigal, Leonid [3 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Yangpu Qu, Shanghai Shi, Peoples R China
[2] Fudan Univ, Sch Data Sci, Yangpu Qu, Shanghai Shi, Peoples R China
[3] Disney Res, Orlando, FL 32830 USA
关键词
D O I
10.1145/2911996.2912006
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Despite growing research interest, emotion understanding for user-generated videos remains a challenging problem. Major obstacles include the diversity and complexity of video content, as well as the sparsity of expressed emotions. For the first time, we systematically study large-scale video emotion recognition by transferring deep feature encodings. In addition to the traditional, supervised recognition, we study the problem of zero-shot emotion recognition, where emotions in the test set are unseen during training. To cope with this task, we utilize knowledge transferred from auxiliary image and text corpora. A novel auxiliary Image Transfer Encoding (ITE) process is proposed to efficiently encode and generate video representation. We also thoroughly investigate different configurations of convolutional neural networks. Comprehensive experiments on multiple datasets demonstrate the effectiveness of our framework.
引用
收藏
页码:15 / 22
页数:8
相关论文
共 50 条
  • [1] A novel feature set for video emotion recognition
    Mo, Shasha
    Niu, Jianwei
    Su, Yiming
    Das, Sajal K.
    NEUROCOMPUTING, 2018, 291 : 11 - 20
  • [2] Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature
    Wang, Jinwei
    Zhao, Ziping
    Liang, Jinglian
    Li, Chao
    2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [3] A Deep Feature based Multi-kernel Learning Approach for Video Emotion Recognition
    Li, Wei
    Abtahi, Farnaz
    Zhu, Zhigang
    ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 482 - 489
  • [4] Video-Audio Emotion Recognition Based on Feature Fusion Deep Learning Method
    Song, Yanan
    Cai, Yuanyang
    Tan, Lizhe
    2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 611 - 616
  • [5] Deep Feature Flow for Video Recognition
    Zhu, Xizhou
    Xiong, Yuwen
    Dai, Jifeng
    Yuan, Lu
    Wei, Yichen
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4141 - 4150
  • [6] Simple, Efficient and Effective Encodings of Local Deep Features for Video Action Recognition
    Duta, Ionut C.
    Ionescu, Bogdan
    Aizawa, Kiyoharu
    Sebe, Nicu
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 223 - 230
  • [7] Deep Local Video Feature for Action Recognition
    Lan, Zhenzhong
    Zhu, Yi
    Hauptmann, Alexander G.
    Newsam, Shawn
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1219 - 1225
  • [8] Deep feature pyramid network for EEG emotion recognition
    Hou, Fazheng
    Gao, Qiang
    Song, Yu
    Wang, Zhe
    Bai, Zhongli
    Yang, Yi
    Tian, Zekun
    MEASUREMENT, 2022, 201
  • [9] Discriminative Deep Feature Learning for Facial Emotion Recognition
    Dinh Viet Sang
    Le Tran Bao Cuong
    Pham Thai Ha
    2018 1ST INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2018,
  • [10] Deep facial emotion recognition in video using eigenframes
    Hajarolasvadi, Noushin
    Demirel, Hasan
    IET IMAGE PROCESSING, 2020, 14 (14) : 3536 - 3546