Video Emotion Recognition with Transferred Deep Feature Encodings

Cited by: 39

Authors:

Xu, Baohan [1]

Fu, Yanwei [2,3]

Jiang, Yu-Gang [1]

Li, Boyang [3]

Sigal, Leonid [3]

Affiliations:

[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Sch Comp Sci, Yangpu Qu, Shanghai Shi, Peoples R China

[2] Fudan Univ, Sch Data Sci, Yangpu Qu, Shanghai Shi, Peoples R China

[3] Disney Res, Orlando, FL 32830 USA

Source:

ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2016

DOI:

10.1145/2911996.2912006

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

Despite growing research interest, emotion understanding for user-generated videos remains a challenging problem. Major obstacles include the diversity and complexity of video content, as well as the sparsity of expressed emotions. For the first time, we systematically study large-scale video emotion recognition by transferring deep feature encodings. In addition to the traditional, supervised recognition, we study the problem of zero-shot emotion recognition, where emotions in the test set are unseen during training. To cope with this task, we utilize knowledge transferred from auxiliary image and text corpora. A novel auxiliary Image Transfer Encoding (ITE) process is proposed to efficiently encode and generate video representation. We also thoroughly investigate different configurations of convolutional neural networks. Comprehensive experiments on multiple datasets demonstrate the effectiveness of our framework.

Pages: 15-22

Page count: 8

50 records in total

[1] A novel feature set for video emotion recognition
Mo, Shasha
Niu, Jianwei
Su, Yiming
Das, Sajal K.
NEUROCOMPUTING, 2018, 291 : 11 - 20
[2] Video-Based Emotion Recognition using Face Frontalization and Deep Spatiotemporal Feature
Wang, Jinwei
Zhao, Ziping
Liang, Jinglian
Li, Chao
2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018
[3] A Deep Feature based Multi-kernel Learning Approach for Video Emotion Recognition
Li, Wei
Abtahi, Farnaz
Zhu, Zhigang
ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 482 - 489
[4] Video-Audio Emotion Recognition Based on Feature Fusion Deep Learning Method
Song, Yanan
Cai, Yuanyang
Tan, Lizhe
2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 611 - 616
[5] Deep Feature Flow for Video Recognition
Zhu, Xizhou
Xiong, Yuwen
Dai, Jifeng
Yuan, Lu
Wei, Yichen
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4141 - 4150
[6] Simple, Efficient and Effective Encodings of Local Deep Features for Video Action Recognition
Duta, Ionut C.
Ionescu, Bogdan
Aizawa, Kiyoharu
Sebe, Nicu
PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 223 - 230
[7] Deep Local Video Feature for Action Recognition
Lan, Zhenzhong
Zhu, Yi
Hauptmann, Alexander G.
Newsam, Shawn
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1219 - 1225
[8] Deep feature pyramid network for EEG emotion recognition
Hou, Fazheng
Gao, Qiang
Song, Yu
Wang, Zhe
Bai, Zhongli
Yang, Yi
Tian, Zekun
MEASUREMENT, 2022, 201
[9] Discriminative Deep Feature Learning for Facial Emotion Recognition
Dinh Viet Sang
Le Tran Bao Cuong
Pham Thai Ha
2018 1ST INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2018
[10] Deep facial emotion recognition in video using eigenframes
Hajarolasvadi, Noushin
Demirel, Hasan
IET IMAGE PROCESSING, 2020, 14 (14) : 3536 - 3546
