DUAL FOCUS ATTENTION NETWORK FOR VIDEO EMOTION RECOGNITION

被引：0

作者：

Qiu, Haonan ^{[1
]}

He, Liang ^{[1
]}

Wang, Feng ^{[1
]}

机构：

[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2020年

基金：

中国国家自然科学基金;

关键词：

Video emotion recognition; attention for video; deep learning;

D O I：

10.1109/icme46284.2020.9102808

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Video emotion recognition is a challenging task due to complex scenes and various forms of emotion expression. Most existing works focus on fusing multiple features over the whole video clips. According to our observations, given a long video clip, the emotion is usually presented by only several actions/objects in a few short snippets, and the meaningful cues are buried in the noisy background. When human judging the emotion in videos, we first find the informative clips and then closely look for emotional cues in the frames. In this paper, we propose Dual Focus Attention Network to mimic this process. First, three kinds of features including action, object, and scene are extracted from videos. Second, Two attention modules are used to focus on the visual features of the videos from temporal and spatial dimensions respectively. With our dual focus attention network, we can effectively discover the most emotional frames along the time dimension and the most emotional visual cues in each frame. Our experiments conducted on two widely used datasets Ekman and VideoEmotion show that our proposed approach outperforms the existing approaches.

引用

页数：6

共 50 条

[31] Efficient dual attention SlowFast networks for video action recognition
Wei, Dafeng
Tian, Ye
Wei, Liqing
Zhong, Hong
Chen, Siqian
Pu, Shiliang
Lu, Hongtao
COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
[32] A Video Action Recognition Method via Dual-Stream Feature Fusion Neural Network with Attention
Han, Jianmin
Li, Jie
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2024, 32 (04) : 673 - 694
[33] Emotion recognition with attention mechanism-guided dual-feature multi-path interaction network
Li, Yaxuan
Guo, Wenhui
Wang, Yanjiang
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 617 - 626
[34] Sparse Spatial-Temporal Emotion Graph Convolutional Network for Video Emotion Recognition
Liu, Xiaodong
Xu, Huating
Wang, Miao
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[35] DualSANet: Dual Spatial Attention Network for Iris Recognition
Yang, Kai
Xu, Zihao
Fei, Jingjing
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 888 - 896
[36] A CROSS-ATTENTION EMOTION RECOGNITION ALGORITHM BASED ON AUDIO AND VIDEO MODALITIES
Wu, Xiao
Mu, Xuan
Qi, Wen
Liu, Xiaorui
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 309 - 313
[37] Video multimodal emotion recognition based on Bi-GRU and attention fusion
Huan, Ruo-Hong
Shu, Jia
Bao, Sheng-Lin
Liang, Rong-Hua
Chen, Peng
Chi, Kai-Kai
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 8213 - 8240
[38] Video multimodal emotion recognition based on Bi-GRU and attention fusion
Ruo-Hong Huan
Jia Shu
Sheng-Lin Bao
Rong-Hua Liang
Peng Chen
Kai-Kai Chi
Multimedia Tools and Applications, 2021, 80 : 8213 - 8240
[39] Multipath Attention and Adaptive Gating Network for Video Action Recognition
Haiping Zhang
Zepeng Hu
Dongjin Yu
Liming Guan
Xu Liu
Conghao Ma
Neural Processing Letters, 56
[40] SDAN: Stacked Diverse Attention Network for Video Action Recognition
Zhu, Xiaoguang
Huang, Siran
Fan, Wenjing
Cheng, Yuhao
Shao, Huaqing
Liu, Peilin
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,

← 1 2 3 4 5 →