DUAL FOCUS ATTENTION NETWORK FOR VIDEO EMOTION RECOGNITION

被引:0
|
作者
Qiu, Haonan [1 ]
He, Liang [1 ]
Wang, Feng [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Video emotion recognition; attention for video; deep learning;
D O I
10.1109/icme46284.2020.9102808
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Video emotion recognition is a challenging task due to complex scenes and various forms of emotion expression. Most existing works focus on fusing multiple features over the whole video clips. According to our observations, given a long video clip, the emotion is usually presented by only several actions/objects in a few short snippets, and the meaningful cues are buried in the noisy background. When human judging the emotion in videos, we first find the informative clips and then closely look for emotional cues in the frames. In this paper, we propose Dual Focus Attention Network to mimic this process. First, three kinds of features including action, object, and scene are extracted from videos. Second, Two attention modules are used to focus on the visual features of the videos from temporal and spatial dimensions respectively. With our dual focus attention network, we can effectively discover the most emotional frames along the time dimension and the most emotional visual cues in each frame. Our experiments conducted on two widely used datasets Ekman and VideoEmotion show that our proposed approach outperforms the existing approaches.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Efficient dual attention SlowFast networks for video action recognition
    Wei, Dafeng
    Tian, Ye
    Wei, Liqing
    Zhong, Hong
    Chen, Siqian
    Pu, Shiliang
    Lu, Hongtao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [32] A Video Action Recognition Method via Dual-Stream Feature Fusion Neural Network with Attention
    Han, Jianmin
    Li, Jie
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2024, 32 (04) : 673 - 694
  • [33] Emotion recognition with attention mechanism-guided dual-feature multi-path interaction network
    Li, Yaxuan
    Guo, Wenhui
    Wang, Yanjiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 617 - 626
  • [34] Sparse Spatial-Temporal Emotion Graph Convolutional Network for Video Emotion Recognition
    Liu, Xiaodong
    Xu, Huating
    Wang, Miao
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [35] DualSANet: Dual Spatial Attention Network for Iris Recognition
    Yang, Kai
    Xu, Zihao
    Fei, Jingjing
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 888 - 896
  • [36] A CROSS-ATTENTION EMOTION RECOGNITION ALGORITHM BASED ON AUDIO AND VIDEO MODALITIES
    Wu, Xiao
    Mu, Xuan
    Qi, Wen
    Liu, Xiaorui
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 309 - 313
  • [37] Video multimodal emotion recognition based on Bi-GRU and attention fusion
    Huan, Ruo-Hong
    Shu, Jia
    Bao, Sheng-Lin
    Liang, Rong-Hua
    Chen, Peng
    Chi, Kai-Kai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 8213 - 8240
  • [38] Video multimodal emotion recognition based on Bi-GRU and attention fusion
    Ruo-Hong Huan
    Jia Shu
    Sheng-Lin Bao
    Rong-Hua Liang
    Peng Chen
    Kai-Kai Chi
    Multimedia Tools and Applications, 2021, 80 : 8213 - 8240
  • [39] Multipath Attention and Adaptive Gating Network for Video Action Recognition
    Haiping Zhang
    Zepeng Hu
    Dongjin Yu
    Liming Guan
    Xu Liu
    Conghao Ma
    Neural Processing Letters, 56
  • [40] SDAN: Stacked Diverse Attention Network for Video Action Recognition
    Zhu, Xiaoguang
    Huang, Siran
    Fan, Wenjing
    Cheng, Yuhao
    Shao, Huaqing
    Liu, Peilin
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,