A cascaded spatiotemporal attention network for dynamic facial expression recognition

被引:0
|
作者
Yaoguang Ye
Yongqi Pan
Yan Liang
Jiahui Pan
机构
[1] South China Normal University,School of Software
[2] South China Agricultural University,College of Mathematics and Informatics College of Software Engineering
[3] Pazhou Lab,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Dynamic facial expression recognition; Spatiotemporal features; Cascaded network; Attention module;
D O I
暂无
中图分类号
学科分类号
摘要
Dynamic facial expression recognition (DFER) is a promising research area because it concerns the dynamic change pattern of facial expressions, but it is difficult to effectively capture the facial appearances and dynamic temporal information of each image in an image sequence. In this paper, a cascaded spatiotemporal attention network (CSTAN) is proposed to learn and integrate spatial and temporal emotional information in the process of facial expression change. Three types of attention modules are embedded into the cascaded network to enable it to extract more informative spatiotemporal features for the DFER task in different dimensions. A channel attention module helps the network focus on the meaningful spatial feature maps for the DFER task, a spatial attention module focuses on the regions of interest among the spatial feature maps, and a temporal attention module aims to explore the dynamic temporal information when an expression changes. The experimental results on three public facial expression recognition datasets prove the good performance of the CSTAN, and it can extract representative spatiotemporal features. Meanwhile, the visualization results reveal that the CSTAN can locate regions of interest and contributing timesteps, which illustrates the effectiveness of the multidimensional attention modules.
引用
收藏
页码:5402 / 5415
页数:13
相关论文
共 50 条
  • [31] Multi-Scale Attention Learning Network for Facial Expression Recognition
    Dong, Qian
    Ren, Weihong
    Gao, Yu
    Jiang, Weibo
    Liu, Honghai
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1732 - 1736
  • [32] Twinned attention network for occlusion-aware facial expression recognition
    Devasena, G.
    Vidhya, V.
    MACHINE VISION AND APPLICATIONS, 2025, 36 (01)
  • [33] Facial expression recognition based on attention mechanism ResNet lightweight network
    Zhao Xiao
    Yang Chen
    Wang Ruo-nan
    Li Yue-chen
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (11) : 1503 - 1510
  • [34] Ventral-Dorsal Attention Capsule Network for facial expression recognition
    Qian, Zhizhe
    Mu, Jing
    Tian, Feng
    DIGITAL SIGNAL PROCESSING, 2023, 136
  • [35] Triple attention feature enhanced pyramid network for facial expression recognition
    Fang, Jian
    Lin, Xiaomei
    Liu, Weida
    An, Yi
    Sun, Haoran
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 8649 - 8661
  • [36] Hierarchical attention network with progressive feature fusion for facial expression recognition
    Tao, Huanjie
    Duan, Qianyue
    NEURAL NETWORKS, 2024, 170 : 337 - 348
  • [37] Facial expression recognition based on strong attention mechanism and residual network
    Zhizhe Qian
    Jing Mu
    Feng Tian
    Zhiyu Gao
    Jie Zhang
    Multimedia Tools and Applications, 2023, 82 : 14287 - 14306
  • [38] Facial Expression Recognition Method Embedded with Attention Mechanism Residual Network
    Zhong, Rui
    Jiang, Bin
    Li, Nanxing
    Cui, Xiaomei
    Computer Engineering and Applications, 2023, 59 (11) : 88 - 97
  • [39] Facial expression recognition based on strong attention mechanism and residual network
    Qian, Zhizhe
    Mu, Jing
    Tian, Feng
    Gao, Zhiyu
    Zhang, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 14287 - 14306
  • [40] Deep Joint Spatiotemporal Network (DJS']JSTN) for Efficient Facial Expression Recognition
    Jeong, Dami
    Kim, Byung-Gyu
    Dong, Suh-Yeon
    SENSORS, 2020, 20 (07)