Dual-STI: Dual-path spatial-temporal interaction learning for dynamic facial expression recognition

Cited by: 1
Authors
Li, Min [1]
Zhang, Xiaoqin [1]
Fan, Chenxiang [1]
Liao, Tangfei [1]
Xiao, Guobao [2]
Affiliations
[1] Wenzhou Univ, Coll Comp & Artificial Intelligence, Wenzhou 325035, Peoples R China
[2] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Dynamic facial expression recognition; Spatial-temporal feature; Spatial-temporal interaction; Comparative learning; NETWORK; AWARE
DOI
10.1016/j.ins.2024.120953
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Learning facial evaluation is crucial for dynamic facial expression recognition. Current recognition methods typically extract temporal features after spatial features to achieve low computational complexity. However, these methods struggle to model complex facial evaluations due to a lack of interaction between spatial and temporal features. This paper proposes a novel Dual-path Spatial-Temporal Interaction (Dual-STI) framework that concurrently extracts spatial and temporal features through two efficient paths. Specifically, Dual-STI comprises a spatial path and a temporal path. The spatial path contains several spatial transformers to capture robust facial features from each sampled frame, while the temporal path includes several temporal transformers to learn rich contextual facial features from the sequence of frames. To facilitate spatial-temporal interaction, Dual-STI features a distinct dual-path interaction module that adaptively fuses spatial and temporal features by combining spatial and temporal attention mechanisms. Additionally, comparative learning is introduced into the loss function to enhance this interaction. To evaluate the proposed method, extensive experiments are conducted on three popular benchmarks, namely DFEW, AFEW, and FERV39k. The experimental results demonstrate that the proposed Dual-STI achieves state-of-the-art performance with low computational complexity across all datasets. Notably, Dual-STI shows significant improvements in the "disgust" and "fear" categories, with precision increases of 3.45% and 2.1% on the DFEW dataset, respectively.
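The difference between the two paths described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the attention here has a single head and no learned projections, the clip shape `(T, N, D)` (frames, patches per frame, feature size) is an assumption, and the `fuse` gate is a hypothetical stand-in for the dual-path interaction module, whose actual design the abstract does not specify.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(x):
    """Scaled dot-product self-attention over the second-to-last axis.

    Assumption: queries, keys and values are the input itself
    (no learned projections), to keep the sketch short.
    """
    d = x.shape[-1]
    scores = x @ np.swapaxes(x, -1, -2) / np.sqrt(d)
    return softmax(scores, axis=-1) @ x


def spatial_path(clip):
    """Attend over the N patches inside each frame; frames stay independent."""
    return self_attention(clip)  # clip: (T, N, D)


def temporal_path(clip):
    """Attend over the T frames separately for each patch position."""
    per_patch = np.swapaxes(clip, 0, 1)  # (N, T, D)
    return np.swapaxes(self_attention(per_patch), 0, 1)  # back to (T, N, D)


def fuse(spatial, temporal):
    """Hypothetical adaptive fusion: a per-token sigmoid gate computed from
    the similarity of the two feature sets mixes them."""
    d = spatial.shape[-1]
    similarity = (spatial * temporal).sum(axis=-1, keepdims=True) / np.sqrt(d)
    gate = 1.0 / (1.0 + np.exp(-similarity))
    return gate * spatial + (1.0 - gate) * temporal


rng = np.random.default_rng(0)
clip = rng.normal(size=(8, 16, 32))  # assumed: 8 frames, 16 patches, 32 features
fused = fuse(spatial_path(clip), temporal_path(clip))
print(fused.shape)
```

In this sketch, changing one frame leaves the spatial-path output of every other frame untouched, while the temporal-path output of all frames changes. That is the sense in which only the temporal path sees context across the sequence, and why some fusion step is needed for the two kinds of features to interact.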
Pages: 15
Related papers
50 items in total
  • [21] Deep-Learning-Based Stress Recognition with Spatial-Temporal Facial Information
    Jeon, Taejae
    Bae, Han Byeol
    Lee, Yongju
    Jang, Sungjun
    Lee, Sangyoun
    SENSORS, 2021, 21 (22)
  • [22] Unsupervised Dual Modality Prompt Learning for Facial Expression Recognition
    Shahid, Muhammad
    2023 8th International Conference on Computer and Communication Systems, ICCCS 2023, 2023 : 1056 - 1061
  • [23] Spatial-temporal dual-actor CNN for human interaction prediction in video
    Afrasiabi, Mahlagha
    Khotanlou, Hassan
    Gevers, Theo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 20019 - 20038
  • [24] Spatial-temporal dual-actor CNN for human interaction prediction in video
    Mahlagha Afrasiabi
    Hassan Khotanlou
    Theo Gevers
    Multimedia Tools and Applications, 2020, 79 : 20019 - 20038
  • [25] Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
    Hu, Yuchen
    Hou, Nana
    Chen, Chen
    Chng, Eng Siong
    INTERSPEECH 2023, 2023 : 2918 - 2922
  • [26] Facial Expression Recognition Using Dual Path Feature Fusion and Stacked Attention
    Zhu, Hongtao
    Xu, Huahu
    Ma, Xiaojin
    Bian, Minjie
    FUTURE INTERNET, 2022, 14 (09)
  • [27] Spatial-Temporal Graphs Plus Transformers for Geometry-Guided Facial Expression Recognition
    Zhao, Rui
    Liu, Tianshan
    Huang, Zixun
    Lun, Daniel P. K.
    Lam, Kin-Man
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2751 - 2767
  • [28] A Dual Attention Spatial-Temporal Graph Convolutional Network for Emotion Recognition from Gait
    Liu, Jiaqing
    Kisita, Shoji
    Chai, Shurong
    Tateyama, Tomoko
    Iwamoto, Yutaro
    Chen, Yen-Wei
    Journal of the Institute of Image Electronics Engineers of Japan, 2022, 51 (04) : 309 - 317
  • [29] Dual attention based spatial-temporal inference network for volleyball group activity recognition
    Yanshan Li
    Yan Liu
    Rui Yu
    Hailin Zong
    Weixin Xie
    Multimedia Tools and Applications, 2023, 82 : 15515 - 15533
  • [30] Dual attention based spatial-temporal inference network for volleyball group activity recognition
    Li, Yanshan
    Liu, Yan
    Yu, Rui
    Zong, Hailin
    Xie, Weixin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 15515 - 15533