Dual-STI: Dual-path spatial-temporal interaction learning for dynamic facial expression recognition

被引:1
|
作者
Li, Min [1 ]
Zhang, Xiaoqin [1 ]
Fan, Chenxiang [1 ]
Liao, Tangfei [1 ]
Xiao, Guobao [2 ]
机构
[1] Wenzhou Univ, Coll Comp & Artificial Intelligence, Wenzhou 325035, Peoples R China
[2] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China
基金
中国国家自然科学基金;
关键词
Dynamic facial expression recognition; Spatial-temporal feature; Spatial-temporal interaction; Comparative learning; NETWORK; AWARE;
D O I
10.1016/j.ins.2024.120953
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning facial evaluation is crucial for dynamic facial expression recognition. Current recognition methods typically extract temporal features after spatial features to achieve low computation complexity. However, these methods struggle to model complex facial evaluations due to a lack of interaction between spatial and temporal features. This paper proposes a novel Dualpath Spatial -Temporal Interaction (Dual-STI) framework that concurrently extracts spatial and temporal features through two efficient paths. Specifically, Dual-STI comprises a spatial path and a temporal path. The spatial path contains several spatial transformers to capture robust facial features from each sampled frame, while the temporal path includes several temporal transformers to learn rich contextual facial features from the sequence of frames. To facilitate spatial -temporal interaction, Dual-STI features a distinct dual-path interaction module that adaptively fuses spatial and temporal features by combining spatial and temporal attention mechanisms. Additionally, comparative learning is introduced into the loss function to enhance this interaction. To evaluate the proposed method, extensive experiments are conducted on three popular benchmarks, namely DFEW, AFEW, and FERV39k. The experimental results demonstrate that the proposed Dual-STI achieves state -of -the -art performance with low computational complexity across all datasets. Notably, Dual-STI shows significant improvements in the "disgust" and "fear" categories, with precision increases of 3 .45% and 2 .1% on the DFEW dataset, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Enhanced spatial-temporal learning network for dynamic facial expression recognition
    Gong, Weijun
    Qian, Yurong
    Zhou, Weihang
    Leng, Hongyong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [2] Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition
    Han, Mingfei
    Zhang, David Junhao
    Wang, Yali
    Yan, Rui
    Yao, Lina
    Chang, Xiaojun
    Qiao, Yu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2980 - 2989
  • [3] Facial Expression Recognition with Identity and Spatial-temporal Integrated Learning
    Teng, Halting
    Zhang, Dong
    Li, Ming
    Huang, Yudong
    2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2019, : 100 - 104
  • [4] STCAM: Spatial-Temporal and Channel Attention Module for Dynamic Facial Expression Recognition
    Chen, Weicong
    Zhang, Dong
    Li, Ming
    Lee, Dah-Jye
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (01) : 800 - 810
  • [5] A Mix Fusion Spatial-Temporal Network for Facial Expression Recognition
    Shu, Chang
    Xue, Feng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V, 2024, 14429 : 315 - 326
  • [6] STADNet: Spatial-Temporal Attention-Guided Dual-Path Network for cardiac cine MRI super-resolution
    Lyu, Jun
    Wang, Shuo
    Tian, Yapeng
    Zou, Jing
    Dong, Shunjie
    Wang, Chengyan
    Aviles-Rivero, Angelica I.
    Qin, Jing
    MEDICAL IMAGE ANALYSIS, 2024, 94
  • [7] Facial expression recognition using dual dictionary learning
    Moeini, Ali
    Faez, Karim
    Moeini, Hossein
    Safai, Armon Matthew
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 45 : 20 - 33
  • [8] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
    Zhang, Lifeng
    Zheng, Xiangwei
    Chen, Xuanchi
    Ren, Xiuxiu
    Ji, Cun
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6109 - 6124
  • [9] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
    Lifeng Zhang
    Xiangwei Zheng
    Xuanchi Chen
    Xiuxiu Ren
    Cun Ji
    Neural Processing Letters, 2023, 55 : 6109 - 6124
  • [10] FACIAL EXPRESSION RECOGNITION USING SPATIAL-TEMPORAL SEMANTIC GRAPH NETWORK
    Zhou, Jinzhao
    Zhang, Xingming
    Liu, Yang
    Lan, Xiangyuan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1961 - 1965