Dual-STI: Dual-path spatial-temporal interaction learning for dynamic facial expression recognition

被引：1

作者：

Li, Min ^{[1
]}

Zhang, Xiaoqin ^{[1
]}

Fan, Chenxiang ^{[1
]}

Liao, Tangfei ^{[1
]}

Xiao, Guobao ^{[2
]}

机构：

[1] Wenzhou Univ, Coll Comp & Artificial Intelligence, Wenzhou 325035, Peoples R China

[2] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China

来源：

INFORMATION SCIENCES | 2024年 / 678卷

基金：

中国国家自然科学基金;

关键词：

Dynamic facial expression recognition; Spatial-temporal feature; Spatial-temporal interaction; Comparative learning; NETWORK; AWARE;

D O I：

10.1016/j.ins.2024.120953

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning facial evaluation is crucial for dynamic facial expression recognition. Current recognition methods typically extract temporal features after spatial features to achieve low computation complexity. However, these methods struggle to model complex facial evaluations due to a lack of interaction between spatial and temporal features. This paper proposes a novel Dualpath Spatial -Temporal Interaction (Dual-STI) framework that concurrently extracts spatial and temporal features through two efficient paths. Specifically, Dual-STI comprises a spatial path and a temporal path. The spatial path contains several spatial transformers to capture robust facial features from each sampled frame, while the temporal path includes several temporal transformers to learn rich contextual facial features from the sequence of frames. To facilitate spatial -temporal interaction, Dual-STI features a distinct dual-path interaction module that adaptively fuses spatial and temporal features by combining spatial and temporal attention mechanisms. Additionally, comparative learning is introduced into the loss function to enhance this interaction. To evaluate the proposed method, extensive experiments are conducted on three popular benchmarks, namely DFEW, AFEW, and FERV39k. The experimental results demonstrate that the proposed Dual-STI achieves state -of -the -art performance with low computational complexity across all datasets. Notably, Dual-STI shows significant improvements in the "disgust" and "fear" categories, with precision increases of 3 .45% and 2 .1% on the DFEW dataset, respectively.

引用

页数：15

共 50 条

[1] Enhanced spatial-temporal learning network for dynamic facial expression recognition
Gong, Weijun
Qian, Yurong
Zhou, Weihang
Leng, Hongyong
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
[2] Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition
Han, Mingfei
Zhang, David Junhao
Wang, Yali
Yan, Rui
Yao, Lina
Chang, Xiaojun
Qiao, Yu
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2980 - 2989
[3] Facial Expression Recognition with Identity and Spatial-temporal Integrated Learning
Teng, Halting
Zhang, Dong
Li, Ming
Huang, Yudong
2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2019, : 100 - 104
[4] STCAM: Spatial-Temporal and Channel Attention Module for Dynamic Facial Expression Recognition
Chen, Weicong
Zhang, Dong
Li, Ming
Lee, Dah-Jye
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (01) : 800 - 810
[5] A Mix Fusion Spatial-Temporal Network for Facial Expression Recognition
Shu, Chang
Xue, Feng
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V, 2024, 14429 : 315 - 326
[6] STADNet: Spatial-Temporal Attention-Guided Dual-Path Network for cardiac cine MRI super-resolution
Lyu, Jun
Wang, Shuo
Tian, Yapeng
Zou, Jing
Dong, Shunjie
Wang, Chengyan
Aviles-Rivero, Angelica I.
Qin, Jing
MEDICAL IMAGE ANALYSIS, 2024, 94
[7] Facial expression recognition using dual dictionary learning
Moeini, Ali
Faez, Karim
Moeini, Hossein
Safai, Armon Matthew
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 45 : 20 - 33
[8] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
Zhang, Lifeng
Zheng, Xiangwei
Chen, Xuanchi
Ren, Xiuxiu
Ji, Cun
NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6109 - 6124
[9] Facial Expression Recognition Based on Spatial-Temporal Fusion with Attention Mechanism
Lifeng Zhang
Xiangwei Zheng
Xuanchi Chen
Xiuxiu Ren
Cun Ji
Neural Processing Letters, 2023, 55 : 6109 - 6124
[10] FACIAL EXPRESSION RECOGNITION USING SPATIAL-TEMPORAL SEMANTIC GRAPH NETWORK
Zhou, Jinzhao
Zhang, Xingming
Liu, Yang
Lan, Xiangyuan
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1961 - 1965

← 1 2 3 4 5 →