Dual-STI: Dual-path spatial-temporal interaction learning for dynamic facial expression recognition

Cited by: 1

Authors:

Li, Min [1]

Zhang, Xiaoqin [1]

Fan, Chenxiang [1]

Liao, Tangfei [1]

Xiao, Guobao [2]

Affiliations:

[1] Wenzhou Univ, Coll Comp & Artificial Intelligence, Wenzhou 325035, Peoples R China

[2] Tongji Univ, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China

Source:

INFORMATION SCIENCES | 2024 / Vol. 678

Funding:

National Natural Science Foundation of China;

Keywords:

Dynamic facial expression recognition; Spatial-temporal feature; Spatial-temporal interaction; Comparative learning; NETWORK; AWARE;

DOI:

10.1016/j.ins.2024.120953

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Subject classification code:

0812;

Abstract:

Learning facial evaluation is crucial for dynamic facial expression recognition. Current recognition methods typically extract temporal features after spatial features to achieve low computation complexity. However, these methods struggle to model complex facial evaluations due to a lack of interaction between spatial and temporal features. This paper proposes a novel Dual-path Spatial-Temporal Interaction (Dual-STI) framework that concurrently extracts spatial and temporal features through two efficient paths. Specifically, Dual-STI comprises a spatial path and a temporal path. The spatial path contains several spatial transformers to capture robust facial features from each sampled frame, while the temporal path includes several temporal transformers to learn rich contextual facial features from the sequence of frames. To facilitate spatial-temporal interaction, Dual-STI features a distinct dual-path interaction module that adaptively fuses spatial and temporal features by combining spatial and temporal attention mechanisms. Additionally, comparative learning is introduced into the loss function to enhance this interaction. To evaluate the proposed method, extensive experiments are conducted on three popular benchmarks, namely DFEW, AFEW, and FERV39k. The experimental results demonstrate that the proposed Dual-STI achieves state-of-the-art performance with low computational complexity across all datasets. Notably, Dual-STI shows significant improvements in the "disgust" and "fear" categories, with precision increases of 3.45% and 2.1% on the DFEW dataset, respectively.
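The two-path idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no architectural details, so the parameter-free attention, the tensor layout `(T, N, D)`, and the `interact` fusion rule below are all assumptions chosen only to show spatial attention (within a frame), temporal attention (across frames), and an adaptive fusion of the two.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x):
    # Parameter-free scaled dot-product self-attention over the
    # second-to-last axis (a simplification; real transformers use
    # learned query/key/value projections).
    d = x.shape[-1]
    scores = x @ np.swapaxes(x, -1, -2) / np.sqrt(d)
    return softmax(scores, axis=-1) @ x

def dual_path_features(clip):
    # clip: (T frames, N patch tokens, D channels) -- assumed layout.
    # Spatial path: attention among the N tokens inside each frame.
    spatial = self_attention(clip)                      # (T, N, D)
    # Temporal path: attention among the T frames at each token position.
    temporal = self_attention(np.swapaxes(clip, 0, 1))  # (N, T, D)
    temporal = np.swapaxes(temporal, 0, 1)              # (T, N, D)
    return spatial, temporal

def interact(spatial, temporal):
    # Hypothetical fusion rule: per-token softmax weights over the two
    # paths, scored by mean activation. Illustrative only.
    scores = np.stack([spatial.mean(-1), temporal.mean(-1)], axis=-1)  # (T, N, 2)
    w = softmax(scores, axis=-1)
    return w[..., :1] * spatial + w[..., 1:] * temporal

rng = np.random.default_rng(0)
clip = rng.normal(size=(4, 5, 8))   # 4 frames, 5 tokens, 8 channels
spatial, temporal = dual_path_features(clip)
fused = interact(spatial, temporal)
print(fused.shape)
```

Because the fusion weights are non-negative and sum to one, each fused value lies between the corresponding spatial and temporal values; a learned module would replace the mean-activation scoring.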

Pages: 15
