PASTFNet: a paralleled attention spatio-temporal fusion network for micro-expression recognition

被引：3

作者：

Tian, Haichen ^{[1
]}

Gong, Weijun ^{[1
]}

Li, Wei ^{[2
]}

Qian, Yurong ^{[1
,2
,3
]}

机构：

[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi, Peoples R China

[2] Xinjiang Univ, Sch Software, Urumqi, Peoples R China

[3] Key Lab Signal Detect & Proc Xinjiang Uygur Autono, Urumqi 830014, Peoples R China

来源：

MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING | 2024年 / 62卷 / 06期

基金：

美国国家科学基金会;

关键词：

Micro-expression recognition; Spatio-temporal features; Multi-scale fusion; Attention; Dual-branch; 3D;

D O I：

10.1007/s11517-024-03041-y

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Micro-expressions (MEs) play such an important role in predicting a person's genuine emotions, as to make micro-expression recognition such an important resea rch focus in recent years. Most recent researchers have made efforts to recognize MEs with spatial and temporal information of video clips. However, because of their short duration and subtle intensity, capturing spatio-temporal features of micro-expressions remains challenging. To effectively promote the recognition performance, this paper presents a novel paralleled dual-branch attention-based spatio-temporal fusion network (PASTFNet). We jointly extract short- and long-range spatial relationships in spatial branch. Inspired by the composite architecture of the convolutional neural network (CNN) and long short-term memory (LSTM) for temporal modeling, we propose a novel attention-based multi-scale feature fusion network (AMFNet) to encode features of sequential frames, which can learn more expressive facial-detailed features for it implements the integrated use of attention and multi-scale feature fusion, then design an aggregation block to aggregate and acquire temporal features. At last, the features learned by the above two branches are fused to accomplish expression recognition with outstanding effect. Experiments on two MER datasets (CASMEII and SAMM) show that the PASTFNet model achieves promising ME recognition performance compared with other methods.

引用

页码：1911 / 1924

页数：14

共 50 条

[41] The dual stream network with embedding temporal convolution for micro-expression recognition
Wang, Haiquan
Wang, Kunxia
Yu, Wancheng
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (05)
[42] IMPROVING THE ACCURACY OF FACIAL MICRO-EXPRESSION RECOGNITION: SPATIO-TEMPORAL DEEP LEARNING WITH ENHANCED DATA AUGMENTATION AND CLASS BALANCING
Irawan, Budhi
Munir, Rinaldi
Utama, Nugraha Priya
Purwarianti, Ayu
Interdisciplinary Journal of Information, Knowledge, and Management, 2024, 19
[43] A Spatio-Temporal Motion Network for Action Recognition Based on Spatial Attention
Yang, Qi
Lu, Tongwei
Zhou, Huabing
ENTROPY, 2022, 24 (03)
[44] Dual Temporal Scale Convolutional Neural Network for Micro-Expression Recognition
Peng, Min
Wang, Chongyang
Chen, Tong
Liu, Guangyuan
Fu, Xiaolan
FRONTIERS IN PSYCHOLOGY, 2017, 8
[45] SPATIO-TEMPORAL SLOWFAST SELF-ATTENTION NETWORK FOR ACTION RECOGNITION
Kim, Myeongjun
Kim, Taehun
Kim, Daijin
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2206 - 2210
[46] A Dual Pipeline With Spatio-Temporal Attention Fusion Approach for Human Activity Recognition
Wang, Xiaodong
Li, Ying
Fang, Aiqing
He, Pei
Guo, Yangming
IEEE SENSORS JOURNAL, 2024, 24 (15) : 25150 - 25162
[47] AU-assisted Graph Attention Convolutional Network for Micro-Expression Recognition
Xie, Hong-Xia
Lo, Ling
Shuai, Hong-Han
Cheng, Wen-Huang
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2871 - 2880
[48] Spatio-Temporal Fusion Networks for Action Recognition
Cho, Sangwoo
Foroosh, Hassan
COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 347 - 364
[49] STRAN: Student expression recognition based on spatio-temporal residual attention network in classroom teaching videos
Chen, Zheng
Liang, Meiyu
Xue, Zhe
Yu, Wanying
APPLIED INTELLIGENCE, 2023, 53 (21) : 25310 - 25329
[50] STRAN: Student expression recognition based on spatio-temporal residual attention network in classroom teaching videos
Zheng Chen
Meiyu Liang
Zhe Xue
Wanying Yu
Applied Intelligence, 2023, 53 : 25310 - 25329

← 1 2 3 4 5 →