PASTFNet: a paralleled attention spatio-temporal fusion network for micro-expression recognition

被引:3
|
作者
Tian, Haichen [1 ]
Gong, Weijun [1 ]
Li, Wei [2 ]
Qian, Yurong [1 ,2 ,3 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi, Peoples R China
[2] Xinjiang Univ, Sch Software, Urumqi, Peoples R China
[3] Key Lab Signal Detect & Proc Xinjiang Uygur Autono, Urumqi 830014, Peoples R China
基金
美国国家科学基金会;
关键词
Micro-expression recognition; Spatio-temporal features; Multi-scale fusion; Attention; Dual-branch; 3D;
D O I
10.1007/s11517-024-03041-y
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Micro-expressions (MEs) play such an important role in predicting a person's genuine emotions, as to make micro-expression recognition such an important resea rch focus in recent years. Most recent researchers have made efforts to recognize MEs with spatial and temporal information of video clips. However, because of their short duration and subtle intensity, capturing spatio-temporal features of micro-expressions remains challenging. To effectively promote the recognition performance, this paper presents a novel paralleled dual-branch attention-based spatio-temporal fusion network (PASTFNet). We jointly extract short- and long-range spatial relationships in spatial branch. Inspired by the composite architecture of the convolutional neural network (CNN) and long short-term memory (LSTM) for temporal modeling, we propose a novel attention-based multi-scale feature fusion network (AMFNet) to encode features of sequential frames, which can learn more expressive facial-detailed features for it implements the integrated use of attention and multi-scale feature fusion, then design an aggregation block to aggregate and acquire temporal features. At last, the features learned by the above two branches are fused to accomplish expression recognition with outstanding effect. Experiments on two MER datasets (CASMEII and SAMM) show that the PASTFNet model achieves promising ME recognition performance compared with other methods.
引用
收藏
页码:1911 / 1924
页数:14
相关论文
共 50 条
  • [41] The dual stream network with embedding temporal convolution for micro-expression recognition
    Wang, Haiquan
    Wang, Kunxia
    Yu, Wancheng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (05)
  • [42] IMPROVING THE ACCURACY OF FACIAL MICRO-EXPRESSION RECOGNITION: SPATIO-TEMPORAL DEEP LEARNING WITH ENHANCED DATA AUGMENTATION AND CLASS BALANCING
    Irawan, Budhi
    Munir, Rinaldi
    Utama, Nugraha Priya
    Purwarianti, Ayu
    Interdisciplinary Journal of Information, Knowledge, and Management, 2024, 19
  • [43] A Spatio-Temporal Motion Network for Action Recognition Based on Spatial Attention
    Yang, Qi
    Lu, Tongwei
    Zhou, Huabing
    ENTROPY, 2022, 24 (03)
  • [44] Dual Temporal Scale Convolutional Neural Network for Micro-Expression Recognition
    Peng, Min
    Wang, Chongyang
    Chen, Tong
    Liu, Guangyuan
    Fu, Xiaolan
    FRONTIERS IN PSYCHOLOGY, 2017, 8
  • [45] SPATIO-TEMPORAL SLOWFAST SELF-ATTENTION NETWORK FOR ACTION RECOGNITION
    Kim, Myeongjun
    Kim, Taehun
    Kim, Daijin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2206 - 2210
  • [46] A Dual Pipeline With Spatio-Temporal Attention Fusion Approach for Human Activity Recognition
    Wang, Xiaodong
    Li, Ying
    Fang, Aiqing
    He, Pei
    Guo, Yangming
    IEEE SENSORS JOURNAL, 2024, 24 (15) : 25150 - 25162
  • [47] AU-assisted Graph Attention Convolutional Network for Micro-Expression Recognition
    Xie, Hong-Xia
    Lo, Ling
    Shuai, Hong-Han
    Cheng, Wen-Huang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2871 - 2880
  • [48] Spatio-Temporal Fusion Networks for Action Recognition
    Cho, Sangwoo
    Foroosh, Hassan
    COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 347 - 364
  • [49] STRAN: Student expression recognition based on spatio-temporal residual attention network in classroom teaching videos
    Chen, Zheng
    Liang, Meiyu
    Xue, Zhe
    Yu, Wanying
    APPLIED INTELLIGENCE, 2023, 53 (21) : 25310 - 25329
  • [50] STRAN: Student expression recognition based on spatio-temporal residual attention network in classroom teaching videos
    Zheng Chen
    Meiyu Liang
    Zhe Xue
    Wanying Yu
    Applied Intelligence, 2023, 53 : 25310 - 25329