Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism

被引:0
|
作者
Peng, Cheng [1 ]
Sun, Mingqi [2 ]
Zou, Kun [1 ]
Zhang, Bowen [3 ]
Dai, Genan [3 ]
Tsoi, Ah Chung [4 ]
机构
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Sch Comp, Zhongshan 528402, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Shenzhen Technol Univ, Coll Big Data & Internet, Shenzhen 518118, Peoples R China
[4] Univ Wollongong, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia
关键词
facial expression recognition; visual state space model; attention; object detection;
D O I
10.3390/s24216912
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In studying the joint object detection and classification problem for facial expression recognition (FER) deploying the YOLOX framework, we introduce a novel feature extractor, called neighborhood coordinate attention Mamba (NCAMamba) to substitute for the original feature extractor in the Feature Pyramid Network (FPN). NCAMamba combines the background information reduction capabilities of Mamba, the local neighborhood relationship understanding of neighborhood attention, and the directional relationship understanding of coordinate attention. The resulting FER-YOLO-NCAMamba model, when applied to two unaligned FER benchmark datasets, RAF-DB and SFEW, obtains significantly improved mean average precision (mAP) scores when compared with those obtained by other state-of-the-art methods. Moreover, in ablation studies, it is found that the NCA module is relatively more important than the Visual State Space (VSS), a version of using Mamba for image processing, and in visualization studies using the grad-CAM method, it reveals that regions around the nose tip are critical to recognizing the expression; if it is too large, it may lead to erroneous prediction, while a small focused region would lead to correct recognition; this may explain why FER of unaligned faces is such a challenging problem.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Facial micro-expression recognition based on motion magnification network and graph attention mechanism
    Wu, Falin
    Xia, Yu
    Hu, Tiangyang
    Ma, Boyi
    Yang, Jingyao
    Li, Haoxin
    HELIYON, 2024, 10 (16)
  • [42] Child Attention Detection through Facial Expression Recognition using SVM Algorithm
    Baldovino, Aika Patricia
    Vergonio, Frances Neele
    Tomas, John Paul
    2019 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS (ITCC 2019), 2019, : 52 - 58
  • [43] Facial Expression Recognition Based on Multi-Channel Attention Residual Network
    Shen, Tongping
    Xu, Huanqing
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 135 (01): : 539 - 560
  • [44] A facial expression recognition network based on attention double branch enhanced fusion
    Wang, Wenming
    Jia, Min
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [45] A facial expression recognition network based on attention double branch enhanced fusion
    Wang, Wenming
    Jia, Min
    PeerJ Computer Science, 2024, 10 : 1 - 23
  • [46] Facial Expression Recognition Using a Semantic-Based Bottleneck Attention Module
    Zhang, Shengfu
    Xiao, Zhongjie
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2024, 20 (01)
  • [47] STAN: spatiotemporal attention network for video-based facial expression recognition
    Yi, Yufan
    Xu, Yiping
    Ye, Ziyi
    Li, Linhui
    Hu, Xinli
    Tian, Yan
    VISUAL COMPUTER, 2023, 39 (12): : 6205 - 6220
  • [48] Facial expression recognition network with slow convolution and zero-parameter attention mechanism
    Li, Xi
    Xiao, Zhenhua
    Li, Chao
    Li, Congcong
    Liu, Hai
    Fan, Guowen
    OPTIK, 2023, 283
  • [49] Facial Expression Recognition Based on Region-Wise Attention and Geometry Difference
    Du, Heran
    Zheng, Huicheng
    Yu, Mingjing
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 183 - 194
  • [50] Visual attention based composite dense neural network for facial expression recognition
    Shaik N.S.
    Cherukuri T.K.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (12) : 16229 - 16242