Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild

Cited by: 146
Authors
Zhao, Zengqun [1]
Liu, Qingshan [1]
Wang, Shanmin [2, 3]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[3] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Face recognition; Image recognition; Faces; Convolution; Image reconstruction; Geometry; Facial expression recognition; deep convolutional neural networks; multi-scale; local attention; INFORMATION; PATCHES; JOINT; POSE;
DOI
10.1109/TIP.2021.3093397
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Facial expression recognition (FER) in the wild has received broad attention, with occlusion and pose variation as two key issues. This paper proposes a global multi-scale and local attention network (MA-Net) for FER in the wild. The network consists of three main components: a feature pre-extractor, a multi-scale module, and a local attention module. The feature pre-extractor extracts middle-level features. The multi-scale module fuses features with different receptive fields, which reduces the susceptibility of deeper convolutions to occlusion and pose variation. The local attention module guides the network to focus on salient local features, which alleviates the interference of occlusion and non-frontal poses on FER in the wild. Extensive experiments demonstrate that MA-Net achieves state-of-the-art results on several in-the-wild FER benchmarks: CAER-S, AffectNet-7, AffectNet-8, RAF-DB, and SFEW, with accuracies of 88.42%, 64.53%, 60.29%, 88.40%, and 59.40%, respectively. The code and training logs are publicly available at https://github.com/zengqunzhao/MA-Net.
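The phrase "features with different receptive fields" can be made concrete with a little arithmetic. The following is a minimal sketch, assuming plain stacked convolutions without dilation. The function name `receptive_field` and the three branch layouts are hypothetical illustrations and are not taken from the MA-Net code or paper.

```python
def receptive_field(layers):
    """Receptive field, in input pixels, of a stack of convolution layers.

    `layers` is a list of (kernel_size, stride) pairs, first layer first.
    Dilation is assumed to be 1 throughout.
    """
    rf = 1    # a single input pixel sees only itself
    jump = 1  # spacing, in input pixels, between neighbouring outputs
    for kernel_size, stride in layers:
        rf += (kernel_size - 1) * jump
        jump *= stride
    return rf


# Hypothetical branches with increasing depth (NOT the MA-Net configuration).
branches = {
    "shallow": [(3, 1)],
    "medium": [(3, 1), (3, 1)],
    "deep": [(3, 1), (3, 1), (3, 1)],
}

fields = {name: receptive_field(layers) for name, layers in branches.items()}
for name, size in fields.items():
    print(f"{name}: {size}x{size} input pixels")
```

Under these assumptions, branches of different depth cover different spatial extents of the input, so fusing their outputs combines fine and coarse context. How MA-Net actually builds and fuses its branches is described in the paper and repository, not here.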
Pages: 6544 - 6556
Number of pages: 13
Related papers
50 items in total
  • [31] Facial Expression Recognition Based on Multi-scale Vector Triangle
    Jiang, He
    Hu, Min
    Chen, Hongbo
    Li, Kun
    Wang, Xiaohua
    Ren, Fuji
    2013 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2013, : 82 - 87
  • [32] Development of a Robust Multi-Scale Featured Local Binary Pattern for Improved Facial Expression Recognition
    Yasmin, Suraiya
    Pathan, Refat Khan
    Biswas, Munmun
    Khandaker, Mayeen Uddin
    Faruque, Mohammad Rashed Iqbal
    SENSORS, 2020, 20 (18) : 1 - 17
  • [33] Reliable facial expression recognition for multi-scale images using weber local binary image based cosine transform features
    Khan, Sajid Ali
    Hussain, Ayyaz
    Usman, Muhammad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (01) : 1133 - 1165
  • [35] Facial Expression Recognition Based on Multi-scale Feature Fusion Convolutional Neural Network and Attention Mechanism
    Wu, Yana
    Jia, Kebin
    Sun, Zhonghua
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 324 - 335
  • [36] Multi-scale pedestrian detection with global-local attention and multi-scale receptive field context
    Xue, Pan
    Chen, Houjin
    Li, Yanfeng
    Li, Jupeng
    IET COMPUTER VISION, 2023, 17 (01) : 13 - 25
  • [37] Learning Expression Features via Deep Residual Attention Networks for Facial Expression Recognition From Video Sequences
    Zhao, Xiaoming
    Chen, Gang
    Chuang, Yuelong
    Tao, Xin
    Zhang, Shiqing
    IETE TECHNICAL REVIEW, 2021, 38 (06) : 602 - 610
  • [38] Facial Expression Recognition Based on Local Features of Transfer Learning
    Feng, Haiqiang
    Shao, Jingfeng
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 71 - 76
  • [39] Deep Learning Based Mobilenet and Multi-Head Attention Model for Facial Expression Recognition
    Nouisser, Aicha
    Zouari, Ramzi
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 485 - 491
  • [40] Feature fusion of multi-granularity and multi-scale for facial expression recognition
    Xia, Haiying
    Lu, Lidan
    Song, Shuxiang
    VISUAL COMPUTER, 2024, 40 (03) : 2035 - 2047