MMATrans: Muscle Movement Aware Representation Learning for Facial Expression Recognition via Transformers

被引:13
|
作者
Liu, Hai [1 ]
Zhou, Qiyun [1 ]
Zhang, Cheng [1 ]
Zhu, Junyan [1 ]
Liu, Tingting [2 ,3 ]
Zhang, Zhaoli [1 ]
Li, You-Fu [4 ,5 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China
[2] Univ Hong Kong, Fac Educ, Hong Kong, Peoples R China
[3] Hubei Univ, Sch Educ, Wuhan 430062, Peoples R China
[4] City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China
[5] City Univ Hong Kong Shenzhen Res Inst, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Facial muscles; Muscles; Visualization; Transformers; Semantics; Representation learning; Critical minority; facial expression recognition (FER); facial muscle movement; human-robot interaction; semantic relationships; visual transformer;
D O I
10.1109/TII.2024.3431640
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How to automatically recognize facial expression has caused concerns in industrial human-robot interaction. However, facial expression recognition (FER) is susceptible to problems, such as occlusion, arbitrary orientations, and illumination. To effectively address these challenges in FER, we present a novel facial muscle movement aware representation learning that can learn the semantic relationships of facial muscle movements in facial expression images. Two key findings are revealed: 1) muscle movements from different facial regions often show semantic relationships; and 2) not all facial muscle regions have equal contributions for different facial expressions. On this basis, this model presents two novel modules, namely, discriminative feature generation (DFG) and muscle relationship mining (MRM). Specifically, in DFG, the memory of our model for mislabeling decreases. In MRM, muscle-motion interaction among diverse facial regions is learned through visual transformers (MMATrans). Experiments on three in-the-wild FER datasets (RAF-DB, FERPlus, and AffectNet) show that our MMATrans yields better performance compared with state-of-the-art methods.
引用
收藏
页码:13753 / 13764
页数:12
相关论文
共 50 条
  • [21] Facial Expression Recognition via Broad Learning System
    Zhang, Tong
    Liu, Zhu-lin
    Wang, Xue-han
    Xing, Xiao-fen
    Chen, C. L. Philip
    Chen, Enhong
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 1898 - 1902
  • [22] Identity-Aware Facial Expression Recognition Via Deep Metric Learning Based on Synthesized Images
    Huang, Wei
    Zhang, Siyuan
    Zhang, Peng
    Zha, Yufei
    Fang, Yuming
    Zhang, Yanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3327 - 3339
  • [23] AU-Expression Knowledge Constrained Representation Learning for Facial Expression Recognition
    Pu, Tao
    Chen, Tianshui
    Xie, Yuan
    Wu, Hefeng
    Lin, Liang
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 11154 - 11161
  • [24] Uncertainty-aware Label Distribution Learning for Facial Expression Recognition
    Le, Nhat
    Nguyen, Khanh
    Tran, Quang
    Tjiputra, Erman
    Le, Bac
    Nguyen, Anh
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6077 - 6086
  • [25] Occlusion-aware facial expression recognition: A deep learning approach
    Palanichamy Naveen
    Multimedia Tools and Applications, 2024, 83 : 32895 - 32921
  • [26] Occlusion-aware facial expression recognition: A deep learning approach
    Naveen, Palanichamy
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 32895 - 32921
  • [27] FACIAL EXPRESSION RECOGNITION VIA GABOR WAVELET AND STRUCTURED SPARSE REPRESENTATION
    Chen, Ting
    Su, Fei
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012), 2012, : 420 - 424
  • [28] A solution for facial expression representation and recognition
    Dubuisson, S
    Davoine, F
    Masson, M
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2002, 17 (09) : 657 - 673
  • [29] Facial Expression Recognition Using Facial Movement Features
    Zhang, Ligang
    Tjondronegoro, Dian
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2011, 2 (04) : 219 - 229
  • [30] Facial expression recognition via learning deep sparse autoencoders
    Zeng, Nianyin
    Zhang, Hong
    Song, Baoye
    Liu, Weibo
    Li, Yurong
    Dobaie, Abdullah M.
    NEUROCOMPUTING, 2018, 273 : 643 - 649