Facial Action Unit detection based on multi-task learning strategy for unlabeled facial images in the wild

被引:2
|
作者
Shang, Ziqiao [1 ]
Liu, Bin [2 ]
机构
[1] Huazhong Univ Sci & Technol HUST, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[2] Southwest Jiaotong Univ SWJTU, Sch Comp & Artificial Intelligence, Chengdu 610031, Peoples R China
关键词
Facial action unit detection; Multi-task learning strategy; Pixel-level feature alignment scheme; Weighted asymmetric loss;
D O I
10.1016/j.eswa.2024.124285
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial Action Unit (AU) detection often relies on highly -cost accurate labeling or inaccurate pseudo labeling techniques in recent years. How to introduce large amounts of unlabeled facial images in the wild into supervised AU detection frameworks has become a challenging problem. Additionally, nearly every type of AUs has the problem of unbalanced positive and negative samples. Inspired by other multi -task learning frameworks, we first propose a multi -task learning strategy boosting AU detection in the wild through jointing facial landmark detection and AU domain separation and reconstruction. Our introduced dual domains facial landmark detection framework can solve the lack of accurate facial landmark coordinates during the AU domain separation and reconstruction training process, while the parameters of homostructural facial extraction modules from these two similar facial tasks are shared. Moreover, we propose a pixel -level feature alignment scheme to maintain the consistency of features obtained from two separation and reconstruction processes. Furthermore, a weighted asymmetric loss is proposed to change the contribution of positive and negative samples of each type of AUs to model parameters updating. Experimental results on three widely used benchmarks demonstrate our superiority to most state-of-the-art methods for AU detection.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] On Multi-task Learning for Facial Action Unit Detection
    Zhang, Xiao
    Mahoor, Mohammad H.
    Nielsen, Rodney D.
    PROCEEDINGS OF 2013 28TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ 2013), 2013, : 202 - 207
  • [2] Task-dependent multi-task multiple kernel learning for facial action unit detection
    Zhang, Xiao
    Mahoor, Mohammad H.
    PATTERN RECOGNITION, 2016, 51 : 187 - 196
  • [3] Facial Action Unit Detection with Multilayer Fused Multi-Task and Multi-Label Deep Learning Network
    He, Jun
    Li, Dongliang
    Bo, Sun
    Yu, Lejun
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (11) : 5546 - 5559
  • [4] MULTI-TASK LEARNING OF EMOTION RECOGNITION AND FACIAL ACTION UNIT DETECTION WITH ADAPTIVELY WEIGHTS SHARING NETWORK
    Wang, Chu
    Zeng, Jiabei
    Shan, Shiguang
    Chen, Xilin
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 56 - 60
  • [5] A Co-regularization Facial Emotion Recognition Based on Multi-Task Facial Action Unit Recognition
    Udeh, Chinonso Paschal
    Chen, Luefeng
    Du, Sheng
    Li, Min
    Wu, Min
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6806 - 6810
  • [6] Facial Landmark Detection by Deep Multi-task Learning
    Zhang, Zhanpeng
    Luo, Ping
    Loy, Chen Change
    Tang, Xiaoou
    COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 94 - 108
  • [7] Efficient Multi-task based Facial Landmark and Gesture Detection in Monocular Images
    Goenetxea, Jon
    Unzueta, Luis
    Elordi, Unai
    Otaegui, Oihana
    Dornaika, Fadi
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 680 - 687
  • [8] Facial Action Unit Recognition in the Wild with Multi-Task CNN Self-Training for the EmotioNet Challenge
    Werner, Philipp
    Saxen, Frerk
    Al-Hamadi, Ayoub
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1649 - 1652
  • [9] Thermal Facial Landmark Detection by Deep Multi-Task Learning
    Chu, Wei-Ta
    Liu, Yu-Hui
    2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,
  • [10] Pose-independent Facial Action Unit Intensity Regression Based on Multi-task Deep Transfer Learning
    Zhou, Yuqian
    Pi, Jimin
    Shi, Bertram E.
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 872 - 877