Facial Action Unit detection based on multi-task learning strategy for unlabeled facial images in the wild

被引:2
|
作者
Shang, Ziqiao [1 ]
Liu, Bin [2 ]
机构
[1] Huazhong Univ Sci & Technol HUST, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[2] Southwest Jiaotong Univ SWJTU, Sch Comp & Artificial Intelligence, Chengdu 610031, Peoples R China
关键词
Facial action unit detection; Multi-task learning strategy; Pixel-level feature alignment scheme; Weighted asymmetric loss;
D O I
10.1016/j.eswa.2024.124285
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial Action Unit (AU) detection often relies on highly -cost accurate labeling or inaccurate pseudo labeling techniques in recent years. How to introduce large amounts of unlabeled facial images in the wild into supervised AU detection frameworks has become a challenging problem. Additionally, nearly every type of AUs has the problem of unbalanced positive and negative samples. Inspired by other multi -task learning frameworks, we first propose a multi -task learning strategy boosting AU detection in the wild through jointing facial landmark detection and AU domain separation and reconstruction. Our introduced dual domains facial landmark detection framework can solve the lack of accurate facial landmark coordinates during the AU domain separation and reconstruction training process, while the parameters of homostructural facial extraction modules from these two similar facial tasks are shared. Moreover, we propose a pixel -level feature alignment scheme to maintain the consistency of features obtained from two separation and reconstruction processes. Furthermore, a weighted asymmetric loss is proposed to change the contribution of positive and negative samples of each type of AUs to model parameters updating. Experimental results on three widely used benchmarks demonstrate our superiority to most state-of-the-art methods for AU detection.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Deep Region and Multi-label Learning for Facial Action Unit Detection
    Zhao, Kaili
    Chu, Wen-Sheng
    Zhang, Honggang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3391 - 3399
  • [32] Joint Patch and Multi-label Learning for Facial Action Unit Detection
    Zhao, Kaili
    Chu, Wen-Sheng
    De la Torre, Fernando
    Cohn, Jeffrey F.
    Zhang, Honggang
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2207 - 2216
  • [33] Multi-task multi-scale attention learning-based facial age estimation
    Shi, Chaojun
    Zhao, Shiwei
    Zhang, Ke
    Feng, Xiaohan
    IET SIGNAL PROCESSING, 2023, 17 (02)
  • [34] Multi-task Learning with Labeled and Unlabeled Tasks
    Pentina, Anastasia
    Lampert, Christoph H.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [35] Meta Auxiliary Learning for Facial Action Unit Detection
    Li, Yong
    Shan, Shiguang
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2526 - 2538
  • [36] Facial Landmark Detection via Self-adaption Model and Multi-task Feature Learning
    Xie, Zhengnan
    Jin, Yi
    Bian, Peng
    Zhou, Wei
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 113 - 117
  • [37] Residual multi-task learning for facial landmark localization and expression recognition
    Chen, Boyu
    Guan, Wenlong
    Li, Peixia
    Ikeda, Naoki
    Hirasawa, Kosuke
    Lu, Huchuan
    PATTERN RECOGNITION, 2021, 115
  • [38] Multi-task Facial Landmark Detection Network for Early ASD Screening
    Lin, Ruihan
    Zhang, Hanlin
    Wang, Xinming
    Ren, Weihong
    Wu, Wenhao
    Liu, Zuode
    Xu, Xiu
    Xu, Qiong
    Liu, Honghai
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT I, 2022, 13455 : 381 - 391
  • [39] Safe Facial Multi-attribute Feature Fusion Analysis Based on Embedded Multi-Task Learning
    Liu, Haibo
    Li, Qianmu
    Meng, Shunmei
    Rong, Zhenbang
    2019 22ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (IEEE CSE 2019) AND 17TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (IEEE EUC 2019), 2019, : 441 - 446
  • [40] A multi-task meta-learner-based ensemble for robust facial expression recognition in-the-wild
    Khelifa, Afifa
    Ghazouani, Haythem
    Barhoumi, Walid
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (5-6) : 4007 - 4027