Facial Action Unit detection based on multi-task learning strategy for unlabeled facial images in the wild

被引:2
|
作者
Shang, Ziqiao [1 ]
Liu, Bin [2 ]
机构
[1] Huazhong Univ Sci & Technol HUST, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[2] Southwest Jiaotong Univ SWJTU, Sch Comp & Artificial Intelligence, Chengdu 610031, Peoples R China
关键词
Facial action unit detection; Multi-task learning strategy; Pixel-level feature alignment scheme; Weighted asymmetric loss;
D O I
10.1016/j.eswa.2024.124285
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial Action Unit (AU) detection often relies on highly -cost accurate labeling or inaccurate pseudo labeling techniques in recent years. How to introduce large amounts of unlabeled facial images in the wild into supervised AU detection frameworks has become a challenging problem. Additionally, nearly every type of AUs has the problem of unbalanced positive and negative samples. Inspired by other multi -task learning frameworks, we first propose a multi -task learning strategy boosting AU detection in the wild through jointing facial landmark detection and AU domain separation and reconstruction. Our introduced dual domains facial landmark detection framework can solve the lack of accurate facial landmark coordinates during the AU domain separation and reconstruction training process, while the parameters of homostructural facial extraction modules from these two similar facial tasks are shared. Moreover, we propose a pixel -level feature alignment scheme to maintain the consistency of features obtained from two separation and reconstruction processes. Furthermore, a weighted asymmetric loss is proposed to change the contribution of positive and negative samples of each type of AUs to model parameters updating. Experimental results on three widely used benchmarks demonstrate our superiority to most state-of-the-art methods for AU detection.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] 3D facial landmark detection based on differential cylindrical projection and multi-task learning
    Terada, Takuma
    Kimura, Ryusuke
    Chen, Yen-Wei
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2020, 20 (04) : 443 - 459
  • [22] Multi-task feature learning-based improved supervised descent method for facial landmark detection
    Peng Bian
    Zhengnan Xie
    Yi Jin
    Signal, Image and Video Processing, 2018, 12 : 17 - 24
  • [23] Multi-task feature learning-based improved supervised descent method for facial landmark detection
    Bian, Peng
    Xie, Zhengnan
    Jin, Yi
    SIGNAL IMAGE AND VIDEO PROCESSING, 2018, 12 (01) : 17 - 24
  • [24] Facial Attributes Classification using Multi-Task Representation Learning
    Ehrlich, Max
    Shields, Timothy J.
    Almaev, Timur
    Amer, Mohamed R.
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 752 - 760
  • [25] Facial Expression Recognition by Regional Attention and Multi-task Learning
    Cui, Longlei
    Tian, Ying
    ENGINEERING LETTERS, 2021, 29 (03) : 919 - 925
  • [26] Multi-task Learning of Cascaded CNN for Facial Attribute Classification
    Zhuang, Ni
    Yan, Yan
    Chen, Si
    Wang, Hanzi
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2069 - 2074
  • [27] Automatic Facial Attractiveness Prediction by Deep Multi-Task Learning
    Gao, Lian
    Li, Weixin
    Huang, Zehua
    Huang, Di
    Wang, Yunhong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3592 - 3597
  • [28] Discriminative deep multi-task learning for facial expression recognition
    Zheng, Hao
    Wang, Ruili
    Ji, Wanting
    Zong, Ming
    Wong, Wai Keung
    Lai, Zhihui
    Lv, Hexin
    INFORMATION SCIENCES, 2020, 533 : 60 - 71
  • [29] Multi-Task Learning with Knowledge Transfer for Facial Attribute Classification
    Fanhe, Xiaohui
    Guo, Jie
    Huang, Zheng
    Qiu, Weidong
    Zhang, Yuele
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2019, : 877 - 882
  • [30] Heterogeneous Multi-task Learning on Non-overlapping Datasets for Facial Landmark Detection
    Semitsu, Takayuki
    Zhao, Xiongxin
    Matsumoto, Wataru
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949 : 616 - 625