Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition

Cited by: 3
Authors
Yan, Jingwei [1]
Wang, Jingjing [1]
Li, Qiang [1]
Wang, Chunmao [1]
Pu, Shiliang [1]
Affiliations
[1] Hikvision Research Institute, Hangzhou 310051, People's Republic of China
Keywords
Gold; Task analysis; Face recognition; Feature extraction; Representation learning; Optical imaging; Facial muscles; Facial action unit recognition; regional and temporal feature learning; weakly supervised learning
DOI
10.1109/TMM.2022.3160061
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Automatic facial action unit (AU) recognition is a challenging task due to the scarcity of manual annotations. To alleviate this problem, considerable effort has been devoted to weakly supervised methods that leverage large amounts of unlabeled data. However, several properties unique to AUs, such as their regional and relational characteristics, have not been sufficiently explored in previous work. Motivated by this, we take these AU properties into consideration and propose two auxiliary AU-related tasks that use unlabeled data in a self-supervised manner to bridge the gap between limited annotations and model performance. Specifically, to enhance the discrimination of regional features with AU relation embedding, we design an RoI inpainting task that recovers randomly cropped AU patches. Meanwhile, a single-image-based optical flow estimation task is proposed to leverage the dynamic change of facial muscles and encode motion information into the global feature representation. With these two self-supervised auxiliary tasks, the backbone network better captures the local features, mutual relations, and motion cues of AUs. Furthermore, by incorporating semi-supervised learning, we propose an end-to-end trainable framework named weakly supervised regional and temporal learning (WSRTL) for AU recognition. Extensive experiments on BP4D and DISFA demonstrate the superiority of our method, which achieves new state-of-the-art performance.
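The RoI inpainting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how patches are selected, what fills them, or which loss is used. The function names, the `(top, left, height, width)` box format, the zero fill value, and the masked mean-squared-error loss below are all assumptions chosen for illustration.

```python
import random


def mask_random_rois(image, rois, num_masked, rng=None):
    """Return a copy of `image` with `num_masked` randomly chosen RoIs zeroed.

    image: 2-D list of pixel values (rows of equal length).
    rois: list of (top, left, height, width) boxes; hypothetical AU patches.
    The zero fill value is an assumption, not taken from the paper.
    """
    rng = rng or random.Random()
    chosen = rng.sample(rois, num_masked)
    masked = [row[:] for row in image]  # copy rows so the input is untouched
    for top, left, height, width in chosen:
        for r in range(top, top + height):
            for c in range(left, left + width):
                masked[r][c] = 0
    return masked, chosen


def masked_mse(original, predicted, chosen):
    """Mean squared error over the pixels inside the masked RoIs only.

    An inpainting model would be trained to drive this toward zero;
    the choice of MSE is an assumption for this sketch.
    """
    total, count = 0.0, 0
    for top, left, height, width in chosen:
        for r in range(top, top + height):
            for c in range(left, left + width):
                diff = original[r][c] - predicted[r][c]
                total += diff * diff
                count += 1
    return total / count


if __name__ == "__main__":
    image = [[1] * 6 for _ in range(6)]
    rois = [(0, 0, 2, 2), (3, 3, 2, 2), (0, 4, 2, 2)]  # hypothetical boxes
    masked, chosen = mask_random_rois(image, rois, 2, random.Random(0))
    # Loss of a "model" that simply returns the masked input unchanged.
    print(masked_mse(image, masked, chosen))
```

In a training loop, `masked` would be the network input and the original pixels inside `chosen` would be the reconstruction target, so the supervision signal comes from the unlabeled image itself.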
Pages: 1760-1772
Page count: 13
Related papers
50 in total
  • [1] Weakly Supervised Dual Learning for Facial Action Unit Recognition
    Wang, Shangfei
    Peng, Guozhu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 3218 - 3230
  • [2] Weakly Supervised Facial Action Unit Recognition With Domain Knowledge
    Wang, Shangfei
    Peng, Guozhu
    Chen, Shiyu
    Ji, Qiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (11) : 3265 - 3276
  • [3] Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition
    Yan, Jingwei
    Wang, Jingjing
    Li, Qiang
    Wang, Chunmao
    Pu, Shiliang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1038 - 1046
  • [4] Weakly Supervised Facial Action Unit Recognition through Adversarial Training
    Peng, Guozhu
    Wang, Shangfei
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2188 - 2196
  • [5] Dual Semi-Supervised Learning for Facial Action Unit Recognition
    Peng, Guozhu
    Wang, Shangfei
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8827 - 8834
  • [6] Action Unit Memory Network for Weakly Supervised Temporal Action Localization
    Luo, Wang
    Zhang, Tianzhu
    Yang, Wenfei
    Liu, Jingen
    Mei, Tao
    Wu, Feng
    Zhang, Yongdong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9964 - 9974
  • [7] Weakly Supervised Temporal Action Detection With Temporal Dependency Learning
    Li, Bairong
    Liu, Ruixin
    Chen, Tianquan
    Zhu, Yuesheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4473 - 4485
  • [8] Pursuing Knowledge Consistency: Supervised Hierarchical Contrastive Learning for Facial Action Unit Recognition
    Chen, Yingjie
    Chen, Chong
    Luo, Xiao
    Huang, Jianqiang
    Hua, Xian-Sheng
    Wang, Tao
    Liang, Yun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [9] Drop-relationship learning for semi-supervised facial action unit recognition
    Hu, Xin
    Zhi, Ruicong
    Zhou, Caixia
    NEUROCOMPUTING, 2023, 550
  • [10] Temporal RPN Learning for Weakly-Supervised Temporal Action Localization
    Huang, Jing
    Kong, Ming
    Chen, Luyuan
    Liang, Tian
    Zhu, Qiang
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222