Towards unsupervised learning of joint facial landmark detection and head pose estimation

被引:0
|
作者
Zou, Zhiming [1 ]
Jia, Dian [1 ]
Tang, Wei [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
美国国家科学基金会;
关键词
Facial landmark detection; Head pose estimation; Unsupervised learning;
D O I
10.1016/j.patcog.2025.111393
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning approaches have advanced state-of-the-art performance drastically in facial landmark detection and head pose estimation. Recent work shows that meaningful landmarks could be discovered from unlabeled image collections. However, they only mine local visual patterns in images as 2D landmarks while ignoring the 3D object structure. Consequently, they can neither directly estimate the object pose from an image nor use it for improved landmark discovery. Therefore, we propose a novel framework that jointly learns both tasks. It includes a multi-task network for joint landmark and pose prediction, a set of learnable 3D canonical landmarks, and an image generation network. They are learned collaboratively on unlabeled face images through an integrated loss of conditional image generation and geometric consistency. We also investigate different strategies to handle potential face deformation. Extensive experiments show that our approach is very effective in both tasks, even comparable to some supervised methods. The code is available at https://github.com/ZhimingZo/unsup-face-analysis
引用
收藏
页数:13
相关论文
共 50 条
  • [41] SUBSPACE LEARNING FOR HUMAN HEAD POSE ESTIMATION
    Ru, Yuxiao
    Huang, Thomas S.
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1585 - 1588
  • [42] Recurrent 3D-2D Dual Learning for Large-pose Facial Landmark Detection
    Xiao, Shengtao
    Feng, Jiashi
    Liu, Luoqi
    Nie, Xuecheng
    Wang, Wei
    Yan, Shuicheng
    Kassim, Ashraf
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1642 - 1651
  • [43] Deep Learning for Head Pose Estimation: A Survey
    Asperti A.
    Filippini D.
    SN Computer Science, 4 (4)
  • [44] Learning toward practical head pose estimation
    Sang, Gaoli
    He, Feixiang
    Zhu, Rong
    Xuan, Shibin
    OPTICAL ENGINEERING, 2017, 56 (08)
  • [45] Head Pose Estimation using Transfer Learning
    Sreekanth, Pavan
    Kulkarni, Uday
    Shetty, Sachin
    Meena, S. M.
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING (ICRTAC-CPS 2018), 2018, : 73 - 79
  • [46] Head pose estimation by nonlinear manifold learning
    Raytchev, B
    Yoda, I
    Sakaue, K
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 462 - 466
  • [47] Learning Joint Structure for Human Pose Estimation
    Feng, Shenming
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (03)
  • [48] Pose Invariant 3D Facial Landmark Detection Via Pose Normalization and Deep Regression
    Zhang, Jingchen
    Gao, Kangkang
    Zhao, Qijun
    Wang, Daning
    PROCEEDINGS OF 2020 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MACHINE VISION AND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND MACHINE LEARNING, IPMV 2020, 2020, : 74 - 78
  • [49] Facial Expression Recognition for Different Pose Faces Based on Special Landmark Detection
    Wu, Wenqi
    Yin, Yingjie
    Wang, Yingying
    Wang, Xingang
    Xu, De
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1524 - 1529
  • [50] Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
    Cao, Zhiwen
    Liu, Dongfang
    Wang, Qifan
    Chen, Yingjie
    COMPUTER VISION, ECCV 2022, PT XII, 2022, 13672 : 737 - 753