Towards unsupervised learning of joint facial landmark detection and head pose estimation

被引:0
|
作者
Zou, Zhiming [1 ]
Jia, Dian [1 ]
Tang, Wei [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
美国国家科学基金会;
关键词
Facial landmark detection; Head pose estimation; Unsupervised learning;
D O I
10.1016/j.patcog.2025.111393
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning approaches have advanced state-of-the-art performance drastically in facial landmark detection and head pose estimation. Recent work shows that meaningful landmarks could be discovered from unlabeled image collections. However, they only mine local visual patterns in images as 2D landmarks while ignoring the 3D object structure. Consequently, they can neither directly estimate the object pose from an image nor use it for improved landmark discovery. Therefore, we propose a novel framework that jointly learns both tasks. It includes a multi-task network for joint landmark and pose prediction, a set of learnable 3D canonical landmarks, and an image generation network. They are learned collaboratively on unlabeled face images through an integrated loss of conditional image generation and geometric consistency. We also investigate different strategies to handle potential face deformation. Extensive experiments show that our approach is very effective in both tasks, even comparable to some supervised methods. The code is available at https://github.com/ZhimingZo/unsup-face-analysis
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Joint Facial Landmark Detection and Action Estimation Based on Deep Probabilistic Random Forest
    Yu, Jun
    Chen, Chang Wen
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [22] WNet: Joint Multiple Head Detection and Head Pose Estimation from a Spectator Crowd Image
    Jan, Yasir
    Sohel, Ferdous
    Shiratuddin, Mohd Fairuz
    Wong, Kok Wai
    COMPUTER VISION - ACCV 2018 WORKSHOPS, 2019, 11367 : 484 - 493
  • [23] Facial tracking with head pose estimation in stereo vision
    Huang, Y
    Huang, T
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2002, : 833 - 836
  • [24] Facial Landmark, Head Pose, and Occlusion Analysis Using Multitask Stacked Hourglass
    Kim, Youngsam
    Roh, Jong-Hyuk
    Kim, Soohyung
    IEEE ACCESS, 2023, 11 : 30970 - 30981
  • [25] Head pose estimation based on detecting facial features
    Hatem, Hiyam
    Beiji, Zou
    Majeed, Raed
    Waleed, Jumana
    Lutf, Mohammed
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (03): : 311 - 322
  • [26] Joint Learning of Object Detection and Pose Estimation using Augmented Autoencoder
    Hayashi, Ryota
    Shimokura, Asei
    Matsumoto, Takuya
    Ukita, Norimichi
    PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [27] TRFH: towards real-time face detection and head pose estimation
    Chen, Shicun
    Zhang, Yong
    Yin, Baocai
    Wang, Boyue
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (04) : 1745 - 1755
  • [28] TRFH: towards real-time face detection and head pose estimation
    Shicun Chen
    Yong Zhang
    Baocai Yin
    Boyue Wang
    Pattern Analysis and Applications, 2021, 24 : 1745 - 1755
  • [29] Coupled cascade regression from real and synthesized faces for simultaneous landmark detection and head pose estimation
    Gou, Chao
    Ji, Qiang
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (02)
  • [30] Unsupervised Incremental Learning for Hand Shape and Pose Estimation
    Kalshetti, Pratik
    Chaudhuri, Parag
    SIGGRAPH '19 - ACM SIGGRAPH 2019 POSTERS, 2019,