RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild

被引:7
|
作者
Berral-Soler, Rafael [1 ]
Madrid-Cuevas, Francisco J. [1 ]
Munoz-Salinas, Rafael [1 ]
Marin-Jimenez, Manuel J. [1 ]
机构
[1] Univ Cordoba, Dept Comp & Numer Anal, Cordoba, Spain
来源
NEURAL COMPUTING & APPLICATIONS | 2021年 / 33卷 / 13期
关键词
Human head pose estimation; ConvNets; Human-computer interaction; Deep Learning; FRAMEWORK;
D O I
10.1007/s00521-020-05511-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human head pose estimation in images has applications in many fields such as human-computer interaction or video surveillance tasks. In this work, we address this problem, defined here as the estimation of both vertical (tilt/pitch) and horizontal (pan/yaw) angles, through the use of a single Convolutional Neural Network (ConvNet) model, trying to balance precision and inference speed in order to maximize its usability in real-world applications. Our model is trained over the combination of two datasets: 'Pointing'04' (aiming at covering a wide range of poses) and 'Annotated Facial Landmarks in the Wild' (in order to improve robustness of our model for its use on real-world images). Three different partitions of the combined dataset are defined and used for training, validation and testing purposes. As a result of this work, we have obtained a trained ConvNet model, coined RealHePoNet, that given a low-resolution grayscale input image, and without the need of using facial landmarks, is able to estimate with low error both tilt and pan angles (4.4 degrees average error on the test partition). Also, given its low inference time (6 ms per head), we consider our model usable even when paired with medium-spec hardware (i.e. GTX 1060 GPU). Code available at: https://github.com/rafabs97/headpose_final Demo video at: https://www.youtube.com/watch?v=2UeuXh5DjAE.
引用
收藏
页码:7673 / 7689
页数:17
相关论文
共 50 条
  • [21] Pose Anchor: A Single-Stage Hand Keypoint Detection Network
    Li, Yuan
    Wang, Xinggang
    Liu, Wenyu
    Feng, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2104 - 2113
  • [22] PlaneSegNet: Fast and Robust Plane Estimation Using a Single-stage Instance Segmentation CNN
    Xie, Yaxu
    Rambach, Jason
    Shu, Fangwen
    Stricker, Didier
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13574 - 13580
  • [23] Head Pose Estimation in the Wild using Approximate View Manifolds
    Sundararajan, Kalaivani
    Woodard, Damon L.
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [24] Three-Dimensional Head Pose Estimation in-the-Wild
    Peng, Xi
    Huang, Junzhou
    Hu, Qiong
    Zhang, Shaoting
    Metaxas, Dimitris N.
    2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 1, 2015,
  • [25] Multi-Task Head Pose Estimation in-the-Wild
    Valle, Roberto
    Buenaposada, Jose M.
    Baumela, Luis
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (08) : 2874 - 2881
  • [26] Single-stage magnetorheological damper parameter estimation
    Vavreck, AN
    SMART MATERIALS & STRUCTURES, 2002, 11 (04): : 596 - 598
  • [27] Robust Head Pose Estimation using ContourletSD Transform and GLCM
    Meydanipour, Gelareh
    Faez, Karim
    2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 375 - 380
  • [28] AUTOMATIC AND ROBUST HEAD POSE ESTIMATION BY BLOCK ENERGY MAP
    Li, Wei
    Huang, Yan
    Peng, Jingliang
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3357 - 3361
  • [29] Latent Embedding Clustering for Occlusion Robust Head Pose Estimation
    Celestino, Jose
    Marques, Manuel
    Nascimento, Jacinto C.
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [30] Robust head pose estimation via supervised manifold learning
    Wang, Chao
    Song, Xubo
    NEURAL NETWORKS, 2014, 53 : 15 - 25