RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild

Cited by: 7
Authors
Berral-Soler, Rafael [1]
Madrid-Cuevas, Francisco J. [1]
Munoz-Salinas, Rafael [1]
Marin-Jimenez, Manuel J. [1]
Affiliations
[1] Univ Cordoba, Dept Comp & Numer Anal, Cordoba, Spain
Source
NEURAL COMPUTING & APPLICATIONS | 2021 / Vol. 33 / Issue 13
Keywords
Human head pose estimation; ConvNets; Human-computer interaction; Deep Learning; FRAMEWORK
DOI
10.1007/s00521-020-05511-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Human head pose estimation in images has applications in many fields, such as human-computer interaction and video surveillance. In this work, we address this problem, defined here as the estimation of both vertical (tilt/pitch) and horizontal (pan/yaw) angles, using a single Convolutional Neural Network (ConvNet) model that aims to balance precision and inference speed in order to maximize its usability in real-world applications. Our model is trained on the combination of two datasets: 'Pointing'04' (to cover a wide range of poses) and 'Annotated Facial Landmarks in the Wild' (to improve the robustness of the model on real-world images). Three different partitions of the combined dataset are defined and used for training, validation and testing. The result of this work is a trained ConvNet model, coined RealHePoNet, that, given a low-resolution grayscale input image and without using facial landmarks, estimates both tilt and pan angles with low error (4.4 degrees average error on the test partition). Given its low inference time (6 ms per head), we consider our model usable even when paired with medium-spec hardware (e.g. a GTX 1060 GPU). Code is available at https://github.com/rafabs97/headpose_final and a demo video at https://www.youtube.com/watch?v=2UeuXh5DjAE.
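The abstract reports an average error in degrees and a per-head inference time, but does not say how the average is formed. The sketch below shows one plausible reading, not the authors' evaluation code: it assumes the figure is the mean absolute error (MAE) of each angle, averaged over tilt and pan. The function names and the sample angles are hypothetical and chosen only for illustration.

```python
from statistics import mean


def mean_absolute_error(predicted, actual):
    """Mean absolute difference, in degrees, between two equal-length angle lists."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    return mean(abs(p - a) for p, a in zip(predicted, actual))


def average_pose_error(pred_tilt, true_tilt, pred_pan, true_pan):
    """Return (tilt MAE, pan MAE, mean of the two), all in degrees.

    Assumption: the overall error is the unweighted mean of the two
    per-angle MAEs. The abstract does not confirm this definition.
    """
    tilt_mae = mean_absolute_error(pred_tilt, true_tilt)
    pan_mae = mean_absolute_error(pred_pan, true_pan)
    return tilt_mae, pan_mae, (tilt_mae + pan_mae) / 2


def heads_per_second(ms_per_head):
    """Convert a per-head inference time in milliseconds to heads per second."""
    return 1000.0 / ms_per_head


# Made-up angles (degrees) for three heads, for illustration only.
result = average_pose_error(
    pred_tilt=[10.0, -5.0, 0.0], true_tilt=[15.0, -15.0, 0.0],
    pred_pan=[30.0, -60.0, 5.0], true_pan=[30.0, -45.0, -10.0],
)
print(result)  # (5.0, 10.0, 7.5)
print(round(heads_per_second(6.0), 1))  # 166.7
```

Under this reading, the reported 6 ms per head corresponds to roughly 166 heads per second on the stated hardware, before any face detection or preprocessing cost is counted.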
Pages: 7673 - 7689
Page count: 17