LPHD: A LARGE-SCALE HEAD POSE DATASET FOR RGB IMAGES

被引:1
|
作者
Sun, Wei [1 ]
Fan, Yezhao [1 ]
Min, Xiongkuo [1 ]
Peng, Shihao [1 ]
Ma, Siwei [2 ]
Zhai, Guangtao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commu & Infor Proce, Shanghai, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
head pose dataset; head pose estimation; facial landmark detection; convolution nerual network; MOTION;
D O I
10.1109/ICME.2019.00190
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Head pose estimation has attracted many research interest in recent years. With the advent of deep learning, it is possible to predict the head pose accurately from the RGB images without the help of facial landmarks or depth information. However, existing head pose datasets often lack large pose head images, which extremely limits the development of head pose estimation algorithms. In this paper, we build the large-scale head pose dataset (LHPD) including more than 140,000 images with the diverse and accurate head poses. The LHPD dataset includes the head images recorded from different shooting angles between the camera and the human body for the first time, which greatly expands the range of head pose compared to previous datasets. Therefore, the range of head pose can cover +/- 90. for each Euler angle. The accurate and reliable head pose annotation is labeled by the motion capture system and careful calibration procedures. We then propose a head pose estimation method through fine-tuning the ResNet on the LHPD dataset when using the Euclidean distance of quaternions as the loss function. The results show that our method achieves better performance than current state-of-the-art algorithms.
引用
收藏
页码:1084 / 1089
页数:6
相关论文
共 50 条
  • [31] AffordPose: A Large-scale Dataset of Hand-Object Interactions with Affordance-driven Hand Pose
    Jian, Juntao
    Liu, Xiuping
    Li, Manyi
    Hu, Ruizhen
    Liu, Jian
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14667 - 14678
  • [32] Images of large-scale environments
    Canter, D
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 5055 - 5055
  • [33] The Jester Dataset: A Large-Scale Video Dataset of Human Gestures
    Materzynska, Joanna
    Berger, Guillaume
    Bax, Ingo
    Memisevic, Roland
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2874 - 2882
  • [34] AffordPose: A Large-scale Dataset of Hand-Object Interactions with Affordance-driven Hand Pose
    Jian, Juntao
    Liu, Xiuping
    Li, Manyi
    Hu, Ruizhen
    Liu, Jian
    Proceedings of the IEEE International Conference on Computer Vision, 2023, : 14667 - 14678
  • [35] Large-scale 6D Object Pose Estimation Dataset for Industrial Bin-Picking
    Kleeberger, Kilian
    Landgraf, Christian
    Huber, Marco F.
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 2573 - 2578
  • [36] MVDI25K: A large-scale dataset of microscopic vaginal discharge images
    Li L.
    Liu J.
    Yu F.
    Wang X.
    Xiang T.-Z.
    BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 2021, 1 (01):
  • [37] IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images
    Novozamsky, Adam
    Mandian, Babak
    Saic, Stanislav
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2020, : 71 - 80
  • [38] E-POSE: A Large Scale Event Camera Dataset for Object Pose Estimation
    Hay, Oussama Abdul
    Huang, Xiaoqian
    Ayyad, Abdulla
    Sherif, Eslam
    Almadhoun, Randa
    Abdulrahman, Yusra
    Seneviratne, Lakmal
    Abusafieh, Abdulqader
    Zweiri, Yahya
    SCIENTIFIC DATA, 2025, 12 (01)
  • [39] MIND: A Large-scale Dataset for News Recommendation
    Wu, Fangzhao
    Qiao, Ying
    Chen, Jiun-Hung
    Wu, Chuhan
    Qi, Tao
    Lian, Jianxun
    Liu, Danyang
    Xie, Xing
    Gao, Jianfeng
    Wu, Winnie
    Zhou, Ming
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3597 - 3606
  • [40] DANEWSROOM: A Large-scale Danish Summarisation Dataset
    Varab, Daniel
    Schluter, Natalie
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6731 - 6739