LPHD: A LARGE-SCALE HEAD POSE DATASET FOR RGB IMAGES

Cited by: 1
Authors
Sun, Wei [1 ]
Fan, Yezhao [1 ]
Min, Xiongkuo [1 ]
Peng, Shihao [1 ]
Ma, Siwei [2 ]
Zhai, Guangtao [1 ]
Affiliations
[1] Shanghai Jiao Tong Univ, Inst Image Commu & Infor Proce, Shanghai, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
head pose dataset; head pose estimation; facial landmark detection; convolution neural network; MOTION;
DOI
10.1109/ICME.2019.00190
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Head pose estimation has attracted considerable research interest in recent years. With the advent of deep learning, it is possible to predict head pose accurately from RGB images without the help of facial landmarks or depth information. However, existing head pose datasets often lack images with large head poses, which severely limits the development of head pose estimation algorithms. In this paper, we build a large-scale head pose dataset (LHPD) comprising more than 140,000 images with diverse and accurate head poses. For the first time, the LHPD dataset includes head images recorded from different shooting angles between the camera and the human body, which greatly expands the range of head poses compared to previous datasets. As a result, the range of head poses covers ±90° for each Euler angle. Accurate and reliable head pose annotations are obtained using a motion capture system and careful calibration procedures. We then propose a head pose estimation method that fine-tunes ResNet on the LHPD dataset, using the Euclidean distance between quaternions as the loss function. The results show that our method achieves better performance than current state-of-the-art algorithms.
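The loss mentioned in the abstract, a Euclidean distance between quaternions, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `euler_to_quaternion` and `quaternion_distance_loss`, the Z-Y-X (yaw, pitch, roll) angle convention, and the normalization of the prediction are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def euler_to_quaternion(yaw, pitch, roll):
    """Convert Euler angles (radians) to a unit quaternion (w, x, y, z).

    Assumption: Z-Y-X rotation order (yaw about z, pitch about y, roll about x).
    The abstract does not state which convention the dataset uses.
    """
    cy, sy = np.cos(yaw / 2), np.sin(yaw / 2)
    cp, sp = np.cos(pitch / 2), np.sin(pitch / 2)
    cr, sr = np.cos(roll / 2), np.sin(roll / 2)
    return np.array([
        cr * cp * cy + sr * sp * sy,  # w
        sr * cp * cy - cr * sp * sy,  # x
        cr * sp * cy + sr * cp * sy,  # y
        cr * cp * sy - sr * sp * cy,  # z
    ])


def quaternion_distance_loss(q_pred, q_true):
    """Euclidean (L2) distance between a predicted and a ground-truth quaternion.

    Assumption: the raw prediction is normalized to unit length first, so that
    it represents a valid rotation before the distance is measured.
    """
    q_pred = np.asarray(q_pred, dtype=float)
    q_true = np.asarray(q_true, dtype=float)
    q_pred = q_pred / np.linalg.norm(q_pred)
    return float(np.linalg.norm(q_pred - q_true))


# Hypothetical usage: ground truth is a 90 degree yaw, prediction is "no rotation".
q_true = euler_to_quaternion(np.pi / 2, 0.0, 0.0)
q_pred = np.array([1.0, 0.0, 0.0, 0.0])
loss = quaternion_distance_loss(q_pred, q_true)
```

Note that q and -q encode the same rotation, so a practical loss may also need to resolve this sign ambiguity, for example by keeping the ground-truth quaternions in one hemisphere; whether the paper does so is not stated in the abstract.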
Pages: 1084-1089
Number of pages: 6