Coarse-to-fine Animal Pose and Shape Estimation

被引:0
|
作者
Li, Chen [1 ]
Lee, Gim Hee [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing animal pose and shape estimation approaches reconstruct animal meshes with a parametric SMAL model. This is because the low-dimensional pose and shape parameters of the SMAL model makes it easier for deep networks to learn the high-dimensional animal meshes. However, the SMAL model is learned from scans of toy animals with limited pose and shape variations, and thus may not be able to represent highly varying real animals well. This may result in poor fittings of the estimated meshes to the 2D evidences, e.g. 2D keypoints or silhouettes. To mitigate this problem, we propose a coarse-to-fine approach to reconstruct 3D animal mesh from a single image. The coarse estimation stage first estimates the pose, shape and translation parameters of the SMAL model. The estimated meshes are then used as a starting point by a graph convolutional network (GCN) to predict a per-vertex deformation in the refinement stage. This combination of SMAL-based and vertex-based representations benefits from both parametric and non-parametric representations. We design our mesh refinement GCN (MRGCN) as an encoderdecoder structure with hierarchical feature representations to overcome the limited receptive field of traditional GCNs. Moreover, we observe that the global image feature used by existing animal mesh reconstruction works is unable to capture detailed shape information for mesh refinement. We thus introduce a local feature extractor to retrieve a vertex-level feature and use it together with the global feature as the input of the MRGCN. We test our approach on the StanfordExtra dataset and achieve state-of-the-art results. Furthermore, we test the generalization capacity of our approach on the Animal Pose and BADJA datasets. Our code is available at the project website(1).
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Coarse-to-fine animal pose and shape estimation
    Li, Chen
    Lee, Gim Hee
    arXiv, 2021,
  • [2] Efficient Monocular Coarse-to-Fine Object Pose Estimation
    Feng, Rong
    Zhang, Hong
    2016 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2016, : 1617 - 1622
  • [3] Face Alignment by Coarse-to-Fine Shape Estimation
    Wan Jun
    Li Jing
    Chang Jun
    Wu Yujia
    Xiao Yafu
    Song Chengfang
    CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (06) : 1183 - 1191
  • [4] Face Alignment by Coarse-to-Fine Shape Estimation
    WAN Jun
    LI Jing
    CHANG Jun
    WU Yujia
    XIAO Yafu
    SONG Chengfang
    ChineseJournalofElectronics, 2018, 27 (06) : 1183 - 1191
  • [5] Coarse-to-Fine 3D Human Pose Estimation
    Guo, Yu
    Zhao, Lin
    Zhang, Shanshan
    Yang, Jian
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 579 - 592
  • [6] A deep Coarse-to-Fine network for head pose estimation from synthetic data
    Wang, Yujia
    Liang, Wei
    Shen, Jianbing
    Jia, Yunde
    Yu, Lap-Fai
    PATTERN RECOGNITION, 2019, 94 : 196 - 206
  • [7] A Multiscale Coarse-to-Fine Human Pose Estimation Network With Hard Keypoint Mining
    Jiang, Xiaoyan
    Tao, Hangyu
    Hwang, Jenq-Neng
    Fang, Zhijun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (03): : 1730 - 1741
  • [8] A coarse-to-fine method for shape recognition
    Tang H.-X.
    Wei H.
    Journal of Computer Science and Technology, 2007, 22 (02) : 330 - 334
  • [9] Real-Time Facial Pose Estimation and Tracking by Coarse-to-Fine Iterative Optimization
    Yang, Xiaolong
    Jia, Xiaohong
    Yuan, Mengke
    Yan, Dong-Ming
    TSINGHUA SCIENCE AND TECHNOLOGY, 2020, 25 (05) : 690 - 700
  • [10] PoseDiffusion: A Coarse-to-Fine Framework for Unseen Object 6-DoF Pose Estimation
    Zhou, Jiaming
    Zhu, Qing
    Wang, Yaonan
    Feng, Mingtao
    Wu, Chengzhong
    Liu, Xuebing
    Huang, Jianan
    Mian, Ajmal
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (09) : 11127 - 11138