Deformable Mesh Transformer for 3D Human Mesh Recovery

Cited by: 7
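Authors
Yoshiyasu, Yusuke [1]
Affiliations
[1] Natl Inst Adv Ind Sci & Technol, 1-1-1 Umezono, Tsukuba, Ibaraki, Japan
Keywords
DOI
10.1109/CVPR52729.2023.01631
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract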
We present Deformable mesh transFormer (DeFormer), a novel vertex-based approach to monocular 3D human mesh recovery. DeFormer iteratively fits a body mesh model to an input image via a mesh alignment feedback loop formed within a transformer decoder equipped with two efficient body-mesh-driven attention modules: 1) body sparse self-attention and 2) deformable mesh cross attention. As a result, DeFormer can effectively exploit high-resolution image feature maps and a dense mesh model, both of which were computationally expensive to handle in previous approaches using standard transformer attention. Experimental results show that DeFormer achieves state-of-the-art performance on the Human3.6M and 3DPW benchmarks. An ablation study is also conducted to show the effectiveness of the DeFormer model designs for leveraging multi-scale feature maps. Code is available at https://github.com/yusukey03012/DeFormer.
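The abstract names a deformable cross attention module but does not spell out its mechanics. As a rough illustration of the general idea behind deformable attention (sampling an image feature map at a few offset locations around a reference point and blending the samples with softmax weights), here is a minimal pure-Python sketch. It is not taken from the DeFormer code: the function names, the single-channel toy feature map, and the hand-picked offsets and scores are all assumptions made for illustration. In a trained model the offsets and scores would typically be predicted from a query feature, and the reference point would presumably be a projected mesh vertex.

```python
import math


def bilinear_sample(feature_map, x, y):
    """Bilinearly interpolate a 2D grid (list of rows) at continuous (x, y).

    Coordinates are clamped to the grid border (an assumption of this sketch).
    """
    height, width = len(feature_map), len(feature_map[0])
    x = min(max(x, 0.0), width - 1.0)
    y = min(max(y, 0.0), height - 1.0)
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
    wx, wy = x - x0, y - y0
    top = (1 - wx) * feature_map[y0][x0] + wx * feature_map[y0][x1]
    bottom = (1 - wx) * feature_map[y1][x0] + wx * feature_map[y1][x1]
    return (1 - wy) * top + wy * bottom


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def deformable_sample(feature_map, reference_point, offsets, scores):
    """Blend samples taken at reference_point + offset, weighted by softmax(scores).

    Hypothetical helper: offsets and scores are passed in directly here,
    whereas a learned model would predict them.
    """
    weights = softmax(scores)
    rx, ry = reference_point
    return sum(
        w * bilinear_sample(feature_map, rx + dx, ry + dy)
        for w, (dx, dy) in zip(weights, offsets)
    )


# Toy 4x4 single-channel feature map whose value equals the column index.
feature_map = [[float(col) for col in range(4)] for _ in range(4)]

# Two sampling points, half a pixel to the right and to the left of (1, 1),
# with equal attention scores.
value = deformable_sample(
    feature_map,
    reference_point=(1.0, 1.0),
    offsets=[(0.5, 0.0), (-0.5, 0.0)],
    scores=[0.0, 0.0],
)
print(value)
```

Because each query reads only a handful of sampled locations rather than every pixel, the cost per query does not grow with feature map resolution, which is consistent with the abstract's point about exploiting high-resolution feature maps.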
Pages: 17006-17015
Page count: 10