Deformable Mesh Transformer for 3D Human Mesh Recovery

Cited by: 7

Author:

Yoshiyasu, Yusuke [1]

Affiliation:

[1] Natl Inst Adv Ind Sci & Technol, 1-1-1 Umezono, Tsukuba, Ibaraki, Japan

Source:

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023

Keywords:

DOI:

10.1109/CVPR52729.2023.01631

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present Deformable mesh transFormer (DeFormer), a novel vertex-based approach to monocular 3D human mesh recovery. DeFormer iteratively fits a body mesh model to an input image via a mesh alignment feedback loop formed within a transformer decoder. The decoder is equipped with two efficient, body-mesh-driven attention modules: 1) body sparse self-attention and 2) deformable mesh cross-attention. As a result, DeFormer can effectively exploit high-resolution image feature maps and a dense mesh model, both of which were computationally expensive to handle in previous approaches using standard transformer attention. Experimental results show that DeFormer achieves state-of-the-art performance on the Human3.6M and 3DPW benchmarks. An ablation study is also conducted to show the effectiveness of the DeFormer model designs for leveraging multi-scale feature maps. Code is available at https://github.com/yusukey03012/DeFormer.


Pages: 17006 - 17015

Page count: 10

Related records: 50 in total

[1] Learning Human Mesh Recovery in 3D Scenes
Shen, Zehong
Cen, Zhi
Peng, Sida
Shuai, Qing
Bao, Hujun
Zhou, Xiaowei
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17038 - 17047
[2] PostureHMR: Posture Transformation for 3D Human Mesh Recovery
Song, Yu-Pei
Wu, Xiao
Yuan, Zhaoquan
Qiao, Jian-Jun
Peng, Qiang
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 9732 - 9741
[3] A review of 3D human body pose estimation and mesh recovery
Muhammad, Zaka-Ud-Din
Huang, Zhangjin
Khan, Rashid
DIGITAL SIGNAL PROCESSING, 2022, 128
[4] 3D Human Mesh Recovery with Sequentially Global Rotation Estimation
Wang, Dongkai
Zhang, Shiliang
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14907 - 14916
[5] Video2mesh: 3D human pose and shape recovery by a temporal convolutional transformer network
Chao, Xianjin
Ge, Zhipeng
Leung, Howard
IET COMPUTER VISION, 2023, 17 (04) : 379 - 388
[6] Delaunay deformable mesh for the weathering and erosion of 3D terrain
Tychonievich, L. A.
Jones, M. D.
VISUAL COMPUTER, 2010, 26 (12) : 1485 - 1495
[8] Deep learning for 3D human pose estimation and mesh recovery: A survey
Liu, Yang
Qiu, Changzhen
Zhang, Zhiyong
NEUROCOMPUTING, 2024, 596
[9] Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views
Zhang, Siwei
Ma, Qianli
Zhang, Yan
Aliakbarian, Sadegh
Cosker, Darren
Tang, Siyu
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 7955 - 7966
[10] A Progressive Quadric Graph Convolutional Network for 3D Human Mesh Recovery
Wang, Lei
Liu, Xunyu
Ma, Xiaoliang
Wu, Jiaji
Cheng, Jun
Zhou, Mengchu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (01) : 104 - 117
