Multi-view Shape Generation for a 3D Human-like Body

被引：15

作者：

Yu, Hang ^{[1
]}

Cheang, Chilam ^{[2
]}

Fu, Yanwei ^{[3
,4
]}

Xue, Xiangyang ^{[2
]}

机构：

[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China

[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China

[3] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China

[4] Zhejiang Normal Univ, ISTBI ZJNU Algorithm Ctr Brain Inspired Intellige, Jinhua, Zhejiang, Peoples R China

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2023年 / 19卷 / 01期

关键词：

3D reconstruction; human body reconstruction; multi-view stereo;

D O I：

10.1145/3514248

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Three-dimensional (3D) human-like body reconstruction via a single RGB image has attracted significant research attention recently. Most of the existing methods rely on the Skinned Multi-Person Linear model and thus can only predict unified human bodies. Moreover, meshes reconstructed by current methods sometimes perform well from a canonical view but not from other views, as the reconstruction process is commonly supervised by only a single view. To address these limitations, this article proposes a multi-view shape generation network for a 3D human-like body. Particularly, we propose a coarse-to-fine learning model that gradually deforms a template body toward the ground truth body. Our model utilizes the information of multi-view renderings and corresponding 3D vertex transformation as supervision. Such supervision will help to generate 3D bodies well aligned to all views. To accurately operate mesh deformation, a graph convolutional network structure is introduced to support the shape generation from 3D vertex representation. Additionally, a graph up-pooling operation is designed over the intermediate representations of the graph convolutional network, and thus our model can generate 3D shapes with higher resolution. Novel loss functions are employed to help optimize the whole multi-view generation model, resulting in smoother surfaces. In addition, twomulti-view human body datasets are produced and contributed to the community. Extensive experiments conducted on the benchmark datasets demonstrate the efficacy of our model over the competitors.

引用

页数：22

共 50 条

[31] Multi-view 3D Reconstruction with Transformers
Wang, Dan
Cui, Xinrui
Chen, Xun
Zou, Zhengxia
Shi, Tianyang
Salcudean, Septimiu
Wang, Z. Jane
Ward, Rabab
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5702 - 5711
[32] Multi-View Image Capture for Glasses Free Multi-View 3D Displays
Gurbuz, Sabri
Yano, Sumio
Iwasawa, Shoichiro
Ando, Hiroshi
IDW'10: PROCEEDINGS OF THE 17TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2010, : 2091 - 2094
[33] A generalizable approach for multi-view 3D human pose regression
Kadkhodamohammadi, Abdolrahim
Padoy, Nicolas
MACHINE VISION AND APPLICATIONS, 2020, 32 (01)
[34] Multi-view Reconstruction of 3D Human Pose with Procrustes Analysis
Temiz, Huseyin
Gokherk, Berk
Akarun, Late
2019 NINTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2019,
[35] Multi-view 3D Human Pose Estimation in Complex Environment
Hofmann, M.
Gavrila, D. M.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (01) : 103 - 124
[36] PROGRESSIVE MULTI-VIEW FUSION FOR 3D HUMAN POSE ESTIMATION
Zhang, Lijun
Zhou, Kangkang
Liu, Liangchen
Li, Zhenghao
Zhao, Xunyi
Zhou, Xiang-Dong
Shi, Yu
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1600 - 1604
[37] A generalizable approach for multi-view 3D human pose regression
Abdolrahim Kadkhodamohammadi
Nicolas Padoy
Machine Vision and Applications, 2021, 32
[38] Markerless multi-view 3D human pose estimation: A survey
Nogueira, Ana Filipa Rodrigues
Oliveira, Helder P.
Teixeira, Luis F.
IMAGE AND VISION COMPUTING, 2025, 155
[39] 3D second harmonic generation imaging tomography by multi-view excitation
Campbell, Kirby R.
Wen, Bruce
Shelton, Emily M.
Swader, Robert
Cox, Benjamin L.
Eliceiri, Kevin
Campagnola, Paul J.
OPTICA, 2017, 4 (10): : 1171 - 1179
[40] Sequential Fusion of Multi-view Video Frames for 3D Scene Generation
Sun, Weilin
Li, Xiangxian
Li, Manyi
Wang, Yuqing
Zheng, Yuze
Meng, Xiangxu
Meng, Lei
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 597 - 608

← 1 2 3 4 5 →