Multi-view Shape Generation for a 3D Human-like Body

被引：15

作者：

Yu, Hang ^{[1
]}

Cheang, Chilam ^{[2
]}

Fu, Yanwei ^{[3
,4
]}

Xue, Xiangyang ^{[2
]}

机构：

[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China

[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China

[3] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China

[4] Zhejiang Normal Univ, ISTBI ZJNU Algorithm Ctr Brain Inspired Intellige, Jinhua, Zhejiang, Peoples R China

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2023年 / 19卷 / 01期

关键词：

3D reconstruction; human body reconstruction; multi-view stereo;

D O I：

10.1145/3514248

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Three-dimensional (3D) human-like body reconstruction via a single RGB image has attracted significant research attention recently. Most of the existing methods rely on the Skinned Multi-Person Linear model and thus can only predict unified human bodies. Moreover, meshes reconstructed by current methods sometimes perform well from a canonical view but not from other views, as the reconstruction process is commonly supervised by only a single view. To address these limitations, this article proposes a multi-view shape generation network for a 3D human-like body. Particularly, we propose a coarse-to-fine learning model that gradually deforms a template body toward the ground truth body. Our model utilizes the information of multi-view renderings and corresponding 3D vertex transformation as supervision. Such supervision will help to generate 3D bodies well aligned to all views. To accurately operate mesh deformation, a graph convolutional network structure is introduced to support the shape generation from 3D vertex representation. Additionally, a graph up-pooling operation is designed over the intermediate representations of the graph convolutional network, and thus our model can generate 3D shapes with higher resolution. Novel loss functions are employed to help optimize the whole multi-view generation model, resulting in smoother surfaces. In addition, twomulti-view human body datasets are produced and contributed to the community. Extensive experiments conducted on the benchmark datasets demonstrate the efficacy of our model over the competitors.

引用

页数：22

共 50 条

[41] A Transformer-based Network for Multi-view 3D Mesh Generation
Shi, Wuzhen
Liu, Zhijie
Li, Yingxiang
Wen, Yang
Liu, Yutao
Proceedings - 2023 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing and Data Security, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PCDS/Metaverse 2023, 2023,
[42] Learning to Implicitly Represent 3D Human Body From Multi-scale Features and Multi-view Images
Li, Zhongguo
Oskarsson, Magnus
Heyden, Anders
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8968 - 8975
[43] MULTI-VIEW PAIRWISE RELATIONSHIP LEARNING FOR SKETCH BASED 3D SHAPE RETRIEVAL
Li, Hanhui
Wu, Hefeng
He, Xiangjian
Lin, Shujin
Wang, Ruomei
Luo, Xiaonan
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1434 - 1439
[44] POINTVIEW-GCN: 3D SHAPE CLASSIFICATION WITH MULTI-VIEW POINT CLOUDS
Mohammadi, Seyed Saber
Wang, Yiming
Del Bue, Alessio
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3103 - 3107
[45] Angular Triplet-Center Loss for Multi-View 3D Shape Retrieval
Li, Zhaoqun
Xu, Cheng
Leng, Biao
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8682 - 8689
[46] SKETCH-BASED 3D SHAPE RETRIEVAL WITH MULTI-VIEW FUSION TRANSFORMER
Zhu, Cunjuan
Cui, Dongdong
Jia, Qi
Wang, Weimin
Liu, Yu
Lew, Michael S.
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3005 - 3009
[47] 3D Shape Reconstruction from Sketches via Multi-view Convolutional Networks
Lun, Zhaoliang
Gadelha, Matheus
Kalogerakis, Evangelos
Maji, Subhransu
Wang, Rui
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 67 - 77
[48] 3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-view Model-fitting
Li, Zhongguo
Oskarsson, Magnus
Heyden, Anders
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1887 - 1896
[49] Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis
Wei, Xin
Yu, Ruixuan
Sun, Jian
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7525 - 7541
[50] Research on Multi-view 3D Reconstruction of Human Motion Based on OpenPose
Li, Xuhui
Cai, Cheng
Zhou, Hengyi
COGNITIVE COMPUTING, ICCC 2021, 2022, 12992 : 72 - 78

← 1 2 3 4 5 →