Multi-view Shape Generation for a 3D Human-like Body

被引:15
|
作者
Yu, Hang [1 ]
Cheang, Chilam [2 ]
Fu, Yanwei [3 ,4 ]
Xue, Xiangyang [2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[3] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[4] Zhejiang Normal Univ, ISTBI ZJNU Algorithm Ctr Brain Inspired Intellige, Jinhua, Zhejiang, Peoples R China
关键词
3D reconstruction; human body reconstruction; multi-view stereo;
D O I
10.1145/3514248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional (3D) human-like body reconstruction via a single RGB image has attracted significant research attention recently. Most of the existing methods rely on the Skinned Multi-Person Linear model and thus can only predict unified human bodies. Moreover, meshes reconstructed by current methods sometimes perform well from a canonical view but not from other views, as the reconstruction process is commonly supervised by only a single view. To address these limitations, this article proposes a multi-view shape generation network for a 3D human-like body. Particularly, we propose a coarse-to-fine learning model that gradually deforms a template body toward the ground truth body. Our model utilizes the information of multi-view renderings and corresponding 3D vertex transformation as supervision. Such supervision will help to generate 3D bodies well aligned to all views. To accurately operate mesh deformation, a graph convolutional network structure is introduced to support the shape generation from 3D vertex representation. Additionally, a graph up-pooling operation is designed over the intermediate representations of the graph convolutional network, and thus our model can generate 3D shapes with higher resolution. Novel loss functions are employed to help optimize the whole multi-view generation model, resulting in smoother surfaces. In addition, twomulti-view human body datasets are produced and contributed to the community. Extensive experiments conducted on the benchmark datasets demonstrate the efficacy of our model over the competitors.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Multi-view 3D Reconstruction with Transformers
    Wang, Dan
    Cui, Xinrui
    Chen, Xun
    Zou, Zhengxia
    Shi, Tianyang
    Salcudean, Septimiu
    Wang, Z. Jane
    Ward, Rabab
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5702 - 5711
  • [32] Multi-View Image Capture for Glasses Free Multi-View 3D Displays
    Gurbuz, Sabri
    Yano, Sumio
    Iwasawa, Shoichiro
    Ando, Hiroshi
    IDW'10: PROCEEDINGS OF THE 17TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2010, : 2091 - 2094
  • [33] A generalizable approach for multi-view 3D human pose regression
    Kadkhodamohammadi, Abdolrahim
    Padoy, Nicolas
    MACHINE VISION AND APPLICATIONS, 2020, 32 (01)
  • [34] Multi-view Reconstruction of 3D Human Pose with Procrustes Analysis
    Temiz, Huseyin
    Gokherk, Berk
    Akarun, Late
    2019 NINTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2019,
  • [35] Multi-view 3D Human Pose Estimation in Complex Environment
    Hofmann, M.
    Gavrila, D. M.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (01) : 103 - 124
  • [36] PROGRESSIVE MULTI-VIEW FUSION FOR 3D HUMAN POSE ESTIMATION
    Zhang, Lijun
    Zhou, Kangkang
    Liu, Liangchen
    Li, Zhenghao
    Zhao, Xunyi
    Zhou, Xiang-Dong
    Shi, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1600 - 1604
  • [37] A generalizable approach for multi-view 3D human pose regression
    Abdolrahim Kadkhodamohammadi
    Nicolas Padoy
    Machine Vision and Applications, 2021, 32
  • [38] Markerless multi-view 3D human pose estimation: A survey
    Nogueira, Ana Filipa Rodrigues
    Oliveira, Helder P.
    Teixeira, Luis F.
    IMAGE AND VISION COMPUTING, 2025, 155
  • [39] 3D second harmonic generation imaging tomography by multi-view excitation
    Campbell, Kirby R.
    Wen, Bruce
    Shelton, Emily M.
    Swader, Robert
    Cox, Benjamin L.
    Eliceiri, Kevin
    Campagnola, Paul J.
    OPTICA, 2017, 4 (10): : 1171 - 1179
  • [40] Sequential Fusion of Multi-view Video Frames for 3D Scene Generation
    Sun, Weilin
    Li, Xiangxian
    Li, Manyi
    Wang, Yuqing
    Zheng, Yuze
    Meng, Xiangxu
    Meng, Lei
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 597 - 608