Multi-view Shape Generation for a 3D Human-like Body

被引:15
|
作者
Yu, Hang [1 ]
Cheang, Chilam [2 ]
Fu, Yanwei [3 ,4 ]
Xue, Xiangyang [2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[3] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[4] Zhejiang Normal Univ, ISTBI ZJNU Algorithm Ctr Brain Inspired Intellige, Jinhua, Zhejiang, Peoples R China
关键词
3D reconstruction; human body reconstruction; multi-view stereo;
D O I
10.1145/3514248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional (3D) human-like body reconstruction via a single RGB image has attracted significant research attention recently. Most of the existing methods rely on the Skinned Multi-Person Linear model and thus can only predict unified human bodies. Moreover, meshes reconstructed by current methods sometimes perform well from a canonical view but not from other views, as the reconstruction process is commonly supervised by only a single view. To address these limitations, this article proposes a multi-view shape generation network for a 3D human-like body. Particularly, we propose a coarse-to-fine learning model that gradually deforms a template body toward the ground truth body. Our model utilizes the information of multi-view renderings and corresponding 3D vertex transformation as supervision. Such supervision will help to generate 3D bodies well aligned to all views. To accurately operate mesh deformation, a graph convolutional network structure is introduced to support the shape generation from 3D vertex representation. Additionally, a graph up-pooling operation is designed over the intermediate representations of the graph convolutional network, and thus our model can generate 3D shapes with higher resolution. Novel loss functions are employed to help optimize the whole multi-view generation model, resulting in smoother surfaces. In addition, twomulti-view human body datasets are produced and contributed to the community. Extensive experiments conducted on the benchmark datasets demonstrate the efficacy of our model over the competitors.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] A Transformer-based Network for Multi-view 3D Mesh Generation
    Shi, Wuzhen
    Liu, Zhijie
    Li, Yingxiang
    Wen, Yang
    Liu, Yutao
    Proceedings - 2023 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Autonomous and Trusted Vehicles, Scalable Computing and Communications, Digital Twin, Privacy Computing and Data Security, Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PCDS/Metaverse 2023, 2023,
  • [42] Learning to Implicitly Represent 3D Human Body From Multi-scale Features and Multi-view Images
    Li, Zhongguo
    Oskarsson, Magnus
    Heyden, Anders
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8968 - 8975
  • [43] MULTI-VIEW PAIRWISE RELATIONSHIP LEARNING FOR SKETCH BASED 3D SHAPE RETRIEVAL
    Li, Hanhui
    Wu, Hefeng
    He, Xiangjian
    Lin, Shujin
    Wang, Ruomei
    Luo, Xiaonan
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1434 - 1439
  • [44] POINTVIEW-GCN: 3D SHAPE CLASSIFICATION WITH MULTI-VIEW POINT CLOUDS
    Mohammadi, Seyed Saber
    Wang, Yiming
    Del Bue, Alessio
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3103 - 3107
  • [45] Angular Triplet-Center Loss for Multi-View 3D Shape Retrieval
    Li, Zhaoqun
    Xu, Cheng
    Leng, Biao
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8682 - 8689
  • [46] SKETCH-BASED 3D SHAPE RETRIEVAL WITH MULTI-VIEW FUSION TRANSFORMER
    Zhu, Cunjuan
    Cui, Dongdong
    Jia, Qi
    Wang, Weimin
    Liu, Yu
    Lew, Michael S.
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3005 - 3009
  • [47] 3D Shape Reconstruction from Sketches via Multi-view Convolutional Networks
    Lun, Zhaoliang
    Gadelha, Matheus
    Kalogerakis, Evangelos
    Maji, Subhransu
    Wang, Rui
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 67 - 77
  • [48] 3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-view Model-fitting
    Li, Zhongguo
    Oskarsson, Magnus
    Heyden, Anders
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1887 - 1896
  • [49] Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis
    Wei, Xin
    Yu, Ruixuan
    Sun, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7525 - 7541
  • [50] Research on Multi-view 3D Reconstruction of Human Motion Based on OpenPose
    Li, Xuhui
    Cai, Cheng
    Zhou, Hengyi
    COGNITIVE COMPUTING, ICCC 2021, 2022, 12992 : 72 - 78