Multi-view Shape Generation for a 3D Human-like Body

Cited by: 15
Authors
Yu, Hang [1]
Cheang, Chilam [2]
Fu, Yanwei [3, 4]
Xue, Xiangyang [2]
Affiliations
[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[3] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[4] Zhejiang Normal Univ, ISTBI ZJNU Algorithm Ctr Brain Inspired Intellige, Jinhua, Zhejiang, Peoples R China
Keywords
3D reconstruction; human body reconstruction; multi-view stereo
DOI
10.1145/3514248
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Three-dimensional (3D) human-like body reconstruction from a single RGB image has attracted significant research attention recently. Most existing methods rely on the Skinned Multi-Person Linear model and thus can only predict unified human bodies. Moreover, meshes reconstructed by current methods sometimes look good from a canonical view but not from other views, because the reconstruction process is commonly supervised by only a single view. To address these limitations, this article proposes a multi-view shape generation network for a 3D human-like body. In particular, we propose a coarse-to-fine learning model that gradually deforms a template body toward the ground truth body. Our model uses multi-view renderings and the corresponding 3D vertex transformations as supervision. Such supervision helps to generate 3D bodies that are well aligned to all views. To deform the mesh accurately, a graph convolutional network structure is introduced to support shape generation from the 3D vertex representation. Additionally, a graph up-pooling operation is designed over the intermediate representations of the graph convolutional network, so that our model can generate 3D shapes with higher resolution. Novel loss functions are employed to help optimize the whole multi-view generation model, resulting in smoother surfaces. In addition, two multi-view human body datasets are produced and contributed to the community. Extensive experiments conducted on the benchmark datasets demonstrate the efficacy of our model over the competitors.
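The abstract does not specify how the graph up-pooling operation is defined. As an illustration only, the sketch below assumes an edge-midpoint scheme, a common way to raise mesh resolution in which one new vertex is inserted on every edge and each triangle is split into four. The function name `edge_midpoint_unpool` and the NumPy representation of vertices and faces are hypothetical and are not taken from the paper.

```python
import numpy as np

def edge_midpoint_unpool(vertices, faces):
    """Assumed up-pooling scheme: add one vertex at the midpoint of every
    edge and split each triangle into four smaller triangles."""
    vertices = np.asarray(vertices, dtype=float)
    faces = np.asarray(faces, dtype=int)
    midpoint_index = {}            # sorted vertex pair -> index of its midpoint
    new_vertices = list(vertices)  # original vertices keep their indices

    def midpoint(a, b):
        key = (min(a, b), max(a, b))
        if key not in midpoint_index:
            # Shared edges are visited twice but create only one vertex.
            midpoint_index[key] = len(new_vertices)
            new_vertices.append((vertices[a] + vertices[b]) / 2.0)
        return midpoint_index[key]

    new_faces = []
    for face in faces:
        a, b, c = (int(i) for i in face)
        ab, bc, ca = midpoint(a, b), midpoint(b, c), midpoint(c, a)
        # Three corner triangles plus the central triangle.
        new_faces += [(a, ab, ca), (b, bc, ab), (c, ca, bc), (ab, bc, ca)]
    return np.array(new_vertices), np.array(new_faces)
```

In a coarse-to-fine pipeline of the kind the abstract describes, a step like this would sit between deformation stages, so that later graph convolutions operate on a denser vertex set. In a learned model the new vertices would typically also receive interpolated feature vectors, which this sketch omits.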
Pages: 22