3D Shape Reconstruction from 2D Images with Disentangled Attribute Flow

被引:20
|
作者
Wen, Xin [1 ,2 ]
Zhou, Junsheng [1 ]
Liu, Yu-Shen [1 ]
Su, Hua [3 ]
Dong, Zhen [4 ]
Han, Zhizhong [5 ]
机构
[1] Tsinghua Univ, Sch Software, BNRist, Beijing, Peoples R China
[2] JD Com, JD Logist, Beijing, Peoples R China
[3] Kuaishou Technol, Beijing, Peoples R China
[4] Wuhan Univ, Wuhan, Peoples R China
[5] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.00378
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reconstructing 3D shape from a single 2D image is a challenging task, which needs to estimate the detailed 3D structures based on the semantic attributes from 2D image. So far, most of the previous methods still struggle to extract semantic attributes for 3D reconstruction task. Since the semantic attributes of a single image are usually implicit and entangled with each other, it is still challenging to reconstruct 3D shape with detailed semantic structures represented by the input image. To address this problem, we propose 3DAttriFlow to disentangle and extract semantic attributes through different semantic levels in the input images. These disentangled semantic attributes will be integrated into the 3D shape reconstruction process, which can provide definite guidance to the reconstruction of specific attribute on 3D shape. As a result, the 3D decoder can explicitly capture high-level semantic features at the bottom of the network, and utilize low-level features at the top of the network, which allows to reconstruct more accurate 3D shapes. Note that the explicit disentangling is learned without extra labels, where the only supervision used in our training is the input image and its corresponding 3D shape. Our comprehensive experiments on ShapeNet dataset demonstrate that 3DAttriFlow outperforms the state-of-the-art shape reconstruction methods, and we also validate its generalization ability on shape completion task. Code is available at https://github.com/junshengzhou/3DAttriFlow.
引用
收藏
页码:3793 / 3803
页数:11
相关论文
共 50 条
  • [41] Evaluation of Tools Used for 3D Reconstruction of 2D Medical Images
    Durisetti, Srinikhil
    Alapati, Darsani
    Vadnala, Sai Keerthi
    Kotha, Keerthana
    Chandra, G. Ramesh
    Govindarajan, Sathya
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 287 - 298
  • [42] 3D Tibia Reconstruction Using 2D Computed Tomography Images
    Iyoho, Anthony E.
    Young, Jonathan M.
    Volman, Vladislav
    Shelley, David A.
    Ng, Laurel J.
    Wang, Henry
    MILITARY MEDICINE, 2019, 184 (3-4) : 621 - 626
  • [43] 2D Feature Recognition And 3d Reconstruction In Solar Euv Images
    Markus J. Aschwanden
    Solar Physics, 2005, 228 : 339 - 358
  • [44] 3D object understanding from 2D images
    Wang, PSP
    INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING, 1998, 3545 : 33 - 43
  • [45] Face recognition from 2D and 3D images
    Wang, YJ
    Chua, CS
    Ho, YK
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2001, 2091 : 26 - 31
  • [46] From 2D images to 3D face geometry
    Lengagne, R
    Tarel, JP
    Monga, O
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 301 - 306
  • [47] A methodology for realistic human shape reconstruction from 2D images
    Curbelo J.P.
    Spiteri R.J.
    Multimedia Tools and Applications, 2024, 83 (21) : 61025 - 61046
  • [48] Optimization of reconstruction of 2D medical images based on computer 3D reconstruction technology
    Information Science and Technology, College of Northwestern University, China
    J. Digit. Inf. Manage., 3 (142-146):
  • [49] 3D IMAGES WITH 2D FOOTPRINT
    Ferre Ferri, Enrique
    REVISTA SONDA-INVESTIGACION Y DOCENCIA EN ARTES Y LETRAS, 2020, (09): : 73 - 82
  • [50] 3D Reconstruction of 2D Crystals
    Zeng, Xiangyan
    Glover, James Ervin
    Hughes, Owen
    Stahlberg, Henning
    PROCEEDINGS OF THE 49TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE (ACMSE '11), 2011, : 160 - 165