SE-MD: a single-encoder multiple-decoder deep network for point cloud reconstruction from 2D images

被引:3
|
作者
Hafiz, Abdul Mueed [1 ]
Bhat, Rouf Ul Alam [1 ]
Parah, Shabir Ahmad [2 ]
Hassaballah, M. [3 ,4 ]
机构
[1] Univ Kashmir, Dept Elect & Commun Engn, Inst Technol, Srinagar 190006, J&K, India
[2] Univ Kashmir, Dept Elect & Instrumentat Technol, Srinagar 190006, J&K, India
[3] Prince Sattam Bin Abdulaziz Univ, Dept Comp Sci, Coll Comp Engn & Sci, Al Kharj 16278, Saudi Arabia
[4] South Valley Univ, Fac Comp & Informat, Dept Comp Sci, Qena, Egypt
关键词
3D model reconstruction; 3D shape reconstruction; 2D images; Point clouds; ShapeNet; 3D convolutional networks;
D O I
10.1007/s10044-023-01155-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D model reconstruction from single 2D RGB images is a challenging and actively researched computer vision task. Several techniques based on conventional network architectures have been proposed for the same. However, the body of research work is limited and there are some issues like using inefficient 3D representation formats, weak 3D model reconstruction backbones, inability to reconstruct dense point clouds, dependence of post-processing for reconstruction of dense point clouds and dependence on silhouettes in RGB images. In this paper, a new 2D RGB image to point cloud conversion technique is proposed, which improves the state-of-the-art in the field due to its efficient, robust and simple model by using the concept of parallelization in network architecture. It not only uses efficient and rich 3D representation of point clouds, but also uses a new robust point cloud reconstruction backbone to address the prevalent issues. This involves using a single-encoder multiple-decoder deep network architecture wherein each decoder reconstructs certain fixed viewpoints. This is followed by fusing all the viewpoints to reconstruct a dense point cloud. Various experiments are conducted to evaluate the proposed technique and to compare its performance with those of the state-of-the-arts and impressive gains in performance are demonstrated.
引用
收藏
页码:1291 / 1302
页数:12
相关论文
共 50 条
  • [31] Finding a closed boundary by growing minimal paths from a single point on 2D or 3D images
    Bemnansour, Fethallah
    Bonneau, Stephane
    Cohen, Laurent D.
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 2556 - 2563
  • [32] The presentation of a semi-supervised deep learning platform for 3D face reconstruction from 2D images
    Hao, Bianyuan
    JOURNAL OF OPTICS-INDIA, 2024, 53 (03): : 2202 - 2211
  • [33] The presentation of a semi-supervised deep learning platform for 3D face reconstruction from 2D images
    Hao, Bianyuan
    JOURNAL OF OPTICS-INDIA, 2024, 53 (03): : 2202 - 2211
  • [34] 3D Reconstruction and Estimation from Single-view 2D Image by Deep Learning A Survey
    Shan, Yongfeng
    Liang, Christy Jie
    Xu, Min
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1 - 7
  • [35] SOANets: Encoder-decoder based Skeleton Orientation Alignment Network for White Cane User Recognition from 2D Human Skeleton Sequence
    Nishida, Naoki
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Ide, Ichiro
    Murase, Hiroshi
    Piao, Jun
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 435 - 443
  • [36] Deep Learning of Submerged Body Images from 2D Sonar Sensor based on Convolutional Neural Network
    Lee, Sejin
    2017 IEEE UNDERWATER TECHNOLOGY (UT), 2017,
  • [37] 3D Motion Reconstruction From 2D Motion Data Using Multimodal Conditional Deep Belief Network
    Heydari, Muhammad Javad
    Ghidary, Saeed Shiry
    IEEE ACCESS, 2019, 7 : 56389 - 56408
  • [38] 3D Convolutional Encoder-Decoder Network for Low-Dose CT via Transfer Learning From a 2D Trained Network (vol 37, pg 1522, 2018)
    Shan, Hongming
    Zhang, Yi
    Yang, Qingsong
    Kruger, Uwe
    Kalra, Mannudeep K.
    Sun, Ling
    Cong, Wenxiang
    Wang, Ge
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (12) : 2750 - 2750
  • [39] Reconstruction of patient-specific 3D bone surface from 2D calibrated fluoroscopic images and point distribution model
    Zheng, Guoyan
    Ballester, Miguel A. G.
    Styner, Martin
    Nolte, Lutz-Peter
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2006, PT 1, 2006, 4190 : 25 - 32
  • [40] Patient -Specific 3D CT Images Reconstruction from 2D KV Images Via Vision Transformer-Based Deep-Learning
    Ding, Y.
    Holmes, J.
    Li, B.
    Vargas, C. E.
    Vora, S. A.
    Wong, W. W.
    Fatyga, M.
    Foote, R. L.
    Patel, S. H.
    Liu, W.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2023, 117 (02): : E660 - E660