PlaneFormers: From Sparse View Planes to 3D Reconstruction

被引:9
|
作者
Agarwala, Samir [1 ]
Jin, Linyi [1 ]
Rockwell, Chris [1 ]
Fouhey, David F. [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI USA
来源
关键词
D O I
10.1007/978-3-031-20062-5_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success. Code is available at https://github.com/samiragarwala/PlaneFormers.
引用
收藏
页码:192 / 209
页数:18
相关论文
共 50 条
  • [41] A statistical method for robust 3D surface reconstruction from sparse data
    Blanz, V
    Mehl, A
    Vetter, T
    Seidel, HP
    2ND INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING, VISUALIZATION, AND TRANSMISSION, PROCEEDINGS, 2004, : 293 - 300
  • [42] Neural 3D reconstruction from sparse views using geometric priors
    Tai-Jiang Mu
    Hao-Xiang Chen
    Jun-Xiong Cai
    Ning Guo
    Computational Visual Media, 2023, 9 : 687 - 697
  • [43] Automated Reconstruction of 3D Open Surfaces from Sparse Point Clouds
    Arshad, Mohammad Samiul
    Beksi, William J.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT (ISMAR-ADJUNCT 2022), 2022, : 216 - 221
  • [44] 3D curve structure reconstruction from a sparse set of unordered images
    Zheng Jian-dong
    Zhang Li-yan
    Du Xiao-yu
    Ding Zhi-an
    COMPUTERS IN INDUSTRY, 2009, 60 (02) : 126 - 134
  • [45] Sparse-view planar 3D reconstruction method based on hierarchical token pooling Transformer
    Zhang, Jiahui
    Yang, Jinfu
    Fu, Fuji
    Ma, Jiaqi
    APPLIED SOFT COMPUTING, 2025, 174
  • [46] An FPGA Accelerator for 3D Cone-beam Sparse-view Computed Tomography Reconstruction
    Gu, Yuhan
    Wu, Qing
    Yuan, Zhechen
    Zhang, Xiangyu
    Su, Wenyan
    Zhang, Yuyao
    Lou, Xin
    2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 577 - 581
  • [47] High-precision 3D reconstruction of terahertz computed tomography under extremely sparse view
    Dou, Jiazhen
    Fang, Jiongshen
    Jiang, Wenjun
    Di, Jianglei
    Qin, Yuwen
    OPTICS AND LASERS IN ENGINEERING, 2025, 186
  • [48] Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction
    Lin, Yiqun
    Wang, Hualiang
    Chen, Jixiang
    Li, Xiaomeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII, 2024, 15007 : 425 - 435
  • [49] Learning View Priors for Single-view 3D Reconstruction
    Kato, Hiroharu
    Harada, Tatsuya
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9770 - 9779
  • [50] A View-Planning Approach to 3D Reconstruction
    Turkar, Yash
    Aluckal, Christo
    Adhivarahan, Charuvahan
    Sebastiani, Alessandro
    Dantu, Karthik
    EXTENDED REALITY, PT III, XR SALENTO 2024, 2024, 15029 : 340 - 350