PlaneFormers: From Sparse View Planes to 3D Reconstruction

Cited by: 9
Authors
Agarwala, Samir [1]
Jin, Linyi [1]
Rockwell, Chris [1]
Fouhey, David F. [1]
Affiliations
[1] Univ Michigan, Ann Arbor, MI, USA
DOI
10.1007/978-3-031-20062-5_12
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success. Code is available at https://github.com/samiragarwala/PlaneFormers.
Pages: 192-209
Page count: 18