PlaneFormers: From Sparse View Planes to 3D Reconstruction

Cited by: 9
Authors
Agarwala, Samir [1]
Jin, Linyi [1]
Rockwell, Chris [1]
Fouhey, David F. [1]
Affiliations
[1] Univ Michigan, Ann Arbor, MI USA
DOI
10.1007/978-3-031-20062-5_12
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success. Code is available at https://github.com/samiragarwala/PlaneFormers.
Pages: 192-209
Page count: 18
Related papers
50 in total
  • [1] A review on 3D Gaussian splatting for sparse view reconstruction
    Haitian Liu
    Binglin Liu
    Qianchao Hu
    Peilun Du
    Jing Li
    Yang Bao
    Feng Wang
    Artificial Intelligence Review, 58 (7)
  • [2] 3D Clothed Human Reconstruction from Sparse Multi-View Images
    Hong, Jin Gyu
    Noh, Seung Young
    Lee, Hee Kyung
    Cheong, Won Sik
    Chang, Ju Yong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 677 - 687
  • [3] Single and sparse view 3D reconstruction by learning shape priors
    Chen, Yu
    Cipolla, Roberto
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 115 (05) : 586 - 602
  • [4] 3D Reconstruction and Analysis of Bat Flight Maneuvers from Sparse Multiple View Video
    Bergou, A. J.
    Swartz, S.
    Breuer, K.
    Taubin, G.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2012, 52 : E211 - E211
  • [5] Volume reconstruction from sparse 3D ultrasonography
    Gooding, MJ
    Kennedy, S
    Noble, JA
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2003, PT 2, 2003, 2879 : 416 - 423
  • [6] A 3D Freehand Ultrasound System for Multi-view Reconstructions from Sparse 2D Scanning Planes
    Honggang Yu
    Marios S Pattichis
    Carla Agurto
    M Beth Goens
    BioMedical Engineering OnLine, 10
  • [7] A 3D Freehand Ultrasound System for Multi-view Reconstructions from Sparse 2D Scanning Planes
    Yu, Honggang
    Pattichis, Marios S.
    Agurto, Carla
    Goens, M. Beth
    BIOMEDICAL ENGINEERING ONLINE, 2011, 10
  • [8] 3D road reconstruction from a single view
    Guiducci, A
    COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 70 (02) : 212 - 226
  • [9] 3D road reconstruction from a single view
    Istituto Elettrotecnico Nazionale, `Galileo Ferraris', Torino, Italy
    Comput Vision Image Understanding, 2 (212-226):
  • [10] 3D hand reconstruction from a monocular view
    Lee, SU
    Cohen, I
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 310 - 313