On the Road to Large-Scale 3D Monocular Scene Reconstruction using Deep Implicit Functions

Cited by: 1
Authors
Roddick, Thomas [1]
Biggs, Benjamin [1]
Reino, Daniel Olmeda [2]
Cipolla, Roberto [1]
Affiliations
[1] Univ Cambridge, Cambridge, England
[2] Toyota Motor Europe, Brussels, Belgium
DOI
10.1109/ICCVW54120.2021.00322
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Autonomous driving relies on building detailed models of a vehicle's surroundings, including all hazards, obstacles and other road users. At present, much of the autonomous driving literature reduces the world to a collection of parametric 3D boxes. While this framework is sufficient for many driving scenarios, other important scene details (e.g. overhanging structures, open car doors, debris and potholes) are not modelled. Recently, deep implicit functions have been shown to be suitable for representing fine-grained details at arbitrarily high resolutions using images alone. However, they have predominantly been employed in constrained situations, such as reconstructing individual objects or small-scale scenes. In this work we explore the application of deep implicit functions to larger scenes in the context of real-world autonomous driving scenarios. In particular, we focus on the challenging case where only monocular images are available at test time. While most implicit function networks rely on watertight meshes for training, these are not generally available for real-world scenes. We therefore propose an alternative training scheme using LiDAR to provide approximate ground-truth occupancy supervision. We also show that incorporating priors such as pre-detected object bounding boxes can improve the quality of reconstruction. Our method is evaluated on a real-world autonomous driving dataset.
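The abstract does not spell out how LiDAR returns are turned into occupancy labels. As a rough illustration only, and not the authors' actual procedure, one common way to derive approximate labels is to treat space along each beam before the return as free and a thin shell just behind the return as occupied. The function name `sample_occupancy_labels` and the parameters `n_free` and `shell` below are hypothetical and chosen purely for this sketch.

```python
import random


def sample_occupancy_labels(origin, hits, n_free=4, shell=0.2, seed=0):
    """Return a list of (point, label) pairs; 0 = free space, 1 = occupied.

    Assumption for this sketch: every beam starts at `origin`, space before
    the return is empty, and a shell of thickness `shell` (same units as the
    points) directly behind the return is solid.
    """
    rng = random.Random(seed)
    samples = []
    for hit in hits:
        ray = [h - o for h, o in zip(hit, origin)]
        length = sum(c * c for c in ray) ** 0.5
        if length == 0.0:
            continue  # degenerate return located at the sensor origin
        # Largest ray fraction still considered free (stop `shell` short of the hit).
        t_max = max(0.0, 1.0 - shell / length)
        for _ in range(n_free):
            t = rng.uniform(0.0, t_max)
            point = tuple(o + t * c for o, c in zip(origin, ray))
            samples.append((point, 0))
        # One occupied sample inside the shell just beyond the return.
        t = 1.0 + rng.uniform(0.0, 1.0) * shell / length
        point = tuple(o + t * c for o, c in zip(origin, ray))
        samples.append((point, 1))
    return samples


if __name__ == "__main__":
    for point, label in sample_occupancy_labels((0.0, 0.0, 0.0), [(10.0, 0.0, 0.0)]):
        print(label, point)
```

Pairs like these could be fed to a binary cross-entropy loss on a network's predicted occupancy. Labels built this way are only approximate, since regions that no beam passes through receive no supervision.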
Pages: 2875-2884
Number of pages: 10