On the Road to Large-Scale 3D Monocular Scene Reconstruction using Deep Implicit Functions

Cited by: 1

Authors:

Roddick, Thomas [1]

Biggs, Benjamin [1]

Reino, Daniel Olmeda [2]

Cipolla, Roberto [1]

Affiliations:

[1] Univ Cambridge, Cambridge, England

[2] Toyota Motor Europe, Brussels, Belgium

Source:

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021) | 2021

DOI:

10.1109/ICCVW54120.2021.00322

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Autonomous driving relies on building detailed models of a vehicle's surroundings, including all hazards, obstacles and other road users. At present, much of the autonomous driving literature reduces the world to a collection of parametric 3D boxes. While this framework is sufficient for many driving scenarios, other important scene details (e.g. overhanging structures, open car doors, debris, potholes) are not modelled. Recently, deep implicit functions have been shown to be suitable for representing fine-grained details at arbitrarily high resolutions using images alone. However, they have predominantly been employed in constrained situations, such as reconstructing individual objects or small-scale scenes. In this work we explore the application of deep implicit functions to larger scenes in the context of real-world autonomous driving scenarios. In particular, we focus on the challenging case where only monocular images are available at test time. While most implicit function networks rely on watertight meshes for training, these are not in general available for real-world scenes. We therefore propose an alternative training scheme using LiDAR to provide approximate ground truth occupancy supervision. We also show that incorporating priors such as pre-detected object bounding boxes can improve the quality of reconstruction. Our method is evaluated on a real-world autonomous driving dataset.

Pages: 2875-2884

Page count: 10

