Indoor and Outdoor 3D Scene Graph Generation Via Language-Enabled Spatial Ontologies

Cited by: 1

Authors:

Strader, Jared [1]

Hughes, Nathan [1]

Chen, William [2]

Speranzon, Alberto [3]

Carlone, Luca [1]

Affiliations:

[1] MIT, Lab Informat & Decis Syst LIDS, Cambridge, MA 02139 USA

[2] Univ Calif Berkeley, Berkeley Artificial Intelligence Res BAIR, Berkeley, CA 94720 USA

[3] Lockheed Martin, Adv Technol Labs, Eagan, MN 55121 USA

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024 / Vol. 9 / Issue 06

Keywords:

AI-based methods; 3D scene graphs; semantic scene understanding; spatial ontologies

DOI:

10.1109/LRA.2024.3384084

Chinese Library Classification (CLC) number:

TP24 [Robotics]

Discipline classification codes:

080202; 1405

Abstract:

This letter proposes an approach to build 3D scene graphs in arbitrary indoor and outdoor environments. Such an extension is challenging: the hierarchy of concepts that describes an outdoor environment is more complex than for indoor environments, and manually defining such a hierarchy is time-consuming and does not scale. Furthermore, the lack of training data prevents the straightforward application of learning-based tools used in indoor settings. To address these challenges, we propose two novel extensions. First, we develop methods to build a spatial ontology defining concepts and relations relevant for indoor and outdoor robot operation. In particular, we use a Large Language Model (LLM) to build such an ontology, thus largely reducing the amount of manual effort required. Second, we leverage the spatial ontology for 3D scene graph construction using Logic Tensor Networks (LTN) to add logical rules, or axioms (e.g., "a beach contains sand"), which provide additional supervisory signals at training time, thus reducing the need for labelled data, providing better predictions, and even allowing the prediction of concepts unseen at training time. We test our approach on a variety of datasets, including indoor, rural, and coastal environments, and show that it leads to a significant increase in the quality of 3D scene graph generation with sparsely annotated data.
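
As an illustration only, the idea of an axiom acting as a supervisory signal can be sketched with fuzzy logic: predicted probabilities are treated as truth values in [0, 1], the rule "a beach contains sand" becomes a fuzzy implication, and one minus its satisfaction serves as a loss term. This is a minimal sketch and not the authors' implementation. The function names, the choice of the Reichenbach implication, and the max/mean aggregators are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def implies(a: float, b: float) -> float:
    """Reichenbach fuzzy implication: truth of (a -> b) is 1 - a + a*b."""
    return 1.0 - a + a * b


def exists(truths: Sequence[float]) -> float:
    """Fuzzy 'exists' taken as the maximum truth value (0.0 if empty)."""
    return max(truths, default=0.0)


def forall(truths: Sequence[float]) -> float:
    """Fuzzy 'for all' taken as the mean truth value (1.0 if empty)."""
    return sum(truths) / len(truths) if truths else 1.0


def axiom_loss(places: List[Tuple[float, List[float]]]) -> float:
    """Loss for the hypothetical axiom 'every beach contains sand'.

    Each entry is (p_beach, [p_sand for every object inside the place]),
    where the probabilities would come from a classifier in practice.
    """
    satisfaction = forall(
        [implies(p_beach, exists(p_sand)) for p_beach, p_sand in places]
    )
    return 1.0 - satisfaction


# A place predicted as a beach with no object predicted as sand is penalized;
# a place predicted as a beach that contains likely sand is not.
violating = [(0.9, [0.1, 0.2])]
consistent = [(0.9, [0.95, 0.1])]
print(axiom_loss(violating) > axiom_loss(consistent))
```

Because the loss depends only on predictions and the rule, it needs no ground-truth label for the place itself, which is the sense in which an axiom can reduce the amount of annotated data required.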


Pages: 4886-4893

Number of pages: 8
