Indoor and Outdoor 3D Scene Graph Generation Via Language-Enabled Spatial Ontologies

被引:1
|
作者
Strader, Jared [1 ]
Hughes, Nathan [1 ]
Chen, William [2 ]
Speranzon, Alberto [3 ]
Carlone, Luca [1 ]
机构
[1] MIT, Lab Informat & Decis Syst LIDS, Cambridge, MA 02139 USA
[2] Univ Calif Berkeley, Berkeley Artificial Intelligence Res BAIR, Berkeley, CA 94720 USA
[3] Lockheed Martin, Adv Technol Labs, Eagan, MN 55121 USA
来源
关键词
AI-based methods; 3D scene graphs; semantic scene understanding; spatial ontologies;
D O I
10.1109/LRA.2024.3384084
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This letter proposes an approach to build 3D scene graphs in arbitrary indoor and outdoor environments. Such extension is challenging; the hierarchy of concepts that describe an outdoor environment is more complex than for indoors, and manually defining such hierarchy is time-consuming and does not scale. Furthermore, the lack of training data prevents the straightforward application of learning-based tools used in indoor settings. To address these challenges, we propose two novel extensions. First, we develop methods to build a spatial ontology defining concepts and relations relevant for indoor and outdoor robot operation. In particular, we use a Large Language Model (LLM) to build such an ontology, thus largely reducing the amount of manual effort required. Second, we leverage the spatial ontology for 3D scene graph construction using Logic Tensor Networks (LTN) to add logical rules, or axioms (e.g., "a beach contains sand"), which provide additional supervisory signals at training time thus reducing the need for labelled data, providing better predictions, and even allowing predicting concepts unseen at training time. We test our approach in a variety of datasets, including indoor, rural, and coastal environments, and show that it leads to a significant increase in the quality of the 3D scene graph generation with sparsely annotated data.
引用
收藏
页码:4886 / 4893
页数:8
相关论文
共 50 条
  • [1] Scene Graph Masked Variational Autoencoders for 3D Scene Generation
    Xu, Rui
    Hui, Le
    Han, Yuehui
    Qian, Jianjun
    Xie, Jin
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5725 - 5733
  • [2] Spatial Distribution Feature for 3D Indoor Scene Labelling
    Lang, Yankun
    Wu, Haiyuan
    Chen, Qian
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 66 - 70
  • [3] 3D spatial pyramid: descriptors generation from point clouds for indoor scene classification
    Romero-Gonzalez, Cristina
    Martinez-Gomez, Jesus
    Garcia-Varea, Ismael
    Rodriguez-Ruiz, Luis
    MACHINE VISION AND APPLICATIONS, 2016, 27 (02) : 263 - 273
  • [4] 3D spatial pyramid: descriptors generation from point clouds for indoor scene classification
    Cristina Romero-González
    Jesus Martínez-Gómez
    Ismael García-Varea
    Luis Rodríguez-Ruiz
    Machine Vision and Applications, 2016, 27 : 263 - 273
  • [5] The development of a language interface for 3d scene generation
    Zeng, Xin
    Tan, Manling
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION, 2007, : 136 - 141
  • [6] 3D scene graph representation and application for intelligent indoor spaces
    Tang, Shengjun
    Du, Siqi
    Wang, Weixi
    Guo, Renzhong
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (07): : 1355 - 1370
  • [7] SceneHGN: Hierarchical Graph Networks for 3D Indoor Scene Generation With Fine-Grained Geometry
    Gao, Lin
    Sun, Jia-Mu
    Mo, Kaichun
    Lai, Yu-Kun
    Guibas, Leonidas J.
    Yang, Jie
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8902 - 8919
  • [8] Learning Graph Variational Autoencoders with Constraints and Structured Priors for Conditional Indoor 3D Scene Generation
    Chattopadhyay, Aditya
    Zhang, Xi
    Wipf, David Paul
    Arora, Himanshu
    Vidal, Rene
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 785 - 794
  • [9] 3D indoor scene assessment via layout plausibility
    Yang, Xinyan
    Hu, Fei
    Liu, Shaofei
    Ye, Long
    Wang, Ye
    Zhu, Guanghua
    Li, Jiyin
    DISPLAYS, 2025, 87
  • [10] 3D Scene Graph Generation From Point Clouds
    Wei, Wenwen
    Wei, Ping
    Qin, Jialu
    Liao, Zhimin
    Wang, Shuaijie
    Cheng, Xiang
    Liu, Meiqin
    Zheng, Nanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5358 - 5368