Indoor and Outdoor 3D Scene Graph Generation Via Language-Enabled Spatial Ontologies

被引:1
|
作者
Strader, Jared [1 ]
Hughes, Nathan [1 ]
Chen, William [2 ]
Speranzon, Alberto [3 ]
Carlone, Luca [1 ]
机构
[1] MIT, Lab Informat & Decis Syst LIDS, Cambridge, MA 02139 USA
[2] Univ Calif Berkeley, Berkeley Artificial Intelligence Res BAIR, Berkeley, CA 94720 USA
[3] Lockheed Martin, Adv Technol Labs, Eagan, MN 55121 USA
来源
关键词
AI-based methods; 3D scene graphs; semantic scene understanding; spatial ontologies;
D O I
10.1109/LRA.2024.3384084
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This letter proposes an approach to build 3D scene graphs in arbitrary indoor and outdoor environments. Such extension is challenging; the hierarchy of concepts that describe an outdoor environment is more complex than for indoors, and manually defining such hierarchy is time-consuming and does not scale. Furthermore, the lack of training data prevents the straightforward application of learning-based tools used in indoor settings. To address these challenges, we propose two novel extensions. First, we develop methods to build a spatial ontology defining concepts and relations relevant for indoor and outdoor robot operation. In particular, we use a Large Language Model (LLM) to build such an ontology, thus largely reducing the amount of manual effort required. Second, we leverage the spatial ontology for 3D scene graph construction using Logic Tensor Networks (LTN) to add logical rules, or axioms (e.g., "a beach contains sand"), which provide additional supervisory signals at training time thus reducing the need for labelled data, providing better predictions, and even allowing predicting concepts unseen at training time. We test our approach in a variety of datasets, including indoor, rural, and coastal environments, and show that it leads to a significant increase in the quality of the 3D scene graph generation with sparsely annotated data.
引用
收藏
页码:4886 / 4893
页数:8
相关论文
共 50 条
  • [21] 3D Scene Reconstruction and Object Recognition for Indoor Scene
    Shen, Yangping
    Manabe, Yoshitsugu
    Yata, Noriko
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) 2019, 2019, 11049
  • [22] A Survey of 3D Indoor Scene Synthesis
    Song-Hai Zhang
    Shao-Kui Zhang
    Yuan Liang
    Peter Hall
    Journal of Computer Science and Technology, 2019, 34 : 594 - 608
  • [23] A Survey of 3D Indoor Scene Synthesis
    Zhang, Song-Hai
    Zhang, Shao-Kui
    Liang, Yuan
    Hall, Peter
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2019, 34 (03) : 594 - 608
  • [24] A detailed analysis of a new 3D spatial feature vector for indoor scene classification
    Swadzba, Agnes
    Wachsmuth, Sven
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2014, 62 (05) : 646 - 662
  • [25] Fast 3D Indoor Scene Synthesis by Learning Spatial Relation Priors of Objects
    Zhang, Song-Hai
    Zhang, Shao-Kui
    Xie, Wei-Yu
    Luo, Cheng-Yang
    Yang, Yong-Liang
    Fu, Hongbo
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (09) : 3082 - 3092
  • [26] Automatic generation of 3D outdoor and indoor building scenes from a single image
    Vouzounaras, Georgios
    Daras, Petros
    Strintzis, Michael G.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 70 (01) : 361 - 378
  • [27] Automatic generation of 3D outdoor and indoor building scenes from a single image
    Georgios Vouzounaras
    Petros Daras
    Michael G. Strintzis
    Multimedia Tools and Applications, 2014, 70 : 361 - 378
  • [28] Performance Evaluation and Optimization of 3D Gaussian Splatting in Indoor Scene Generation and Rendering
    Fang, Xinjian
    Zhang, Yingdan
    Tan, Hao
    Liu, Chao
    Yang, Xu
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2025, 14 (01)
  • [29] Text to Scene: A System of Configurable 3D Indoor Scene Synthesis
    Yang, Xinyan
    Hu, Fei
    Ye, Long
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2819 - 2821
  • [30] SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation
    Lv, Changsheng
    Qi, Mengshi
    Li, Xia
    Yang, Zhengyuan
    Ma, Huadong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4035 - 4043