Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs

被引:0
|
作者
Cheng, Yao [1 ,2 ]
Han, Zhe [2 ]
Jiang, Fengyang [1 ,2 ]
Wang, Huaizhen [1 ,2 ]
Zhou, Fengyu [3 ]
Yin, Qingshan [2 ]
Wei, Lei [2 ]
机构
[1] Shandong New Generat Informat Ind Technol Res Ins, Jinan 250100, Shandong, Peoples R China
[2] Inspur Intelligent Terminal Co Ltd, Jinan 250100, Shandong, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Shandong, Peoples R China
关键词
D O I
10.1109/WRCSARA64167.2024.10685765
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the high demand in advanced intelligent robot navigation for a more holistic understanding of spatial environments, by introducing a novel system that harnesses the capabilities of Large Language Models (LLMs) to construct hierarchical 3D Scene Graphs (3DSGs) for indoor scenarios. The proposed framework constructs 3DSGs consisting of a fundamental layer with rich metric-semantic information, an object layer featuring precise point-cloud representation of object nodes as well as visual descriptors, and higher layers of room, floor, and building nodes. Thanks to the innovative application of LLMs, not only object nodes but also nodes of higher layers, e.g., room nodes, are annotated in an intelligent and accurate manner. A polling mechanism for room classification using LLMs is proposed to enhance the accuracy and reliability of the room node annotation. Thorough numerical experiments demonstrate the system's ability to integrate semantic descriptions with geometric data, creating an accurate and comprehensive representation of the environment instrumental for context-aware navigation and task planning.
引用
收藏
页码:483 / 490
页数:8
相关论文
共 50 条
  • [31] Efficient Structured Prediction for 3D Indoor Scene Understanding
    Schwing, Alexander G.
    Hazan, Tamir
    Pollefeys, Marc
    Urtasun, Raquel
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2815 - 2822
  • [32] Action-Driven 3D Indoor Scene Evolution
    Ma, Rui
    Li, Honghua
    Zou, Changqing
    Liao, Zicheng
    Tong, Xin
    Zhang, Hao
    ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):
  • [33] 3D indoor scene assessment via layout plausibility
    Yang, Xinyan
    Hu, Fei
    Liu, Shaofei
    Ye, Long
    Wang, Ye
    Zhu, Guanghua
    Li, Jiyin
    DISPLAYS, 2025, 87
  • [34] Neural 3D Scene Reconstruction With Indoor Planar Priors
    Zhou, Xiaowei
    Guo, Haoyu
    Peng, Sida
    Xiao, Yuxi
    Lin, Haotong
    Wang, Qianqian
    Zhang, Guofeng
    Bao, Hujun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6355 - 6366
  • [35] StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions
    Hoellein, Lukas
    Johnson, Justin
    Niessner, Matthias
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6188 - 6198
  • [36] Efficient Exact Inference for 3D Indoor Scene Understanding
    Schwing, Alexander G.
    Urtasun, Raquel
    COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 : 299 - 313
  • [37] Semantic scene analysis of scanned 3D indoor environments
    Nüchter, A
    Surmann, H
    Lingemann, K
    Hertzberg, J
    VISION, MODELING, AND VISUALIZATION 2003, 2003, : 215 - +
  • [38] Automatic 3D Indoor Scene Updating with RGBD Cameras
    Liu, Zhenbao
    Tang, Sicong
    Xu, Weiwei
    Bu, Shuhui
    Han, Junwei
    Zhou, Kun
    COMPUTER GRAPHICS FORUM, 2014, 33 (07) : 269 - 278
  • [39] Indoor and Outdoor 3D Scene Graph Generation Via Language-Enabled Spatial Ontologies
    Strader, Jared
    Hughes, Nathan
    Chen, William
    Speranzon, Alberto
    Carlone, Luca
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06): : 4886 - 4893
  • [40] 3D Building Scene Reconstruction Based on 3D LiDAR Point Cloud
    Yang, Shih-Chi
    Fan, Yu-Cheng
    2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2017,