Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs

Cited by: 0
Authors
Cheng, Yao [1, 2]
Han, Zhe [2]
Jiang, Fengyang [1, 2]
Wang, Huaizhen [1, 2]
Zhou, Fengyu [3]
Yin, Qingshan [2]
Wei, Lei [2]
Affiliations
[1] Shandong New Generat Informat Ind Technol Res Ins, Jinan 250100, Shandong, Peoples R China
[2] Inspur Intelligent Terminal Co Ltd, Jinan 250100, Shandong, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Shandong, Peoples R China
DOI
10.1109/WRCSARA64167.2024.10685765
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper addresses the high demand in advanced intelligent robot navigation for a more holistic understanding of spatial environments, by introducing a novel system that harnesses the capabilities of Large Language Models (LLMs) to construct hierarchical 3D Scene Graphs (3DSGs) for indoor scenarios. The proposed framework constructs 3DSGs consisting of a fundamental layer with rich metric-semantic information, an object layer featuring precise point-cloud representation of object nodes as well as visual descriptors, and higher layers of room, floor, and building nodes. Thanks to the innovative application of LLMs, not only object nodes but also nodes of higher layers, e.g., room nodes, are annotated in an intelligent and accurate manner. A polling mechanism for room classification using LLMs is proposed to enhance the accuracy and reliability of the room node annotation. Thorough numerical experiments demonstrate the system's ability to integrate semantic descriptions with geometric data, creating an accurate and comprehensive representation of the environment instrumental for context-aware navigation and task planning.
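The layered structure and the polling idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify how the polling works, so the sketch assumes a simple majority vote over repeated queries. The names `SceneNode`, `poll_room_label`, and `make_stub_llm` are hypothetical, the LLM is replaced by a deterministic stub, and the fundamental metric-semantic layer, point clouds, and visual descriptors are omitted.

```python
from collections import Counter
from dataclasses import dataclass, field
from itertools import cycle
from typing import Callable, List, Optional, Sequence, Tuple


@dataclass
class SceneNode:
    """One node of the hierarchy; `layer` names the level it belongs to."""
    node_id: str
    layer: str  # "building", "floor", "room", or "object"
    label: Optional[str] = None
    children: List["SceneNode"] = field(default_factory=list)

    def add(self, child: "SceneNode") -> "SceneNode":
        """Attach a child node and return it so calls can be chained."""
        self.children.append(child)
        return child


def poll_room_label(
    object_labels: Sequence[str],
    ask_llm: Callable[[Sequence[str]], str],
    rounds: int = 5,
) -> Tuple[str, float]:
    """Query the classifier `rounds` times and keep the majority answer.

    Assumption: polling is modelled as a plain majority vote. Returns the
    winning label and the fraction of rounds that agreed with it.
    """
    if rounds < 1:
        raise ValueError("rounds must be at least 1")
    votes = Counter(ask_llm(object_labels) for _ in range(rounds))
    label, count = votes.most_common(1)[0]
    return label, count / rounds


def make_stub_llm(answers: Sequence[str]) -> Callable[[Sequence[str]], str]:
    """Hypothetical stand-in for an LLM call: replays canned answers."""
    replies = cycle(answers)
    return lambda _labels: next(replies)


# Build a tiny building -> floor -> room -> object hierarchy.
building = SceneNode("b0", "building")
floor = building.add(SceneNode("f0", "floor"))
room = floor.add(SceneNode("r0", "room"))
for i, name in enumerate(["stove", "sink", "refrigerator"]):
    room.add(SceneNode(f"o{i}", "object", label=name))

# Annotate the room node from the labels of the objects it contains.
stub = make_stub_llm(["kitchen", "kitchen", "dining room", "kitchen", "kitchen"])
room.label, agreement = poll_room_label([c.label for c in room.children], stub)
```

In this sketch one of the five canned replies disagrees, so the room is labelled `kitchen` with an agreement of 0.8. A real system would replace the stub with an actual model call and might weight votes or vary the prompt between rounds; those choices are outside what the abstract states.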
Pages: 483-490
Number of pages: 8