Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs

被引:0
|
作者
Cheng, Yao [1 ,2 ]
Han, Zhe [2 ]
Jiang, Fengyang [1 ,2 ]
Wang, Huaizhen [1 ,2 ]
Zhou, Fengyu [3 ]
Yin, Qingshan [2 ]
Wei, Lei [2 ]
机构
[1] Shandong New Generat Informat Ind Technol Res Ins, Jinan 250100, Shandong, Peoples R China
[2] Inspur Intelligent Terminal Co Ltd, Jinan 250100, Shandong, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Shandong, Peoples R China
关键词
D O I
10.1109/WRCSARA64167.2024.10685765
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the high demand in advanced intelligent robot navigation for a more holistic understanding of spatial environments, by introducing a novel system that harnesses the capabilities of Large Language Models (LLMs) to construct hierarchical 3D Scene Graphs (3DSGs) for indoor scenarios. The proposed framework constructs 3DSGs consisting of a fundamental layer with rich metric-semantic information, an object layer featuring precise point-cloud representation of object nodes as well as visual descriptors, and higher layers of room, floor, and building nodes. Thanks to the innovative application of LLMs, not only object nodes but also nodes of higher layers, e.g., room nodes, are annotated in an intelligent and accurate manner. A polling mechanism for room classification using LLMs is proposed to enhance the accuracy and reliability of the room node annotation. Thorough numerical experiments demonstrate the system's ability to integrate semantic descriptions with geometric data, creating an accurate and comprehensive representation of the environment instrumental for context-aware navigation and task planning.
引用
收藏
页码:483 / 490
页数:8
相关论文
共 50 条
  • [21] Recognition of Indoor Scenes Using 3-D Scene Graphs
    Yue, Han
    Lehtola, Ville
    Wu, Hangbin
    Vosselman, George
    Li, Jincheng
    Liu, Chun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
  • [22] Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs
    Savkin, Artem
    Ellouze, Rachid
    Navab, Nassir
    Tombari, Federico
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1229 - 1235
  • [23] 3D Interference Mapping for Indoor IoT Scenarios
    Ivanov, Antoni
    Stoynov, Viktor
    Angelov, Kliment
    Stefanov, Radostin
    Atamyan, Dimitar
    Tonchev, Krasimir
    Poulkov, Vladimir
    2020 43RD INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2020, : 265 - 269
  • [24] 3D spatial pyramid: descriptors generation from point clouds for indoor scene classification
    Romero-Gonzalez, Cristina
    Martinez-Gomez, Jesus
    Garcia-Varea, Ismael
    Rodriguez-Ruiz, Luis
    MACHINE VISION AND APPLICATIONS, 2016, 27 (02) : 263 - 273
  • [25] 3D spatial pyramid: descriptors generation from point clouds for indoor scene classification
    Cristina Romero-González
    Jesus Martínez-Gómez
    Ismael García-Varea
    Luis Rodríguez-Ruiz
    Machine Vision and Applications, 2016, 27 : 263 - 273
  • [26] Hierarchical Higher-order Regression Forest Fields: An Application to 3D Indoor Scene Labelling
    Pham, Trung T.
    Reid, Ian
    Latif, Yasir
    Gould, Stephen
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2246 - 2254
  • [27] SceneHGN: Hierarchical Graph Networks for 3D Indoor Scene Generation With Fine-Grained Geometry
    Gao, Lin
    Sun, Jia-Mu
    Mo, Kaichun
    Lai, Yu-Kun
    Guibas, Leonidas J.
    Yang, Jie
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8902 - 8919
  • [28] Perception of 3D spatial relations for 3D displays
    Rosen, P
    Pizlo, Z
    Hoffmann, C
    Popescu, V
    STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS XI, 2004, 5291 : 9 - 16
  • [29] Semantic and Geometric Modeling with Neural Message Passing in 3D Scene Graphs for Hierarchical Mechanical Search
    Kurenkov, Andrey
    Martin-Martin, Roberto
    Ichnowski, Jeff
    Goldberg, Ken
    Savarese, Silvio
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 11227 - 11233
  • [30] Spatial inference in scene perception for 3-D scenes
    Intraub, H.
    PERCEPTION, 2000, 29 : 113 - 113