HI-SLAM: Hierarchical implicit neural representation for SLAM

Authors
Li, Jingbo [1]
Firkat, Eksan [4, 5]
Zhu, Jingyu [3]
Zhu, Bin [2]
Zhu, Jihong [3]
Hamdulla, Askar [1]
Affiliations
[1] Xinjiang Univ, Sch Informat Sci & Engn, 666 Shengli Rd, Urumqi, Xinjiang, Peoples R China
[2] Tsinghua Univ, Dept Automat, 33 Shuangqing Rd, Beijing, Peoples R China
[3] Tsinghua Univ, Dept Precis Instrument, 33 Shuangqing Rd, Beijing, Peoples R China
[4] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[5] Great Bay Univ, Dongguan, Guangdong, Peoples R China
Keywords
Dense visual SLAM; Neural implicit representations; Localization; RGB-D camera; FEATURE FUSION; VERSATILE;
DOI
10.1016/j.eswa.2025.126487
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Implicit neural representations learn a representation of a high-dimensional feature space, which improves a model's expressive power and performance, and they have been applied with strong results in many fields. Dense visual SLAM is one of the areas that has benefited from their development. However, current methods rely on simple fully connected network architectures, which leads to poor generalization, insufficient real-time performance, and an inability to balance global and local optimization. This paper proposes a hierarchical scene representation that treats color information and geometric information as equally important: both are encoded into feature grids of different resolutions, which are combined with multiple corresponding multi-layer perceptron (MLP) decoders. The coarse-level grid captures the general shape and structure of the global scene and makes reasonable predictions for unobserved regions. In contrast, the medium-fine-level grid represents geometric details and color information. Encoding geometry and color in grids of different resolutions yields rich, high-fidelity reconstructions in large-scale scenes. Keyframes are selected so that local scene information is optimized while redundant information is not retained. Compared with recent dense visual SLAM systems based on implicit neural representations, the proposed method generalizes better and operates more robustly, efficiently, and precisely in large-scale scenes.
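The hierarchical representation described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the grid resolutions, feature width, random untrained weights, the residual way the coarse and fine geometry decoders are combined, and all names (`trilinear`, `make_mlp`, `query`, `coarse_geo`, `fine_geo`, `fine_col`) are assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

def trilinear(grid, pts):
    """Trilinearly interpolate a (R, R, R, C) feature grid at points in [0, 1]^3."""
    res = grid.shape[0]
    x = np.clip(pts, 0.0, 1.0) * (res - 1)             # continuous voxel coordinates
    i0 = np.minimum(np.floor(x).astype(int), res - 2)  # lower corner of the enclosing cell
    t = x - i0                                         # fractional offset inside the cell
    out = np.zeros((pts.shape[0], grid.shape[-1]))
    for dx in (0, 1):
        for dy in (0, 1):
            for dz in (0, 1):
                w = ((t[:, 0] if dx else 1 - t[:, 0])
                     * (t[:, 1] if dy else 1 - t[:, 1])
                     * (t[:, 2] if dz else 1 - t[:, 2]))
                out += w[:, None] * grid[i0[:, 0] + dx, i0[:, 1] + dy, i0[:, 2] + dz]
    return out

def make_mlp(d_in, d_hidden, d_out):
    """Random (untrained) weights for a one-hidden-layer MLP decoder."""
    return (rng.normal(0, 0.1, (d_in, d_hidden)), np.zeros(d_hidden),
            rng.normal(0, 0.1, (d_hidden, d_out)), np.zeros(d_out))

def mlp(params, x):
    w1, b1, w2, b2 = params
    return np.maximum(x @ w1 + b1, 0.0) @ w2 + b2      # ReLU hidden layer

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

C = 8                                                   # feature width (assumed)
coarse_geo = rng.normal(0, 0.1, (4, 4, 4, C))           # coarse geometry grid
fine_geo = rng.normal(0, 0.1, (16, 16, 16, C))          # finer geometry grid
fine_col = rng.normal(0, 0.1, (16, 16, 16, C))          # color grid
dec_coarse = make_mlp(C, 32, 1)
dec_fine = make_mlp(2 * C, 32, 1)
dec_color = make_mlp(C, 32, 3)

def query(pts):
    """Return occupancy in (0, 1) and RGB in (0, 1) for (N, 3) points."""
    f_c = trilinear(coarse_geo, pts)
    f_f = trilinear(fine_geo, pts)
    # Assumption: the fine decoder adds a residual on top of the coarse prediction.
    logit = mlp(dec_coarse, f_c) + mlp(dec_fine, np.concatenate([f_f, f_c], axis=1))
    rgb = sigmoid(mlp(dec_color, trilinear(fine_col, pts)))
    return sigmoid(logit[:, 0]), rgb

occupancy, rgb = query(rng.random((5, 3)))
print(occupancy.shape, rgb.shape)
```

In a full system the grid features and decoder weights would be optimized from RGB-D observations; here they are random, so the sketch only shows how a query point reads features from several resolutions and passes them to separate geometry and color decoders.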
Pages: 10