HI-SLAM: Hierarchical implicit neural representation for SLAM

被引:0
|
作者
Li, Jingbo [1 ]
Firkat, Eksan [4 ,5 ]
Zhu, Jingyu [3 ]
Zhu, Bin [2 ]
Zhu, Jihong [3 ]
Hamdulla, Askar [1 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, 666 Shengli Rd, Urumqi, Xinjiang, Peoples R China
[2] Tsinghua Univ, Dept Automat, 33 Shuangqing Rd, Beijing, Peoples R China
[3] Tsinghua Univ, Dept Precis Instrument, 33 Shuangqing Rd, Beijing, Peoples R China
[4] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[5] Great Bay Univ, Dongguan, Guangdong, Peoples R China
关键词
Dense visual SLAM; Neural implicit representations; Localization; RGB-D camera; FEATURE FUSION; VERSATILE;
D O I
10.1016/j.eswa.2025.126487
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Implicit neural representation can improve the expressive ability and performance of the model by learning the representation of high-dimensional feature space and has a wide range of applications in many fields and an exciting performance. Dense visual SLAM is one of the beneficiaries of the development of implicit neural representations. Still, the current methods are based on simple fully connected network architectures, resulting in poor generalization ability, insufficient real-time performance and inability to balance global and local optimization. This paper propose a hierarchical scene representation that treats color information and geometric information as equally important, one that encodes geometric and color information into different resolution grid sizes and combines multiple corresponding multi-layer perceptron decoders. The coarse-level grid captures the general shape and structure of the global scene and makes reasonable predictions for unobserved regions.In contrast, the medium-fine-level grid finely represents geometric details and color information. Rich and comprehensive high-fidelity reconstructions can be obtained in large-scale scenes by using meshes of different resolutions to encode geometric and color information. In this study, selectable keyframes are used to ensure that the local information of the scene is optimized while reducing redundant information preservation. Compared with recent dense visual SLAM systems via implicit neural representations, our method generalizes and operates more robustly, efficiently, and precisely in large-scale scenes.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] ENeRF-SLAM: A Dense Endoscopic SLAM With Neural Implicit Representation
    Shan, Jiwei
    Li, Yirui
    Xie, Ting
    Wang, Hesheng
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (03): : 1030 - 1041
  • [2] HI-SLAM: Monocular Real-Time Dense Mapping With Hybrid Implicit Fields
    Zhang, Wei
    Sun, Tiecheng
    Wang, Sen
    Cheng, Qing
    Haala, Norbert
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02): : 1548 - 1555
  • [3] Structerf-SLAM: Neural implicit representation SLAM for structural environments
    Wang, Haocheng
    Cao, Yanlong
    Wei, Xiaoyao
    Shou, Yejun
    Shen, Lingfeng
    Xu, Zhijie
    Ren, Kai
    COMPUTERS & GRAPHICS-UK, 2024, 119
  • [4] DMN-SLAM: Multi-MLPs Neural Implicit Representation SLAM for Dynamic Environments
    Lian, Xiaofeng
    Kang, Maomao
    Sun, Xiao
    Tan, Li
    Liu, He
    IEEE ACCESS, 2025, 13 : 29432 - 29444
  • [5] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
    Zhu, Zihan
    Peng, Songyou
    Larsson, Viktor
    Xu, Weiwei
    Bao, Hujun
    Cui, Zhaopeng
    Oswald, Martin R.
    Pollefeys, Marc
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12776 - 12786
  • [6] KN-SLAM: Keypoints and Neural Implicit Encoding SLAM
    Wu, Xingming
    Liu, Zimeng
    Tian, Yuxin
    Liu, Zhong
    Chen, Weihai
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 12
  • [7] DNIV-SLAM: Neural Implicit Visual SLAM in Dynamic Environments
    Yang, Feng
    Wang, Yanbo
    Tan, Liwen
    Li, Mingrui
    Shan, Hongjian
    Pan, Peng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 33 - 47
  • [8] NICER-SLAM: Neural Implicit Scene Encoding for RGB SLAM
    Zhu, Zihan
    Peng, Songyou
    Larsson, Viktor
    Cui, Zhaopeng
    Oswald, Martin R.
    Geiger, Andreas
    Pollefeys, Marc
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 42 - 52
  • [9] DDN-SLAM: Real Time Dense Dynamic Neural Implicit SLAM
    Li, Mingrui
    Guo, Zhetao
    Deng, Tianchen
    Zhou, Yiming
    Ren, Yuxiang
    Wang, Hongyu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4300 - 4307
  • [10] Mee-SLAM: Memory efficient endoscopic RGB SLAM with implicit scene representation
    Zhou, Yuchen
    Li, Teng
    Dai, Yu
    Zhang, Jianxun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269