HI-SLAM: Hierarchical implicit neural representation for SLAM

被引:0
|
作者
Li, Jingbo [1 ]
Firkat, Eksan [4 ,5 ]
Zhu, Jingyu [3 ]
Zhu, Bin [2 ]
Zhu, Jihong [3 ]
Hamdulla, Askar [1 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, 666 Shengli Rd, Urumqi, Xinjiang, Peoples R China
[2] Tsinghua Univ, Dept Automat, 33 Shuangqing Rd, Beijing, Peoples R China
[3] Tsinghua Univ, Dept Precis Instrument, 33 Shuangqing Rd, Beijing, Peoples R China
[4] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[5] Great Bay Univ, Dongguan, Guangdong, Peoples R China
关键词
Dense visual SLAM; Neural implicit representations; Localization; RGB-D camera; FEATURE FUSION; VERSATILE;
D O I
10.1016/j.eswa.2025.126487
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Implicit neural representation can improve the expressive ability and performance of the model by learning the representation of high-dimensional feature space and has a wide range of applications in many fields and an exciting performance. Dense visual SLAM is one of the beneficiaries of the development of implicit neural representations. Still, the current methods are based on simple fully connected network architectures, resulting in poor generalization ability, insufficient real-time performance and inability to balance global and local optimization. This paper propose a hierarchical scene representation that treats color information and geometric information as equally important, one that encodes geometric and color information into different resolution grid sizes and combines multiple corresponding multi-layer perceptron decoders. The coarse-level grid captures the general shape and structure of the global scene and makes reasonable predictions for unobserved regions.In contrast, the medium-fine-level grid finely represents geometric details and color information. Rich and comprehensive high-fidelity reconstructions can be obtained in large-scale scenes by using meshes of different resolutions to encode geometric and color information. In this study, selectable keyframes are used to ensure that the local information of the scene is optimized while reducing redundant information preservation. Compared with recent dense visual SLAM systems via implicit neural representations, our method generalizes and operates more robustly, efficiently, and precisely in large-scale scenes.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Visual-LIDAR SLAM Based on Supervised Hierarchical Deep Neural Networks
    An, Yi
    Sun, Zhuo
    Zhang, Chao
    Yue, Haifeng
    Zhi, Yan
    Xu, Hongliang
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 1378 - 1385
  • [32] Point-SLAM: Dense Neural Point Cloud-based SLAM
    Sandstrom, Erik
    Li, Yue
    Van Gool, Luc
    Oswald, Martin R.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18387 - 18398
  • [33] DHDP-SLAM: Dynamic Hierarchical Dirichlet Process based data association for semantic SLAM
    Zhao, Yifan
    Wang, Changhong
    Ouyang, Yifan
    Zhong, Jiapeng
    Li, Yuanwei
    Zhao, Nannan
    DISPLAYS, 2025, 86
  • [34] Hierarchical Segment-based Optimization for SLAM
    Tian, Yuxin
    Wang, Yujie
    Ouyang, Ming
    Shi, Xuesong
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 6573 - 6580
  • [35] Efficient Map Fusion for Multiple Implicit SLAM Agents
    Liu, Shaofan
    Zhu, Jianke
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 852 - 865
  • [36] Learning Deep Representation for Place Recognition in SLAM
    Mukherjee, Aritra
    Chakraborty, Satyaki
    Saha, Sanjoy Kumar
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 557 - 564
  • [37] Quaternion Representation for Similarity Transformations in Visual SLAM
    Kyrki, Ville
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 2498 - 2503
  • [38] A Hybrid SLAM Representation for Dynamic Marine Environments
    Bibby, Charles
    Reid, Ian
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 257 - 264
  • [39] The M-space feature representation for SLAM
    Folkesson, John
    Jensfelt, Patric
    Christensen, Henrik I.
    IEEE TRANSACTIONS ON ROBOTICS, 2007, 23 (05) : 1024 - 1035
  • [40] NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields
    Rosinol, Antoni
    Leonard, John J.
    Carlone, Luca
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 3437 - 3444