Image-to-Point Registration via Cross-Modality Correspondence Retrieval

Cited by: 0
Authors
Bie, Lin [1]
Li, Siqi [1]
Cheng, Kai [2]
Affiliations
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Army Engn Univ, Command Control Coll, Nanjing, Peoples R China
Keywords
Image-to-Point Cloud registration; cross-modality correspondence retrieval; frustum point retrieval; combined correspondence retrieval; virtual point cloud;
DOI
10.1145/3652583.3658074
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Image-to-Point Cloud registration between 2D images and 3D LiDAR point clouds is an important task in computer vision. The traditional registration pipeline first establishes correspondences between images and point clouds and then estimates the pose from the resulting matches. However, 2D-3D correspondences are inherently difficult to establish because of the large modality gap between images and LiDAR point clouds. To address this, we bridge the 2D-3D modality gap by aligning LiDAR point clouds to virtual points generated from images. In this way, the modality gap is reduced to a domain gap between two types of point clouds, i.e., original point clouds and virtual point clouds. Concretely, our framework fuses features from the LiDAR and virtual point clouds using a Transformer layer. To reduce the domain gap, we propose a frustum point retrieval module and a combined correspondence retrieval module. Candidate correspondences are generated by simultaneously retrieving features and position descriptors, and the correct correspondences are selected from these candidates based on the consistency of the feature and position descriptors. In the implementation, we design a frustum retrieval loss and a combined correspondence retrieval loss for cross-modality correspondence retrieval. Experimental results and comparisons with state-of-the-art Image-to-Point Cloud methods on the KITTI and nuScenes datasets show that the proposed method achieves superior performance.
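The idea of keeping only candidates on which feature retrieval and position-descriptor retrieval agree can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`cosine`, `top_k`, `consistent_matches`), the use of cosine similarity, the top-k rule, and the tiny descriptors are all assumptions made purely for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def top_k(query, candidates, k):
    """Indices of the k candidates most similar to the query."""
    ranked = sorted(
        range(len(candidates)),
        key=lambda j: cosine(query, candidates[j]),
        reverse=True,
    )
    return ranked[:k]


def consistent_matches(lidar_feat, lidar_pos, virt_feat, virt_pos, k=1):
    """Keep (i, j) only when virtual point j is retrieved for LiDAR point i
    by BOTH the feature and the position descriptor (hypothetical rule)."""
    matches = []
    for i in range(len(lidar_feat)):
        by_feat = top_k(lidar_feat[i], virt_feat, k)
        by_pos = set(top_k(lidar_pos[i], virt_pos, k))
        # Walk the feature ranking in order so the best agreed candidate wins.
        agreed = [j for j in by_feat if j in by_pos]
        if agreed:
            matches.append((i, agreed[0]))
    return matches


# Toy descriptors (made up): three LiDAR points, three virtual points.
lidar_feat = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.2]]
virt_feat = [[0.9, 0.1], [0.1, 0.9], [-1.0, -1.0]]
lidar_pos = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
virt_pos = [[1.0, 0.1, 0.0], [0.0, 1.0, 0.1], [0.0, 0.0, 1.0]]

matches = consistent_matches(lidar_feat, lidar_pos, virt_feat, virt_pos, k=1)
print(matches)
```

In this toy data the third LiDAR point is dropped, because its feature retrieves virtual point 0 while its position descriptor retrieves virtual point 2. In a full pipeline, the surviving matches would then be passed to a pose estimation step.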
Pages: 266-274
Number of pages: 9