Image-to-Point Registration via Cross-Modality Correspondence Retrieval

被引：0

作者：

Bie, Lin ^{[1
]}

Li, Siqi ^{[1
]}

Cheng, Kai ^{[2
]}

机构：

[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China

[2] Army Engn Univ, Command Control Coll, Nanjing, Peoples R China

来源：

PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024 | 2024年

关键词：

Image-to-Point Cloud registration; cross-modality correspondence retrieval; frustum point retrieval; combined correspondence retrieval; virtual point cloud;

D O I：

10.1145/3652583.3658074

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image-to-Point Cloud registration between 2D images and 3D LiDAR point clouds is a significant task in computer vision. The traditional registration pipeline first establishes correspondences between images and point clouds and then performs pose estimation based on the generated matches. However, 2D-3D correspondences are inherently difficult to be established due to the large modality gap between images and LiDAR point clouds. To this end, we build a bridge to alleviate the 2D-3D modality gap, which aligns LiDAR point clouds to the virtual points generated by images. In this way, the modality gap can be alleviated to the domain gap of different types of point clouds, i.e. original point clouds and virtual point clouds. Concretely, our framework conducts feature fusion from the LiDAR and virtual point cloud by utilizing the Transformer layer. To relieve the domain gap, a frustum points retrieval module and a combined correspondences retrieval module are proposed based on the consistency of the feature and position descriptor to select the correct correspondences among the candidates, which are generated from the simultaneous retrieval of features and position descriptors. In the implementation procedure, we design a frustum retrieval loss and a combined correspondence retrieval loss for cross-modality correspondence retrieval. Experimental results and comparison with state-of-the-art Image-to-Point Cloud methods on KITTI and nuScenes datasets demonstrate our proposed method has achieved superior performance.

引用

页码：266 / 274

页数：9

共 50 条

[1] Boosting Cross-Modality Image Registration
Barbu, Adrian
Ionasec, Razvan
2009 JOINT URBAN REMOTE SENSING EVENT, VOLS 1-3, 2009, : 89 - +
[2] CorrI2P: Deep Image-to-Point Cloud Registration via Dense Correspondence
Ren, Siyu
Zeng, Yiming
Hou, Junhui
Chen, Xiaodong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1198 - 1208
[3] Correlative techniques for cross-modality medical image registration
Richardson, DB
Bury, EA
MEDICAL IMAGING 1996: IMAGE PROCESSING, 1996, 2710 : 368 - 375
[4] Cross-Modality Medical Image Retrieval with Deep Features
Mbilinyi, Ashery
Schuldt, Heiko
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2632 - 2639
[5] Cross-Modality Personalization for Retrieval
Murrugarra-Llerena, Nils
Kovashka, Adriana
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6422 - 6431
[6] Quantity-Aware Coarse-to-Fine Correspondence for Image-to-Point Cloud Registration
Yao, Gongxin
Xuan, Yixin
Chen, Yiwei
Pan, Yu
IEEE SENSORS JOURNAL, 2024, 24 (20) : 33826 - 33837
[7] Accurate Registration of Cross-Modality Geometry via Consistent Clustering
Zhao, Mingyang
Huang, Xiaoshui
Jiang, Jingen
Mou, Luntian
Yan, Dong-Ming
Ma, Lei
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 4055 - 4067
[8] CROSS-MODALITY HASHING WITH PARTIAL CORRESPONDENCE
Gu, Yun
Xue, Haoyang
Yang, Jie
Shi, Pengfei
2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1925 - 1929
[9] Cross-Modality Image Registration Using a Training-Time Privileged Third Modality
Yang, Qianye
Atkinson, David
Fu, Yunguan
Syer, Tom
Yan, Wen
Punwani, Shonit
Clarkson, Matthew J.
Barratt, Dean C.
Vercauteren, Tom
Hu, Yipeng
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (11) : 3421 - 3431
[10] A metric for testing the accuracy of cross-modality image registration: Validation and application
Black, KJ
Videen, TO
Perlmutter, JS
JOURNAL OF COMPUTER ASSISTED TOMOGRAPHY, 1996, 20 (05) : 855 - 861

← 1 2 3 4 5 →