PI-RCNN: An Efficient Multi-Sensor 3D Object Detector with Point-Based Attentive Cont-Conv Fusion Module

被引：0

作者：

Xie, Liang ^{[1
,2
]}

Xiang, Chao ^{[1
]}

Yu, Zhengxu ^{[1
]}

Xu, Guodong ^{[1
,2
]}

Yang, Zheng ^{[2
]}

Cai, Deng ^{[1
,3
]}

He, Xiaofei ^{[1
,2
]}

机构：

[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou, Peoples R China

[2] Fabu Inc, Hangzhou, Peoples R China

[3] Alibaba Zhejiang Univ Joint Inst Frontier Technol, Hangzhou, Peoples R China

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

LIDAR point clouds and RGB-images are both extremely essential for 3D object detection. So many state-of-the-art 3D detection algorithms dedicate in fusing these two types of data effectively. However, their fusion methods based on Bird's Eye View (BEV) or voxel format are not accurate. In this paper, we propose a novel fusion approach named Point-based Attentive Cont-conv Fusion(PACF) module, which fuses multi-sensor features directly on 3D points. Except for continuous convolution, we additionally add a Point-Pooling and an Attentive Aggregation to make the fused features more expressive. Moreover, based on the PACF module, we propose a 3D multi-sensor multi-task network called Pointcloud-Image RCNN(PI-RCNN as brief), which handles the image segmentation and 3D object detection tasks. PI-RCNN employs a segmentation sub-network to extract full-resolution semantic feature maps from images and then fuses the multi-sensor features via powerful PACF module. Beneficial from the effectiveness of the PACF module and the expressive semantic features from the segmentation module, PI-RCNN can improve much in 3D object detection. We demonstrate the effectiveness of the PACF module and PI-RCNN on the KITTI 3D Detection benchmark, and our method can achieve state-of-the-art on the metric of 3D AP.

引用

页码：12460 / 12467

页数：8

共 43 条

[31] MMAF-Net: Multi-view multi-stage adaptive fusion for multi-sensor 3D object detection
Zhang, Wensheng
Shi, Hongli
Zhao, Yunche
Feng, Zhenan
Lovreglio, Ruggiero
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 242
[32] Multiscale 3D Documentation of the Medieval Wall of Jaen (Spain) Based on Multi-Sensor Data Fusion
Perez-Garcia, Jose Luis
Mozas-Calvache, Antonio Tomas
Gomez-Lopez, Jose Miguel
Vico-Garcia, Diego
HERITAGE, 2023, 6 (08): : 5952 - 5966
[33] Mobile robot 3D map building and path planning based on multi-sensor data fusion
Yan, Fei
Zhuang, Yan
Wang, Wei
INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2012, 44 (04) : 276 - 283
[34] Research on Refined 3D Attitude Model of Smart Construction Machinery Based on Multi-sensor Fusion
Zheng, Yanning
Wang, Shengli
Liu, Yang
Xu, Ying
Li, Xu
Chen, Guiping
CHINA SATELLITE NAVIGATION CONFERENCE (CSNC) 2018 PROCEEDINGS, VOL I, 2018, 497 : 117 - 127
[35] Space and Time Registration of Vehicle-Borne 3D Acquisition System Based on Multi-sensor Fusion
Zhu, Linlin
Hu, Shaoxing
EPLWW3S 2011: 2011 INTERNATIONAL CONFERENCE ON ECOLOGICAL PROTECTION OF LAKES-WETLANDS-WATERSHED AND APPLICATION OF 3S TECHNOLOGY, VOL 3, 2011, : 548 - 551
[36] Multi-sensor fusion method based on FFR-FK for 3D trajectory measurement of underground pipelines
Li, Pingfei
Wang, Lu
Zu, Yutong
Bai, Xuesong
Hu, Yuanbiao
TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2023, 141
[37] Generating Adversarial Point Clouds on Multi-modal Fusion Based 3D Object Detection Model
Wang, Huiying
Shen, Huixin
Zhang, Boyang
Wen, Yu
Meng, Dan
INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT I, 2021, 12918 : 187 - 203
[38] AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion
Li, Guangping
Mo, Zuanfang
Ling, Bingo Wing-Kuen
SENSORS, 2023, 23 (23)
[39] VPC-VoxelNet: multi-modal fusion 3D object detection networks based on virtual point clouds
Zhang, Qiang
Shi, Qin
Cheng, Teng
Zhang, Junning
Chen, Jiong
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2025, 14 (01)
[40] PV-SSD: A Multi-Modal Point Cloud 3D Object Detector Based on Projection Features and Voxel Features
Shao, Yongxin
Tan, Aihong
Sun, Zhetao
Zheng, Enhui
Yan, Tianhong
Liao, Peng
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (05): : 3436 - 3449

← 1 2 3 4 5 →