ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates

Cited by: 5
Authors
Tao, Manli [1, 2]
Zhao, Chaoyang [1, 3]
Wang, Jinqiao [1, 2, 3]
Tang, Ming [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] ObjectEye Inc, Beijing 100000, Peoples R China
Keywords
Three-dimensional displays; Proposals; Object detection; Feature extraction; Point cloud compression; Aggregates; Sun; 3D object detection; image candidates; pseudo 3D proposal; target missing; NETWORK;
DOI
10.1109/LSP.2023.3336569
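Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809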
Abstract
Multi-modal fusion methods combine the advantages of point clouds and RGB images to boost the performance of 3D object detection. Despite significant progress, we find that existing two-stage multi-modal fusion methods suffer from missing 3D proposals in the first stage and from a projection-style feature fusion mechanism. To solve these problems, we propose a two-stage multi-modal feature fusion network that improves the recall of hard targets in the first stage by adding pseudo 3D proposals generated from image candidates. Then, considering the complementary information among similar image foreground features across multiple objects, we design a multi-modal cross-target fusion module that pays more attention to foreground objects. It enables a 3D proposal to aggregate the semantic features of multiple image candidates belonging to the same category. Finally, the enhanced fused proposals are processed in the second stage to further boost the performance of the 3D detector. Experimental results on the SUN RGB-D and KITTI datasets show the effectiveness of the proposed method.
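The idea of a 3D proposal aggregating the semantic features of same-category image candidates can be illustrated with a minimal sketch. This record does not describe the module's internals, so everything below is an assumption made for illustration: the function name `cross_target_fuse`, the scaled dot-product softmax weighting, and the residual addition are hypothetical choices, not the authors' implementation.

```python
import math


def cross_target_fuse(proposal_feat, proposal_class, cand_feats, cand_classes):
    """Hypothetical sketch: enrich one 3D proposal feature with image
    candidate features that share its predicted category.

    proposal_feat: list of floats (length d)
    cand_feats:    list of feature lists (each length d)
    cand_classes:  list of category labels, one per candidate
    """
    # Keep only image candidates of the same category as the proposal.
    same = [f for f, c in zip(cand_feats, cand_classes) if c == proposal_class]
    if not same:
        return list(proposal_feat)  # nothing to aggregate

    # Assumed similarity: scaled dot product between proposal and candidate.
    scale = math.sqrt(len(proposal_feat))
    scores = [sum(p * x for p, x in zip(proposal_feat, f)) / scale for f in same]

    # Numerically stable softmax over the same-category candidates.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Residual fusion: proposal feature plus the weighted candidate features.
    fused = list(proposal_feat)
    for w, f in zip(weights, same):
        for i, x in enumerate(f):
            fused[i] += w * x
    return fused


fused = cross_target_fuse(
    [1.0, 0.0], 0,
    [[2.0, 0.0], [0.0, 2.0], [5.0, 5.0]],
    [0, 0, 1],
)
```

In this toy call the third candidate belongs to a different category and is ignored, so only the first two candidates contribute to the fused feature.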
Pages: 241-245
Number of pages: 5