ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates

被引:5
|
作者
Tao, Manli [1 ,2 ]
Zhao, Chaoyang [1 ,3 ]
Wang, Jinqiao [1 ,2 ,3 ]
Tang, Ming [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] ObjectEye Inc, Beijing 100000, Peoples R China
关键词
Three-dimensional displays; Proposals; Object detection; Feature extraction; Point cloud compression; Aggregates; Sun; 3D object detection; image candidates; pseudo 3D proposal; target missing; NETWORK;
D O I
10.1109/LSP.2023.3336569
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-modal fusion methods combine the advantages of both point clouds and RGB images to boost the performance of 3D object detection. Despite the significant progress, we find that existing two-stage multi-modal fusion methods suffer from the 3D proposal missing in the first stage and projected-style feature fusion mechanism. To solve these problems, we propose a two-stage multi-modal feature fusion network, which improves the recall rate of hard targets in the first stage of network with pseudo 3D proposals generated from image candidates. Then, considering the complementary information between similar image foreground features across multiple objects, we design a multi-modal cross-target fusion module to pay more attention to the foreground objects. It enables a 3D proposal can aggregate the semantic features of multiple image candidates belonging to the same category. Finally, these enhanced fused proposals are processed in the second stage to further boost the performance of 3D detector. Experimental results on SUN RGB-D and KITTI datasets show the effectiveness of our proposed method.
引用
收藏
页码:241 / 245
页数:5
相关论文
共 50 条
  • [41] Efficient 3D Correspondence Grouping by Two-Stage Filtering
    Lu, Rongrong
    Zhu, Feng
    Wu, Qingxiao
    Kong, Yanzi
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [42] Paint and Distill: Boosting 3D Object Detection with Semantic Passing Network
    Ju, Bo
    Zou, Zhikang
    Ye, Xiaoqing
    Jiang, Minyue
    Tan, Xiao
    Ding, Errui
    Wang, Jingdong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5639 - 5648
  • [43] Boosting Lidar 3D Object Detection with Point Cloud Semantic Segmentation
    Zhang, Xuchong
    Min, Chong
    Jia, Yijie
    Chen, Liming
    Zhang, Jingmin
    Sun, Hongbin
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7614 - 7621
  • [44] Two-Stage Lesion Detection Approach Based on Dimension-Decomposition and 3D Context
    Jiacheng Jiao
    Haiwei Pan
    Chunling Chen
    Tao Jin
    Yang Dong
    Jingyi Chen
    TsinghuaScienceandTechnology, 2022, 27 (01) : 103 - 113
  • [45] Two-Stage Lesion Detection Approach Based on Dimension-Decomposition and 3D Context
    Jiao, Jiacheng
    Pan, Haiwei
    Chen, Chunling
    Jin, Tao
    Dong, Yang
    Chen, Jingyi
    TSINGHUA SCIENCE AND TECHNOLOGY, 2022, 27 (01) : 103 - 113
  • [46] 3D object detection based on synthetic RGB image
    Xu C.
    Li Z.
    Jiang D.
    Yun J.
    Liu Y.
    Liu Y.
    Bai D.
    Ying S.
    International Journal of Wireless and Mobile Computing, 2021, 20 (01): : 70 - 76
  • [47] PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
    Huang, Siyuan
    Chen, Yixin
    Yuan, Tao
    Qi, Siyuan
    Zhu, Yixin
    Zhu, Song-Chun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [48] Two-Stage RGB-Based Action Detection Using Augmented 3D Poses
    Papadopoulos, Konstantinos
    Ghorbel, Enjie
    Baptista, Renato
    Aouada, Djamila
    Ottersten, Bjoern
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT I, 2019, 11678 : 26 - 35
  • [49] 3D object tracking via image sets and depth-based occlusion detection
    Chen, Yan
    Shen, Yingju
    Liu, Xin
    Zhong, Bineng
    SIGNAL PROCESSING, 2015, 112 : 146 - 153
  • [50] Faster 3D Object Detection in RGB-D Image Using 3D Selective Search and Object Pruning
    Liu, Jiang
    Chen, Hongliang
    Li, Jianxun
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 4862 - 4866