ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates

被引:5
|
作者
Tao, Manli [1 ,2 ]
Zhao, Chaoyang [1 ,3 ]
Wang, Jinqiao [1 ,2 ,3 ]
Tang, Ming [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] ObjectEye Inc, Beijing 100000, Peoples R China
关键词
Three-dimensional displays; Proposals; Object detection; Feature extraction; Point cloud compression; Aggregates; Sun; 3D object detection; image candidates; pseudo 3D proposal; target missing; NETWORK;
D O I
10.1109/LSP.2023.3336569
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-modal fusion methods combine the advantages of both point clouds and RGB images to boost the performance of 3D object detection. Despite the significant progress, we find that existing two-stage multi-modal fusion methods suffer from the 3D proposal missing in the first stage and projected-style feature fusion mechanism. To solve these problems, we propose a two-stage multi-modal feature fusion network, which improves the recall rate of hard targets in the first stage of network with pseudo 3D proposals generated from image candidates. Then, considering the complementary information between similar image foreground features across multiple objects, we design a multi-modal cross-target fusion module to pay more attention to the foreground objects. It enables a 3D proposal can aggregate the semantic features of multiple image candidates belonging to the same category. Finally, these enhanced fused proposals are processed in the second stage to further boost the performance of 3D detector. Experimental results on SUN RGB-D and KITTI datasets show the effectiveness of our proposed method.
引用
收藏
页码:241 / 245
页数:5
相关论文
共 50 条
  • [31] 3D Soma Detection in Large-Scale Whole Brain Images via a Two-Stage Neural Network
    Wei, Xiaodan
    Liu, Qinghao
    Liu, Min
    Wang, Yaonan
    Meijering, Erik
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (01) : 148 - 157
  • [32] Lightweight two-stage transformer for low-light image enhancement and object detection
    Kou, Kangkang
    Yin, Xiangchen
    Gao, Xin
    Nie, Fuhui
    Liu, Jing
    Zhang, Guoying
    DIGITAL SIGNAL PROCESSING, 2024, 150
  • [33] See-Then-Grasp: Object Full 3D Reconstruction via Two-Stage Active Robotic Reconstruction Using Single Manipulator
    Hong, Youngtaek
    Kim, Jonghyeon
    Cha, Geonho
    Kim, Eunwoo
    Lee, Kyungjae
    APPLIED SCIENCES-BASEL, 2025, 15 (01):
  • [34] BirdNet plus : Two-Stage 3D Object Detection in LiDAR Through a Sparsity-Invariant Bird's Eye View
    Barrera, Alejandro
    Beltran, Jorge
    Guindel, Carlos
    Iglesias, Jose Antonio
    Garcia, Fernando
    IEEE ACCESS, 2021, 9 : 160299 - 160316
  • [35] Two-stage Co-salient Object Detection
    Wang, Zuyi
    Zhang, Lihe
    2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 287 - 290
  • [36] Two-Stage Unattended Object Detection Method with Proposals
    Nam Trung Pham
    Leman, Karianto
    Zhang, Jie
    Pek, Isaac
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 1 - 4
  • [37] Two-Stage Approach to Small-Object Detection
    Yu, Mingrui
    Leung, Henry
    SYSTEMS ENGINEERING, 2025,
  • [38] Boosting Monocular 3D Object Detection With Object-Centric Auxiliary Depth Supervision
    Kim, Youngseok
    Kim, Sanmin
    Sim, Sangmin
    Choi, Jun Won
    Kum, Dongsuk
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) : 1801 - 1813
  • [39] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation
    Liu, Zechen
    Wu, Zizhang
    Toth, Roland
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4289 - 4298
  • [40] Two-stage dynamic deformation for construction of 3D models
    Chen, SW
    Stockman, G
    Dai, CY
    Chuang, CP
    GRAPHICAL MODELS AND IMAGE PROCESSING, 1996, 58 (05): : 484 - 493