Leveraging front and side cues for occlusion handling in monocular 3D object detection

Authors
Yuying Song
Zecheng Li
Jingxuan Wu
Chunyi Song
Zhiwei Xu
Affiliations
[1] Ocean College, The Institute of Marine Electronic and Intelligent System
[2] Zhejiang University
[3] The Engineering Research Center of Oceanic Sensing Technology and Equipment
[4] Ministry of Education
[5] The Donghai Laboratory
Source
The Visual Computer | 2024 / Vol. 40
Keywords
Monocular object detection; Occlusion handling; Compositional model; Uncertainty; Attention mechanism; Autonomous driving
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
3D object detection, an essential part of perception, plays a principal role in autonomous driving systems. Cost-competitive monocular 3D object detection has drawn increasing attention recently. However, it still suffers from inferior accuracy, especially for occluded objects, due to the limited camera view. Inspired by compositional models, in which an object is represented as a combination of multiple components, this paper proposes a new monocular 3D object detection method that reduces the impact of occlusion by utilizing an object’s front and side cues. To do this, features are extracted from a decoupled front and side representation and then fused by an attention-based module to obtain a more consistent feature distribution. An uncertainty-guided, geometry-based depth ensemble is further applied to refine the depth prediction. Experimental results demonstrate that, compared with conventional methods, the proposed method significantly improves detection performance for occluded objects while still satisfying real-time efficiency, with the Average Precision on 40 recall positions (AP40) increasing by 10.23% for partly occluded objects and 12.22% for mostly occluded objects on the KITTI benchmark. The code is released at https://github.com/kagurua/Front-Side-Det
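The abstract does not give the formula behind the uncertainty-guided depth ensemble. As an illustration only, the sketch below assumes a common choice, inverse-variance weighting, in which each depth candidate is weighted by the reciprocal of its squared predicted uncertainty. The function name `fuse_depths` and its parameters are hypothetical and are not taken from the paper or its repository.

```python
from typing import Sequence


def fuse_depths(depths: Sequence[float], sigmas: Sequence[float]) -> float:
    """Combine depth candidates with inverse-variance weights.

    ASSUMPTION: the weighting scheme is illustrative; the paper's actual
    ensemble rule is not stated in the abstract.
    depths: candidate depths in metres (e.g. from different geometric cues).
    sigmas: predicted standard deviation (uncertainty) of each candidate.
    """
    if len(depths) == 0 or len(depths) != len(sigmas):
        raise ValueError("depths and sigmas must be non-empty and equally long")
    if any(s <= 0 for s in sigmas):
        raise ValueError("every uncertainty must be strictly positive")

    # A smaller uncertainty yields a larger weight.
    weights = [1.0 / (s * s) for s in sigmas]
    total = sum(weights)

    # Weighted mean of the candidates.
    return sum(w * d for w, d in zip(weights, depths)) / total


if __name__ == "__main__":
    # The second candidate is twice as uncertain, so it counts a quarter as much.
    print(fuse_depths([10.0, 20.0], [1.0, 2.0]))
```

With equal uncertainties the result reduces to the plain mean, and a candidate with a very large uncertainty, as might be predicted for a heavily occluded cue, contributes almost nothing to the fused depth.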
Pages: 1757-1773
Number of pages: 16