MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

Cited by: 0
Authors
Qiao, Junchao [1]
Liu, Biao [1]
Yang, Jiaqi [1]
Wang, Baohua [1]
Xiu, Sanmu [1]
Du, Xin [1]
Nie, Xiaobo [1]
Affiliations
[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China
Source
Funding
National Natural Science Foundation of China
Keywords
Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION;
DOI
10.1109/LRA.2024.3414272
Chinese Library Classification (CLC) number
TP24 [Robotics]
Subject classification codes
080202; 1405
Abstract
In the context of autonomous driving, it is both critical and challenging to locate 3D objects using a single calibrated RGB image. Current methods typically use a heteroscedastic aleatoric uncertainty loss to regress object depth, which reduces the impact of noisy input while keeping depth predictions reliable. However, experiments reveal that the uncertainty loss can also lead to serious overfitting and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes occlusion relationships into account and applies strict restrictions to ensure the verisimilitude of the augmented scenes. Furthermore, MonoSample avoids the complex conversion process between 2D and 3D, which enables the extraction of a large number of samples and efficient operation. Experiments on different models have verified its effectiveness. Leveraging MonoSample in DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.
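For readers unfamiliar with the loss the abstract refers to, the following is a minimal sketch of one commonly used form of heteroscedastic aleatoric uncertainty loss for depth regression (a Laplacian likelihood with a predicted log-scale). The abstract does not state the exact formulation used in the paper, so the function name, the log-sigma parameterization, and the sample values below are illustrative assumptions only.

```python
import math

def laplacian_uncertainty_loss(pred_depth, log_sigma, gt_depth):
    """Hypothetical helper: Laplacian aleatoric uncertainty loss for one object.

    loss = sqrt(2) / sigma * |pred - gt| + log(sigma), with sigma = exp(log_sigma).
    The network predicts log_sigma alongside the depth (assumed parameterization).
    """
    sigma = math.exp(log_sigma)
    return math.sqrt(2.0) / sigma * abs(pred_depth - gt_depth) + log_sigma

# Same 2 m depth error, evaluated under two different predicted uncertainties.
confident = laplacian_uncertainty_loss(20.0, 0.0, 22.0)  # sigma = 1
uncertain = laplacian_uncertainty_loss(20.0, 1.0, 22.0)  # sigma = e
print(confident, uncertain)
```

Under this form, a larger predicted uncertainty down-weights the error term at the cost of the log-sigma penalty, so a model can lower its loss on hard samples by raising sigma rather than improving the depth estimate.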
Pages: 7326-7332
Page count: 7