MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

被引:0
|
作者
Qiao, Junchao [1 ]
Liu, Biao [1 ]
Yang, Jiaqi [1 ]
Wang, Baohua [1 ]
Xiu, Sanmu [1 ]
Du, Xin [1 ]
Nie, Xiaobo [1 ]
机构
[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION;
D O I
10.1109/LRA.2024.3414272
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the context of autonomous driving, it is both critical and challenging to locate 3D objects by using a calibrated RGB image. Current methods typically utilize heteroscedastic aleatoric uncertainty loss to regress the depth of objects, thereby reducing the impact of noisy input while also ensuring the reliability of depth predictions. However, experimentation reveals that uncertainty loss can also lead to serious overfitting issue and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes into account the occlusion relationships and applies strict restrictions to ensure the verisimilitude of the enhanced scenes. Furthermore, MonoSample avoids the complex conversion process between 2D and 3D, thereby enabling the extraction of a large number of samples and efficient operation. Experiments on different models have verified its effectiveness. Leveraging MonoSample in DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.
引用
收藏
页码:7326 / 7332
页数:7
相关论文
共 50 条
  • [1] GRAMO: geometric resampling augmentation for monocular 3D object detection
    Guan, He
    Song, Chunfeng
    Zhang, Zhaoxiang
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (05)
  • [2] GRAMO: geometric resampling augmentation for monocular 3D object detection
    He Guan
    Chunfeng Song
    Zhaoxiang Zhang
    Frontiers of Computer Science, 2024, 18
  • [3] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [4] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
  • [5] Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver
    Liu, Xianpeng
    Zheng, Ce
    Cheng, Kelvin
    Xue, Nan
    Qi, Guo-Jun
    Wu, Tianfu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6413 - 6423
  • [6] Monocular 3D Object Detection for Autonomous Driving
    Chen, Xiaozhi
    Kundu, Kaustav
    Zhang, Ziyu
    Ma, Huimin
    Fidler, Sanja
    Urtasun, Raquel
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2147 - 2156
  • [7] Dimension Embeddings for Monocular 3D Object Detection
    Zhang, Yunpeng
    Zheng, Wenzhao
    Zhu, Zheng
    Huang, Guan
    Du, Dalong
    Zhou, Jie
    Lu, Jiwen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1579 - 1588
  • [8] Multivariate Probabilistic Monocular 3D Object Detection
    Shi, Xuepeng
    Chen, Zhixiang
    Kim, Tae-Kyun
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4270 - 4279
  • [9] Uncertainty Prediction for Monocular 3D Object Detection
    Mun, Junghwan
    Choi, Hyukdoo
    SENSORS, 2023, 23 (12)
  • [10] Monocular 3D object detection for distant objects
    Li, Jiahao
    Han, Xiaohong
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33021