The inconvenient truth of ground truth errors in automotive datasets and DNN-based detection

Cited by: 0
Authors
Chan, Pak Hung [1]
Li, Boda [1]
Baris, Gabriele [1]
Sadiq, Qasim [1]
Donzella, Valentina [1]
Affiliations
[1] Univ Warwick, WMG, Coventry, England
Source
Funding
Innovate UK project
Keywords
machine learning; automated vehicles; automotive dataset; labeling; annotation
DOI
10.1017/dce.2024.39
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Assisted and automated driving functions will rely on machine learning algorithms, given their ability to cope with real-world variations, e.g. vehicles of different shapes, positions, colors, and so forth. Supervised learning needs annotated datasets, and several automotive datasets are available. However, these datasets are tremendous in volume, and labeling accuracy and quality can vary across different datasets and within dataset frames. Accurate and appropriate ground truth is especially important for automotive applications, as "incomplete" or "incorrect" learning can negatively impact vehicle safety when these neural networks are deployed. This work investigates the ground truth quality of widely adopted automotive datasets, including a detailed analysis of KITTI MoSeg. Based on the errors identified and classified in the annotations of different automotive datasets, this article provides three different criteria collections for producing improved annotations. These criteria are enforceable and applicable to a wide variety of datasets. The three annotation sets are created to (i) remove dubious cases; (ii) annotate to the best of the human visual system; and (iii) remove clearly erroneous bounding boxes (BBs). KITTI MoSeg has been reannotated three times according to the specified criteria, and three state-of-the-art deep neural network object detectors are used to evaluate them. The results clearly show that network performance is affected by ground truth variations, and removing clear errors is beneficial for predicting real-world objects only for some networks. The relabeled datasets still present some cases with "arbitrary"/"controversial" annotations, and therefore, this work concludes with some guidelines related to dataset annotation, metadata/sublabels, and specific automotive use cases.
Pages: 13
Related papers
50 in total
  • [1] Use Errors With Health Care Technologies: An Inconvenient Truth
    Harrington, Linda
    AACN ADVANCED CRITICAL CARE, 2019, 30 (01) : 12 - 15
  • [2] The improvement of ground truth annotation in public datasets for human detection
    Nou, Sotheany
    Lee, Joong-Sun
    Ohyama, Nagaaki
    Obi, Takashi
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [3] Automotive DNN-Based Object Detection in the Presence of Lens Obstruction and Video Compression
    Baris, Gabriele
    Li, Boda
    Chan, Pak Hung
    Avizzano, Carlo Alberto
    Donzella, Valentina
    IEEE ACCESS, 2025, 13 : 36575 - 36589
  • [4] Effect of errors in ground truth on classification accuracy
    Carlotto, Mark J.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2009, 30 (18) : 4831 - 4849
  • [5] A ground truth based vanishing point detection algorithm
    Gallagher, AC
    PATTERN RECOGNITION, 2002, 35 (07) : 1527 - 1543
  • [6] DNN-Based Estimation for Misalignment State of Automotive Radar Sensor
    Kim, Junho
    Jeong, Taewon
    Lee, Seongwook
    SENSORS, 2023, 23 (14)
  • [7] Repairing Confusion and Bias Errors for DNN-Based Image Classifiers
    Tian, Yuchi
    PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20), 2020 : 1699 - 1700
  • [8] DNN-Based Radar Target Detection With OTFS
    Tan, Long
    Yuan, Weijie
    Zhang, Xiaoqi
    Zhang, Kecheng
    Li, Zhongjie
    Li, Yonghui
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 15786 - 15791
  • [9] Attacking DNN-based Intrusion Detection Models
    Zhang, Xingwei
    Zheng, Xiaolong
    Wu, Desheng Dash
    IFAC PAPERSONLINE, 2020, 53 (05) : 415 - 419
  • [10] UnrealGT: Using Unreal Engine to Generate Ground Truth Datasets
    Pollok, Thomas
    Junglas, Lorenz
    Ruf, Boitumelo
    Schumann, Arne
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 670 - 682