Improving the Robustness of Pedestrian Detection in Autonomous Driving With Generative Data Augmentation

被引：4

作者：

Wu, Yalun ^{[1
]}

Xiang, Yingxiao ^{[2
]}

Tong, Endong ^{[1
]}

Ye, Yuqi ^{[1
]}

Cui, Zhibo ^{[1
]}

Tian, Yunzhe ^{[1
]}

Zhang, Lejun ^{[3
]}

Liu, Jiqiang ^{[1
]}

Han, Zhen ^{[1
]}

Niu, Wenjia ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Transp, Beijing 100044, Peoples R China

[2] Inst Informat Engn, Chinese Acad Sci, Beijing 100085, Peoples R China

[3] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Peoples R China

来源：

IEEE NETWORK | 2024年 / 38卷 / 03期

关键词：

Pedestrians; Data augmentation; Data models; Autonomous vehicles; Feature extraction; Semantics; Image capture; pedestrian detection; diffusion model; generative data augmentation; image caption;

D O I：

10.1109/MNET.2024.3366232

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Pedestrian detection plays a crucial role in autonomous driving by identifying the position, size, orientation, and dynamic features of pedestrians in images or videos, assisting autonomous vehicles in making better decisions and controls. It's worth noting that the performance of pedestrian detection models largely depends on the quality and diversity of available training data. Current datasets for autonomous driving have limitations in terms of diversity, scale, and quality. In recent years, numerous studies have proposed the use of data augmentation strategies to expand the coverage of datasets, aiming to maximize the utilization of existing training data. However, these data augmentation methods often overlook the diversity of data scenarios. To overcome this challenge, in this paper, we propose a more comprehensive method for data augmentation, based on image descriptions and diffusion models. This method aims to cover a wider range of scene variations, including different weather conditions and lighting situations. We have designed a classifier to select data samples for augmentation, followed by extracting visual features based on image captions and converting them into high-level semantic information as textual descriptions for the corresponding samples. Finally, we utilize diffusion models to generate new variants. Additionally, we have designed three modification patterns to increase diversity in aspects such as weather conditions, lighting, and pedestrian poses within the data. We conducted extensive experiments on the KITTI dataset and in real-world environments, demonstrating that our proposed method significantly enhances the performance of pedestrian detection models in complex scenarios. This meticulous consideration of data augmentation will notably enhance the applicability and robustness of pedestrian detection models in actual autonomous driving scenarios.

引用

页码：63 / 69

页数：7

共 50 条

[21] Localized Semantic Feature Mixers for Efficient Pedestrian Detection in Autonomous Driving
Khan, Abdul Hannan
Nawaz, Mohammed Shariq
Dengel, Andreas
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5476 - 5485
[22] Generative Data Augmentation using LLMs improves Distributional Robustness in Question Answering
Chowdhury, Arijit Ghosh
Chadha, Aman
PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: STUDENT RESEARCH WORKSHOP, 2024, : 258 - 265
[23] Pedestrian tracking algorithm for autonomous driving
Li Z.-H.
Zhong T.
Zhao Y.-H.
Hu Y.-L.
Li H.-T.
Zhao J.-W.
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2019, 49 (03): : 680 - 687
[24] Evaluation of Point Cloud Data Augmentation for 3D-LiDAR Object Detection in Autonomous Driving
Martins, Marta
Gomes, Iago P.
Wolf, Denis Fernando
Premebida, Cristiano
ROBOT 2023: SIXTH IBERIAN ROBOTICS CONFERENCE ADVANCES IN ROBOTICS, VOL 1, 2024, 976 : 82 - 92
[25] Autonomous Pedestrian Detection
Nkosi, M. P.
Hancke, G. P.
dos Santos, R. M. A.
PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,
[26] A Data-driven Generative Model for GPS Sensors for Autonomous Driving
Karlsson, Erik
Mohammadiha, Nasser
PROCEEDINGS 2018 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON SOFTWARE ENGINEERING FOR AI IN AUTONOMOUS SYSTEMS (SEFAIAS), 2018, : 1 - 5
[27] Image-Level Automatic Data Augmentation for Pedestrian Detection
Ma, Yunfeng
Liu, Min
Tang, Yi
Wang, Xueping
Wang, Yaonan
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 12
[28] Improving the Robustness of Synthetic Images Detection by Means of Print and Scan Augmentation
Purnekar, Nischay
Abady, Lydia
Tondi, Benedetta
Barni, Mauro
PROCEEDINGS OF THE 2024 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2024, 2024, : 65 - 73
[29] Deep Neural Network Based Vehicle and Pedestrian Detection for Autonomous Driving: A Survey
Chen, Long
Lin, Shaobo
Lu, Xiankai
Cao, Dongpu
Wu, Hangbin
Guo, Chi
Liu, Chun
Wang, Fei-Yue
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3234 - 3246
[30] Towards Safe Autonomous Driving: Challenges of Pedestrian Detection in Rain with Automotive Radar
Steinhauser, Dagmar
Held, Patrick
Thoeresz, Bernhard
Brandmeier, Thomas
EURAD 2020 THE 17TH EUROPEAN RADAR CONFERENCE, 2021,

← 1 2 3 4 5 →