GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

被引：0

作者：

Liu, Yunxiang ^{[1
]}

Ma, Haili ^{[1
]}

Zhu, Jianlin ^{[1
]}

Zhang, Qiangbo ^{[1
]}

机构：

[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期

关键词：

Autonomous driving; multitask learning; drivable area segmentation; lane detection; vehicle detection;

D O I：

10.32604/cmc.2024.053710

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To enhance the efficiency and accuracy of environmental perception for autonomous vehicles, we propose GDMNet, a unified multi-task perception network for autonomous driving, capable of performing drivable area segmentation, lane detection, and traffic object detection. Firstly, in the encoding stage, features are extracted, and Generalized Efficient Layer Aggregation Network (GELAN) is utilized to enhance feature extraction and gradient flow. Secondly, in the decoding stage, specialized detection heads are designed; the drivable area segmentation head employs DySample to expand feature maps, the lane detection head merges early-stage features and processes the output through the Focal Modulation Network (FMN). Lastly, the Minimum Point Distance IoU (MPDIoU) loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes, facilitating model training adjustments. Experimental results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union (mIoU) of 92.2%, lane detection accuracy and intersection over union (IoU) of 75.3% and 26.4%, respectively, and traffic object detection recall and mAP of 89.7% and 78.2%, respectively. The detection performance surpasses that of other single-task or multi-task algorithm models.

引用

页码：2963 / 2978

页数：16

共 50 条

[41] Statistically correlated multi-task learning for autonomous driving
Waseem Abbas
Muhammad Fakhir Khan
Murtaza Taj
Arif Mahmood
Neural Computing and Applications, 2021, 33 : 12921 - 12938
[42] Statistically correlated multi-task learning for autonomous driving
Abbas, Waseem
Khan, Muhammad Fakhir
Taj, Murtaza
Mahmood, Arif
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (19): : 12921 - 12938
[43] Optimal Configuration of Multi-Task Learning for Autonomous Driving
Jun, Woomin
Son, Minjun
Yoo, Jisang
Lee, Sungjin
SENSORS, 2023, 23 (24)
[44] A Unified Neural Network for Panoptic Segmentation
Yao, L.
Chyau, A.
COMPUTER GRAPHICS FORUM, 2019, 38 (07) : 461 - 468
[45] UPSNet: A Unified Panoptic Segmentation Network
Xiong, Yuwen
Liao, Renjie
Zhao, Hengshuang
Hu, Rui
Bai, Min
Yumer, Ersin
Urtasun, Raquel
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8810 - 8818
[46] Unified ICH quantification and prognosis prediction in NCCT images using a multi-task interpretable network
Gong, Kai
Dai, Qian
Wang, Jiacheng
Zheng, Yingbin
Shi, Tao
Yu, Jiaxing
Chen, Jiangwang
Huang, Shaohui
Wang, Zhanxiang
FRONTIERS IN NEUROSCIENCE, 2023, 17
[47] Unified Autoencoder with Task Embeddings for Multi-Task Learning in Renewable Power Forecasting
Nivarthi, Chandana Priya
Vogt, Stephan
Sick, Bernhard
2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1530 - 1536
[48] MultiNet: Multi-Modal Multi-Task Learning for Autonomous Driving
Chowdhuri, Sauhaarda
Pankaj, Tushar
Zipser, Karl
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1496 - 1504
[49] TriLiteNet: Lightweight Model for Multi-Task Visual Perception
Che, Quang-Huy
Lam, Duc-Khai
IEEE ACCESS, 2025, 13 : 50152 - 50166
[50] Integrated Perception with Recurrent Multi-Task Neural Networks
Bilen, Hakan
Vedaldi, Andrea
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29

← 1 2 3 4 5 →