GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

被引:0
|
作者
Liu, Yunxiang [1 ]
Ma, Haili [1 ]
Zhu, Jianlin [1 ]
Zhang, Qiangbo [1 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期
关键词
Autonomous driving; multitask learning; drivable area segmentation; lane detection; vehicle detection;
D O I
10.32604/cmc.2024.053710
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To enhance the efficiency and accuracy of environmental perception for autonomous vehicles, we propose GDMNet, a unified multi-task perception network for autonomous driving, capable of performing drivable area segmentation, lane detection, and traffic object detection. Firstly, in the encoding stage, features are extracted, and Generalized Efficient Layer Aggregation Network (GELAN) is utilized to enhance feature extraction and gradient flow. Secondly, in the decoding stage, specialized detection heads are designed; the drivable area segmentation head employs DySample to expand feature maps, the lane detection head merges early-stage features and processes the output through the Focal Modulation Network (FMN). Lastly, the Minimum Point Distance IoU (MPDIoU) loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes, facilitating model training adjustments. Experimental results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union (mIoU) of 92.2%, lane detection accuracy and intersection over union (IoU) of 75.3% and 26.4%, respectively, and traffic object detection recall and mAP of 89.7% and 78.2%, respectively. The detection performance surpasses that of other single-task or multi-task algorithm models.
引用
收藏
页码:2963 / 2978
页数:16
相关论文
共 50 条
  • [31] A New Multi-task Network for Autonomous Driving: Efficientnetv1_Unet
    Li, Jiatian
    Peng, Jiangtao
    Meng, Ran
    Long, Qian
    Luo, Xinyu
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024, 2024, 14872 : 441 - 451
  • [32] Task Switching Network for Multi-task Learning
    Sun, Guolei
    Probst, Thomas
    Paudel, Danda Pani
    Popovic, Nikola
    Kanakis, Menelaos
    Patel, Jagruti
    Dai, Dengxin
    Van Gool, Luc
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8271 - 8280
  • [33] LiDAR-Based Multi-Task Road Perception Network for Autonomous Vehicles
    Yan, Fuwu
    Wang, Kewei
    Zou, Bin
    Tang, Luqi
    Li, Wenbo
    Lv, Chen
    IEEE ACCESS, 2020, 8 : 86753 - 86764
  • [34] Detecting Adversarial Perturbations in Multi-Task Perception
    Klingner, Marvin
    Kumar, Varun Ravi
    Yogamani, Senthil
    Baer, Andreas
    Fingscheidt, Tim
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 13050 - 13057
  • [35] Autonomous Driving Multi-Task Perception Algorithm Based on Receptive-Field Attention Convolution
    Liu, Yunxiang
    Ma, Haili
    Zhu, Jianlin
    Zhang, Qing
    Jin, Qi
    Computer Engineering and Applications, 2024, 60 (20) : 133 - 141
  • [36] Unified Voice Embedding through Multi-task Learning
    Rajenthiran, Jenarthanan
    Sithamaparanathan, Lakshikka
    Uthayakumar, Saranya
    Thayasivam, Uthayasanker
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 178 - 183
  • [37] Multi-Task Network Representation Learning
    Xie, Yu
    Jin, Peixuan
    Gong, Maoguo
    Zhang, Chen
    Yu, Bin
    FRONTIERS IN NEUROSCIENCE, 2020, 14
  • [38] Network Clustering for Multi-task Learning
    Mu, Zhiying
    Gao, Dehong
    Guo, Sensen
    NEURAL PROCESSING LETTERS, 2025, 57 (01)
  • [39] Multi-Task Assisted Driving Policy Learning Method for Autonomous Driving
    Luo, Yutao
    Xue, Zhicheng
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (10): : 31 - 40
  • [40] Attentive Task Interaction Network for Multi-Task Learning
    Sinodinos, Dimitrios
    Armanfard, Narges
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2885 - 2891