GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

被引:0
|
作者
Liu, Yunxiang [1 ]
Ma, Haili [1 ]
Zhu, Jianlin [1 ]
Zhang, Qiangbo [1 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期
关键词
Autonomous driving; multitask learning; drivable area segmentation; lane detection; vehicle detection;
D O I
10.32604/cmc.2024.053710
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To enhance the efficiency and accuracy of environmental perception for autonomous vehicles, we propose GDMNet, a unified multi-task perception network for autonomous driving, capable of performing drivable area segmentation, lane detection, and traffic object detection. Firstly, in the encoding stage, features are extracted, and Generalized Efficient Layer Aggregation Network (GELAN) is utilized to enhance feature extraction and gradient flow. Secondly, in the decoding stage, specialized detection heads are designed; the drivable area segmentation head employs DySample to expand feature maps, the lane detection head merges early-stage features and processes the output through the Focal Modulation Network (FMN). Lastly, the Minimum Point Distance IoU (MPDIoU) loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes, facilitating model training adjustments. Experimental results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union (mIoU) of 92.2%, lane detection accuracy and intersection over union (IoU) of 75.3% and 26.4%, respectively, and traffic object detection recall and mAP of 89.7% and 78.2%, respectively. The detection performance surpasses that of other single-task or multi-task algorithm models.
引用
收藏
页码:2963 / 2978
页数:16
相关论文
共 50 条
  • [41] Statistically correlated multi-task learning for autonomous driving
    Waseem Abbas
    Muhammad Fakhir Khan
    Murtaza Taj
    Arif Mahmood
    Neural Computing and Applications, 2021, 33 : 12921 - 12938
  • [42] Statistically correlated multi-task learning for autonomous driving
    Abbas, Waseem
    Khan, Muhammad Fakhir
    Taj, Murtaza
    Mahmood, Arif
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (19): : 12921 - 12938
  • [43] Optimal Configuration of Multi-Task Learning for Autonomous Driving
    Jun, Woomin
    Son, Minjun
    Yoo, Jisang
    Lee, Sungjin
    SENSORS, 2023, 23 (24)
  • [44] A Unified Neural Network for Panoptic Segmentation
    Yao, L.
    Chyau, A.
    COMPUTER GRAPHICS FORUM, 2019, 38 (07) : 461 - 468
  • [45] UPSNet: A Unified Panoptic Segmentation Network
    Xiong, Yuwen
    Liao, Renjie
    Zhao, Hengshuang
    Hu, Rui
    Bai, Min
    Yumer, Ersin
    Urtasun, Raquel
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8810 - 8818
  • [46] Unified ICH quantification and prognosis prediction in NCCT images using a multi-task interpretable network
    Gong, Kai
    Dai, Qian
    Wang, Jiacheng
    Zheng, Yingbin
    Shi, Tao
    Yu, Jiaxing
    Chen, Jiangwang
    Huang, Shaohui
    Wang, Zhanxiang
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [47] Unified Autoencoder with Task Embeddings for Multi-Task Learning in Renewable Power Forecasting
    Nivarthi, Chandana Priya
    Vogt, Stephan
    Sick, Bernhard
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1530 - 1536
  • [48] MultiNet: Multi-Modal Multi-Task Learning for Autonomous Driving
    Chowdhuri, Sauhaarda
    Pankaj, Tushar
    Zipser, Karl
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1496 - 1504
  • [49] TriLiteNet: Lightweight Model for Multi-Task Visual Perception
    Che, Quang-Huy
    Lam, Duc-Khai
    IEEE ACCESS, 2025, 13 : 50152 - 50166
  • [50] Integrated Perception with Recurrent Multi-Task Neural Networks
    Bilen, Hakan
    Vedaldi, Andrea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29