Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning

Cited by: 10
Authors
Guo, Jinghua [1]
Wang, Jingyao [2]
Wang, Huinian [1]
Xiao, Baoping [1]
He, Zhifei [1]
Li, Lubin [1]
Affiliations
[1] Xiamen Univ, Dept Mech & Elect Engn, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
Keywords
autonomous vehicles; visual perception; multi-task learning; traffic object detection; drivable area detection; lane line detection
DOI
10.3390/s23136238
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Road scene understanding is crucial to the safe driving of autonomous vehicles. Comprehensive road scene understanding requires a visual perception system to handle many tasks simultaneously, which calls for a perception model that is small, fast, and accurate. Because multi-task learning has clear advantages in performance and computational cost, this paper proposes YOLO-Object, Drivable Area, and Lane Line Detection (YOLO-ODL), a multi-task model based on hard parameter sharing that jointly and efficiently detects traffic objects, drivable areas, and lane lines. To balance the tasks of YOLO-ODL, a weight balancing strategy is introduced so that the weight parameters of the model are adjusted automatically during training, and a Mosaic migration optimization scheme is adopted to improve the model's evaluation metrics. The YOLO-ODL model performs well on the challenging BDD100K dataset, achieving state-of-the-art accuracy and computational efficiency.
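The abstract does not say how the weight balancing strategy works. As a rough illustration only, the sketch below shows one common way to adjust task weights automatically in a hard-parameter-sharing model, where the three heads share one backbone and their losses are summed into a single objective. It uses uncertainty-style weighting with learnable log-variances, which is a stand-in technique and not necessarily the scheme used in YOLO-ODL. The function names and the loss values are hypothetical.

```python
import math

def combine_losses(task_losses, log_vars):
    """Joint objective: sum_i exp(-s_i) * L_i + s_i (uncertainty-style weighting).

    Assumption: this is a stand-in, not the paper's weight balancing strategy.
    """
    return sum(math.exp(-s) * loss + s for loss, s in zip(task_losses, log_vars))

def update_log_vars(task_losses, log_vars, lr=0.1):
    """One gradient-descent step on each log-variance s_i.

    d/ds_i [exp(-s_i) * L_i + s_i] = 1 - exp(-s_i) * L_i
    """
    return [s - lr * (1.0 - math.exp(-s) * loss)
            for loss, s in zip(task_losses, log_vars)]

# Hypothetical, fixed per-task losses: object detection, drivable area, lane line.
losses = [2.0, 0.5, 1.0]
log_vars = [0.0, 0.0, 0.0]

for _ in range(200):
    log_vars = update_log_vars(losses, log_vars)

# Effective task weights after adaptation; each approaches 1 / L_i here.
weights = [math.exp(-s) for s in log_vars]
joint_loss = combine_losses(losses, log_vars)
```

With the losses held fixed, each weight settles near the reciprocal of its task loss, so a task with a large loss is down-weighted instead of dominating the shared backbone's gradient. In real training the losses change every step and the log-variances would be optimized together with the network parameters.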
Pages: 14
Related papers
50 in total
  • [1] Context-Aware Multi-Task Learning for Traffic Scene Recognition in Autonomous Vehicles
    Lee, Younkwan
    Jeon, Jihyo
    Yu, Jongmin
    Jeon, Moongu
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020 : 723 - 730
  • [2] LiDAR-Based Multi-Task Road Perception Network for Autonomous Vehicles
    Yan, Fuwu
    Wang, Kewei
    Zou, Bin
    Tang, Luqi
    Li, Wenbo
    Lv, Chen
    IEEE ACCESS, 2020, 8 : 86753 - 86764
  • [3] HirMTL: Hierarchical Multi-Task Learning for dense scene understanding
    Luo, Huilan
    Hu, Weixia
    Wei, Yixiao
    He, Jianlong
    Yu, Minghao
    NEURAL NETWORKS, 2025, 181
  • [4] Increasing the Efficiency of Policy Learning for Autonomous Vehicles by Multi-Task Representation Learning
    Kargar, Eshagh
    Kyrki, Ville
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (03) : 701 - 710
  • [5] AdaMT-Net: An Adaptive Weight Learning Based Multi-Task Learning Model For Scene Understanding
    Jha, Ankit
    Kumar, Awanish
    Banerjee, Biplab
    Chaudhuri, Subhasis
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020 : 3027 - 3035
  • [6] Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding
    Seenivasan, Lalithkumar
    Mitheran, Sai
    Islam, Mobarakol
    Ren, Hongliang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 3858 - 3865
  • [7] Stixel Based Scene Understanding for Autonomous Vehicles
    Wieszok, Zygfryd
    Aouf, Nabil
    Kechagias-Stamatis, Odysseas
    Chermak, Lounis
    PROCEEDINGS OF THE 2017 IEEE 14TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2017), 2017 : 43 - 48
  • [8] RESEARCH OF MULTI-TASK LEARNING BASED ON EXTREME LEARNING MACHINE
    Mao, Wentao
    Xu, Jiucheng
    Zhao, Shengjie
    Tian, Mei
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2013, 21 : 75 - 85
  • [9] Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus
    Leordeanu, Marius
    Pirvu, Mihai Cristian
    Costea, Dragos
    Marcu, Alina E.
    Slusanschi, Emil
    Sukthankar, Rahul
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1882 - 1892
  • [10] Multi-view representation learning in multi-task scene
    Run-kun Lu
    Jian-wei Liu
    Si-ming Lian
    Xin Zuo
    Neural Computing and Applications, 2020, 32 : 10403 - 10422