Representation Learning for Vision-Based Autonomous Driving via Probabilistic World Modeling

被引:0
|
作者
Chen, Haoqiang [1 ]
Liu, Yadong [1 ]
Hu, Dewen [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
关键词
autonomous driving; representation learning; world model; imitation learning;
D O I
10.3390/machines13030231
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Representation learning plays a vital role in autonomous driving by extracting meaningful features from raw sensory inputs. World models emerge as an effective approach to representation learning by capturing predictive features that can anticipate multiple possible futures, which is particularly suited for driving scenarios. However, existing world model approaches face two critical limitations: First, conventional methods rely heavily on computationally expensive variational inference that requires decoding back to high-dimensional observation space. Second, current end-to-end autonomous driving systems demand extensive labeled data for training, resulting in prohibitive annotation costs. To address these challenges, we present BYOL-Drive, a novel method that firstly introduces the self-supervised representation-learning paradigm BYOL (Bootstrap Your Own Latent) to implement world modeling. Our method eliminates the computational burden of observation space decoding while requiring substantially fewer labeled data compared to mainstream approaches. Additionally, our model only relies on monocular camera images as input, making it easy to deploy and generalize. Based on this learned representation, experiments on the standard closed-loop CARLA benchmark demonstrate that our BYOL-Drive achieves competitive performance with improved computational efficiency and significantly reduced annotation requirements compared to the state-of-the-art methods. Our work contributes to the development of end-to-end autonomous driving.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Ultra-Fast Deraining Plugin for Vision-Based Perception of Autonomous Driving
    Li, Jihao
    Hu, Jincheng
    Fu, Pengyu
    Yang, Jun
    Jiang, Jingjing
    Zhang, Yuanjian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 1227 - 1240
  • [32] Attacking vision-based perception in end-to-end autonomous driving models
    Boloor, Adith
    Garimella, Karthik
    He, Xin
    Gill, Christopher
    Vorobeychik, Yevgeniy
    Zhang, Xuan
    JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 110
  • [33] A comparative study of vision-based lateral control strategies for autonomous highway driving
    Kosecka, J
    Blasi, R
    Taylor, CJ
    Malik, J
    1998 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, 1998, : 1903 - 1908
  • [34] Challenges of Designing Computer Vision-based Pedestrian Detector for Supporting Autonomous Driving
    Sun, Peng
    Boukerche, Azzedine
    2019 IEEE 16TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2019), 2019, : 28 - 36
  • [35] GRI: General Reinforced Imitation and Its Application to Vision-Based Autonomous Driving
    Chekroun, Raphael
    Toromanoff, Marin
    Hornauer, Sascha
    Moutarde, Fabien
    ROBOTICS, 2023, 12 (05)
  • [36] A color vision-based lane tracking system for autonomous driving on unmarked roads
    Sotelo, MA
    Rodriguez, FJ
    Magdalena, L
    Bergasa, LM
    Boquete, L
    AUTONOMOUS ROBOTS, 2004, 16 (01) : 95 - 116
  • [37] Learning Interpretable End-to-End Vision-Based Motion Planning for Autonomous Driving with Optical Flow Distillation
    Wang, Hengli
    Cai, Peide
    Sun, Yuxiang
    Wang, Lujia
    Liu, Ming
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13731 - 13737
  • [38] AN AUTONOMOUS VISION-BASED MOBILE ROBOT
    BAUMGARTNER, ET
    SKAAR, SB
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1994, 39 (03) : 493 - 502
  • [39] Vision-based autonomous soccer robots
    Khessal, NO
    Naing, MY
    Hwee, ENB
    Oo, PS
    Antony, LHS
    IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM, 2000, : 207 - 212
  • [40] Autonomous Learning of Vision-based Layered Object Models on Mobile Robots
    Li, Xiang
    Sridharan, Mohan
    Zhang, Shiqi
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,