Representation Learning for Vision-Based Autonomous Driving via Probabilistic World Modeling

被引:0
|
作者
Chen, Haoqiang [1 ]
Liu, Yadong [1 ]
Hu, Dewen [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
关键词
autonomous driving; representation learning; world model; imitation learning;
D O I
10.3390/machines13030231
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Representation learning plays a vital role in autonomous driving by extracting meaningful features from raw sensory inputs. World models emerge as an effective approach to representation learning by capturing predictive features that can anticipate multiple possible futures, which is particularly suited for driving scenarios. However, existing world model approaches face two critical limitations: First, conventional methods rely heavily on computationally expensive variational inference that requires decoding back to high-dimensional observation space. Second, current end-to-end autonomous driving systems demand extensive labeled data for training, resulting in prohibitive annotation costs. To address these challenges, we present BYOL-Drive, a novel method that firstly introduces the self-supervised representation-learning paradigm BYOL (Bootstrap Your Own Latent) to implement world modeling. Our method eliminates the computational burden of observation space decoding while requiring substantially fewer labeled data compared to mainstream approaches. Additionally, our model only relies on monocular camera images as input, making it easy to deploy and generalize. Based on this learned representation, experiments on the standard closed-loop CARLA benchmark demonstrate that our BYOL-Drive achieves competitive performance with improved computational efficiency and significantly reduced annotation requirements compared to the state-of-the-art methods. Our work contributes to the development of end-to-end autonomous driving.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Vision-Based Autonomous Driving: A Model Learning Approach
    Baheri, Ali
    Kolmanovsky, Ilya
    Girard, Anouck
    Tseng, H. Eric
    Filev, Dimitar
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2520 - 2525
  • [2] Vision-Based Autonomous Driving: A Hierarchical Reinforcement Learning Approach
    Wang, Jiao
    Sun, Haoyi
    Zhu, Can
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11213 - 11226
  • [3] Vision-based environmental perception for autonomous driving
    Liu, Fei
    Lu, Zihao
    Lin, Xianke
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025, 239 (01) : 39 - 69
  • [4] Autonomous driving in traffic with end-to-end vision-based deep learning
    Paniego, Sergio
    Shinohara, Enrique
    Canas, Josemaria
    NEUROCOMPUTING, 2024, 594
  • [5] Exploring Data Aggregation in Policy Learning for Vision-based Urban Autonomous Driving
    Prakash, Aditya
    Behl, Aseem
    Ohn-Bar, Eshed
    Chitta, Kashyap
    Geiger, Andreas
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11760 - 11770
  • [6] A DISCRIMINATING FEATURE TRACKER FOR VISION-BASED AUTONOMOUS DRIVING
    SCHNEIDERMAN, H
    NASHMAN, M
    IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1994, 10 (06): : 769 - 775
  • [7] Vision-Based Trajectory Planning via Imitation Learning for Autonomous Vehicles
    Cai, Peide
    Sun, Yuxiang
    Chen, Yuying
    Liu, Ming
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2736 - 2742
  • [8] Navigation Command Matching for Vision-based Autonomous Driving
    Pan, Yuxin
    Xue, Jianru
    Zhang, Pengfei
    Ouyang, Wanli
    Fang, Jianwu
    Chen, Xingyu
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4343 - 4349
  • [9] CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-Based Autonomous Urban Driving
    Zhao, Yinuo
    Wu, Kun
    Xu, Zhiyuan
    Che, Zhengping
    Lu, Qi
    Tang, Jian
    Liu, Chi Harold
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3481 - 3489
  • [10] Vision-Based Autonomous Navigation with Evolutionary Learning
    Moya-Albor, Ernesto
    Ponce, Hiram
    Brieva, Jorge
    Coronel, Sandra L.
    Chavez-Domingue, Rodrigo
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2020, PT II, 2020, 12469 : 459 - 471