Representation Learning for Vision-Based Autonomous Driving via Probabilistic World Modeling

被引：0

作者：

Chen, Haoqiang ^{[1
]}

Liu, Yadong ^{[1
]}

Hu, Dewen ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

来源：

MACHINES | 2025年 / 13卷 / 03期

关键词：

autonomous driving; representation learning; world model; imitation learning;

D O I：

10.3390/machines13030231

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Representation learning plays a vital role in autonomous driving by extracting meaningful features from raw sensory inputs. World models emerge as an effective approach to representation learning by capturing predictive features that can anticipate multiple possible futures, which is particularly suited for driving scenarios. However, existing world model approaches face two critical limitations: First, conventional methods rely heavily on computationally expensive variational inference that requires decoding back to high-dimensional observation space. Second, current end-to-end autonomous driving systems demand extensive labeled data for training, resulting in prohibitive annotation costs. To address these challenges, we present BYOL-Drive, a novel method that firstly introduces the self-supervised representation-learning paradigm BYOL (Bootstrap Your Own Latent) to implement world modeling. Our method eliminates the computational burden of observation space decoding while requiring substantially fewer labeled data compared to mainstream approaches. Additionally, our model only relies on monocular camera images as input, making it easy to deploy and generalize. Based on this learned representation, experiments on the standard closed-loop CARLA benchmark demonstrate that our BYOL-Drive achieves competitive performance with improved computational efficiency and significantly reduced annotation requirements compared to the state-of-the-art methods. Our work contributes to the development of end-to-end autonomous driving.

引用

页数：20

共 50 条

[1] Vision-Based Autonomous Driving: A Model Learning Approach
Baheri, Ali
Kolmanovsky, Ilya
Girard, Anouck
Tseng, H. Eric
Filev, Dimitar
2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2520 - 2525
[2] Vision-Based Autonomous Driving: A Hierarchical Reinforcement Learning Approach
Wang, Jiao
Sun, Haoyi
Zhu, Can
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11213 - 11226
[3] Vision-based environmental perception for autonomous driving
Liu, Fei
Lu, Zihao
Lin, Xianke
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025, 239 (01) : 39 - 69
[4] Autonomous driving in traffic with end-to-end vision-based deep learning
Paniego, Sergio
Shinohara, Enrique
Canas, Josemaria
NEUROCOMPUTING, 2024, 594
[5] Exploring Data Aggregation in Policy Learning for Vision-based Urban Autonomous Driving
Prakash, Aditya
Behl, Aseem
Ohn-Bar, Eshed
Chitta, Kashyap
Geiger, Andreas
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11760 - 11770
[6] A DISCRIMINATING FEATURE TRACKER FOR VISION-BASED AUTONOMOUS DRIVING
SCHNEIDERMAN, H
NASHMAN, M
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1994, 10 (06): : 769 - 775
[7] Vision-Based Trajectory Planning via Imitation Learning for Autonomous Vehicles
Cai, Peide
Sun, Yuxiang
Chen, Yuying
Liu, Ming
2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2736 - 2742
[8] Navigation Command Matching for Vision-based Autonomous Driving
Pan, Yuxin
Xue, Jianru
Zhang, Pengfei
Ouyang, Wanli
Fang, Jianwu
Chen, Xingyu
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4343 - 4349
[9] CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-Based Autonomous Urban Driving
Zhao, Yinuo
Wu, Kun
Xu, Zhiyuan
Che, Zhengping
Lu, Qi
Tang, Jian
Liu, Chi Harold
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3481 - 3489
[10] Vision-Based Autonomous Navigation with Evolutionary Learning
Moya-Albor, Ernesto
Ponce, Hiram
Brieva, Jorge
Coronel, Sandra L.
Chavez-Domingue, Rodrigo
ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2020, PT II, 2020, 12469 : 459 - 471

← 1 2 3 4 5 →