Representation Learning for Vision-Based Autonomous Driving via Probabilistic World Modeling

被引：0

作者：

Chen, Haoqiang ^{[1
]}

Liu, Yadong ^{[1
]}

Hu, Dewen ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

来源：

MACHINES | 2025年 / 13卷 / 03期

关键词：

autonomous driving; representation learning; world model; imitation learning;

D O I：

10.3390/machines13030231

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Representation learning plays a vital role in autonomous driving by extracting meaningful features from raw sensory inputs. World models emerge as an effective approach to representation learning by capturing predictive features that can anticipate multiple possible futures, which is particularly suited for driving scenarios. However, existing world model approaches face two critical limitations: First, conventional methods rely heavily on computationally expensive variational inference that requires decoding back to high-dimensional observation space. Second, current end-to-end autonomous driving systems demand extensive labeled data for training, resulting in prohibitive annotation costs. To address these challenges, we present BYOL-Drive, a novel method that firstly introduces the self-supervised representation-learning paradigm BYOL (Bootstrap Your Own Latent) to implement world modeling. Our method eliminates the computational burden of observation space decoding while requiring substantially fewer labeled data compared to mainstream approaches. Additionally, our model only relies on monocular camera images as input, making it easy to deploy and generalize. Based on this learned representation, experiments on the standard closed-loop CARLA benchmark demonstrate that our BYOL-Drive achieves competitive performance with improved computational efficiency and significantly reduced annotation requirements compared to the state-of-the-art methods. Our work contributes to the development of end-to-end autonomous driving.

引用

页数：20

共 50 条

[31] Ultra-Fast Deraining Plugin for Vision-Based Perception of Autonomous Driving
Li, Jihao
Hu, Jincheng
Fu, Pengyu
Yang, Jun
Jiang, Jingjing
Zhang, Yuanjian
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 1227 - 1240
[32] Attacking vision-based perception in end-to-end autonomous driving models
Boloor, Adith
Garimella, Karthik
He, Xin
Gill, Christopher
Vorobeychik, Yevgeniy
Zhang, Xuan
JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 110
[33] A comparative study of vision-based lateral control strategies for autonomous highway driving
Kosecka, J
Blasi, R
Taylor, CJ
Malik, J
1998 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-4, 1998, : 1903 - 1908
[34] Challenges of Designing Computer Vision-based Pedestrian Detector for Supporting Autonomous Driving
Sun, Peng
Boukerche, Azzedine
2019 IEEE 16TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2019), 2019, : 28 - 36
[35] GRI: General Reinforced Imitation and Its Application to Vision-Based Autonomous Driving
Chekroun, Raphael
Toromanoff, Marin
Hornauer, Sascha
Moutarde, Fabien
ROBOTICS, 2023, 12 (05)
[36] A color vision-based lane tracking system for autonomous driving on unmarked roads
Sotelo, MA
Rodriguez, FJ
Magdalena, L
Bergasa, LM
Boquete, L
AUTONOMOUS ROBOTS, 2004, 16 (01) : 95 - 116
[37] Learning Interpretable End-to-End Vision-Based Motion Planning for Autonomous Driving with Optical Flow Distillation
Wang, Hengli
Cai, Peide
Sun, Yuxiang
Wang, Lujia
Liu, Ming
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13731 - 13737
[38] AN AUTONOMOUS VISION-BASED MOBILE ROBOT
BAUMGARTNER, ET
SKAAR, SB
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1994, 39 (03) : 493 - 502
[39] Vision-based autonomous soccer robots
Khessal, NO
Naing, MY
Hwee, ENB
Oo, PS
Antony, LHS
IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM, 2000, : 207 - 212
[40] Autonomous Learning of Vision-based Layered Object Models on Mobile Robots
Li, Xiang
Sridharan, Mohan
Zhang, Shiqi
2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,

← 1 2 3 4 5 →