A novel approach for self-driving car in partially observable environment using life long reinforcement learning

被引:1
|
作者
Quadir, Md Abdul [1 ]
Jaiswal, Dibyanshu [1 ]
Mohan, Senthilkumar [2 ]
Innab, Nisreen [3 ]
Sulaiman, Riza [4 ]
Alaoui, Mohammed Kbiri [5 ]
Ahmadian, Ali [6 ,7 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai 600127, India
[2] Vellore Inst Technol, Sch Comp Sci Engn & Informat Syst, Vellore 632014, Tamilnadu, India
[3] AlMaarefa Univ, Coll Appl Sci, Dept Comp Sci & Informat Syst, Riyadh, Saudi Arabia
[4] Univ Kebangsaan Malaysia, Inst Visual Informat, Bangi 43600, Malaysia
[5] King Khalid Univ, Coll Sci, Dept Math, Abha 61413, POB 9004, Saudi Arabia
[6] Mediterranea Univ Reggio Calabria, Decis Lab, Reggio Di Calabria, Italy
[7] Istanbul Okan Univ, Fac Engn & Nat Sci, Istanbul, Turkiye
来源
关键词
Reinforcement Learning; Lifelong Learning; Self-driving car; Lifelong reinforcement learning; Partially observable Environment; POLICY; GAMES;
D O I
10.1016/j.segan.2024.101356
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Despite ground-breaking advancements in robotics, gaming, and other challenging domains, reinforcement learning still faces significant challenges in solving dynamic, open-world problems. Since reinforcement learning algorithms usually perform poorly when exposed to new tasks outside of their data distribution, continuous learning algorithms have drawn significant attention. In parallel with work on lifelong learning algorithms, there is a need for challenging environments, properly planned trials, and metrics to measure research success. In this context, a Deep Asynchronous Autonomous Learning System (DAALS) is proposed in this paper for training a selfdriving car in a partially observable environment for real-world scenarios in a continuous state-action space. To cater to three different use cases, three different algorithms were used. To train their agents for learning and upgrading discrete state policies, DAALS used the Asynchronous Advantage Stager Reviewer (AASR) algorithm. To train its agent for continuous state spaces, DAALS also uses an Extensive Deterministic Policy Gradient (EDPG) algorithm. To train the agent in a lifelong form of learning for partially observable environments, DAALS uses a Deep Deterministic Policy Gradient Novel Lifelong Learning Algorithm (DDPGNLLA). The system offers flexibility to the user to train the agents for both discrete and continuous state-action spaces. Compared to previous models in continuous state-action spaces, Deep deterministic policy gradient lifelong learning algorithm outperforms previous models by 46.09%. Furthermore, the Deep Asynchronous Autonomous System tends to outperform all previous reinforcement learning algorithms, making our proposed approach a real-world solution. As DAALS has tested on number of different environments it provides the insights on how modern Artificial Intelligence (AI) solutions can be generalized making it one of the better solutions for AI general domain problems.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] CoPace: Edge Computation Offloading and Caching for Self-Driving With Deep Reinforcement Learning
    Tian, Hao
    Xu, Xiaolong
    Qi, Lianyong
    Zhang, Xuyun
    Dou, Wanchun
    Yu, Shui
    Ni, Qiang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) : 13281 - 13293
  • [42] A Novel Cognitively Inspired Deep Learning Approach to Detect Drivable Areas for Self-driving Cars
    Jiang, Fengling
    Wang, Zeling
    Yue, Guoqing
    COGNITIVE COMPUTATION, 2024, 16 (02) : 517 - 533
  • [43] A Reinforcement Learning Approach to Dynamic Trajectory Optimization with Consideration of Imbalanced Sub-Goals in Self-Driving Vehicles
    Kim, Yu-Jin
    Ahn, Woo-Jin
    Jang, Sun-Ho
    Lim, Myo-Taeg
    Pae, Dong-Sung
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [44] A deep reinforcement learning assisted task offloading and resource allocation approach towards self-driving object detection
    Lili Nie
    Huiqiang Wang
    Guangsheng Feng
    Jiayu Sun
    Hongwu Lv
    Hang Cui
    Journal of Cloud Computing, 12
  • [45] A deep reinforcement learning assisted task offloading and resource allocation approach towards self-driving object detection
    Nie, Lili
    Wang, Huiqiang
    Feng, Guangsheng
    Sun, Jiayu
    Lv, Hongwu
    Cui, Hang
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01):
  • [46] A Novel Cognitively Inspired Deep Learning Approach to Detect Drivable Areas for Self-driving Cars
    Fengling Jiang
    Zeling Wang
    Guoqing Yue
    Cognitive Computation, 2024, 16 : 517 - 533
  • [47] Self-driving Cars Using CNN and Q-learning
    Chishti, Syed Owais Ali
    Riaz, Sana
    Zaib, Muhammad Bilal
    Nauman, Mohammad
    2018 IEEE 21ST INTERNATIONAL MULTI-TOPIC CONFERENCE (INMIC), 2018,
  • [48] Fixed Settling Time Control for Self-Driving Car: Two-Timescales Approach
    Khan, Rameez
    Azim, Raja Amer
    Malik, Fahad Mumtaz
    Mazhar, Naveed
    Raza, Abid
    Ullah, Hameed
    IEEE ACCESS, 2022, 10 : 36518 - 36537
  • [49] Vanishing point detection for self-driving car using harmony search algorithm
    Moon, Yoon Young
    Geem, Zong Woo
    Han, Gi-Tae
    SWARM AND EVOLUTIONARY COMPUTATION, 2018, 41 : 111 - 119
  • [50] Hierarchical reinforcement learning for self-driving decision-making without reliance on labelled driving data
    Duan, Jingliang
    Eben Li, Shengbo
    Guan, Yang
    Sun, Qi
    Cheng, Bo
    IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (05) : 297 - 305