A novel approach for self-driving car in partially observable environment using life long reinforcement learning

被引:1
|
作者
Quadir, Md Abdul [1 ]
Jaiswal, Dibyanshu [1 ]
Mohan, Senthilkumar [2 ]
Innab, Nisreen [3 ]
Sulaiman, Riza [4 ]
Alaoui, Mohammed Kbiri [5 ]
Ahmadian, Ali [6 ,7 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai 600127, India
[2] Vellore Inst Technol, Sch Comp Sci Engn & Informat Syst, Vellore 632014, Tamilnadu, India
[3] AlMaarefa Univ, Coll Appl Sci, Dept Comp Sci & Informat Syst, Riyadh, Saudi Arabia
[4] Univ Kebangsaan Malaysia, Inst Visual Informat, Bangi 43600, Malaysia
[5] King Khalid Univ, Coll Sci, Dept Math, Abha 61413, POB 9004, Saudi Arabia
[6] Mediterranea Univ Reggio Calabria, Decis Lab, Reggio Di Calabria, Italy
[7] Istanbul Okan Univ, Fac Engn & Nat Sci, Istanbul, Turkiye
来源
关键词
Reinforcement Learning; Lifelong Learning; Self-driving car; Lifelong reinforcement learning; Partially observable Environment; POLICY; GAMES;
D O I
10.1016/j.segan.2024.101356
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Despite ground-breaking advancements in robotics, gaming, and other challenging domains, reinforcement learning still faces significant challenges in solving dynamic, open-world problems. Since reinforcement learning algorithms usually perform poorly when exposed to new tasks outside of their data distribution, continuous learning algorithms have drawn significant attention. In parallel with work on lifelong learning algorithms, there is a need for challenging environments, properly planned trials, and metrics to measure research success. In this context, a Deep Asynchronous Autonomous Learning System (DAALS) is proposed in this paper for training a selfdriving car in a partially observable environment for real-world scenarios in a continuous state-action space. To cater to three different use cases, three different algorithms were used. To train their agents for learning and upgrading discrete state policies, DAALS used the Asynchronous Advantage Stager Reviewer (AASR) algorithm. To train its agent for continuous state spaces, DAALS also uses an Extensive Deterministic Policy Gradient (EDPG) algorithm. To train the agent in a lifelong form of learning for partially observable environments, DAALS uses a Deep Deterministic Policy Gradient Novel Lifelong Learning Algorithm (DDPGNLLA). The system offers flexibility to the user to train the agents for both discrete and continuous state-action spaces. Compared to previous models in continuous state-action spaces, Deep deterministic policy gradient lifelong learning algorithm outperforms previous models by 46.09%. Furthermore, the Deep Asynchronous Autonomous System tends to outperform all previous reinforcement learning algorithms, making our proposed approach a real-world solution. As DAALS has tested on number of different environments it provides the insights on how modern Artificial Intelligence (AI) solutions can be generalized making it one of the better solutions for AI general domain problems.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Decision Making for Self-Driving Vehicles in Unexpected Environments Using Efficient Reinforcement Learning Methods
    Kim, Min-Seong
    Eoh, Gyuho
    Park, Tae-Hyoung
    ELECTRONICS, 2022, 11 (11)
  • [32] Towards Self-Driving Radios: Physical-Layer Control using Deep Reinforcement Learning
    Joseph, Samuel
    Misra, Rakesh
    Katti, Sachin
    HOTMOBILE '19 - PROCEEDINGS OF THE 20TH INTERNATIONAL WORKSHOP ON MOBILE COMPUTING SYSTEMS AND APPLICATIONS, 2019, : 69 - 74
  • [33] A Self-Driving Car Implementation using Computer Vision for Detection and Navigation
    Barua, Bhaskar
    Gomes, Clarence
    Baghe, Shubham
    Sisodia, Jignesh
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 271 - 274
  • [34] Collaborative Partially-Observable Reinforcement Learning Using Wireless Communications
    Ko, Eisaku
    Chen, Kwang-Cheng
    Lien, Shao-Yu
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [35] Deep Reinforcement Learning for a Self-Driving Vehicle Operating Solely on Visual Information
    Audinys, Robertas
    Slikas, Zygimantas
    Radkevicius, Justas
    Sutas, Mantas
    Ostreika, Armantas
    ELECTRONICS, 2025, 14 (05):
  • [36] ICRAN: Intelligent Control for Self-Driving RAN Based on Deep Reinforcement Learning
    Ahmed, Azza H.
    Elmokashfi, Ahmed
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (03): : 2751 - 2766
  • [37] Lane-Merging Strategy for a Self-Driving Car in Dense Traffic Using the Stackelberg Game Approach
    Ji, Kyoungtae
    Orsag, Matko
    Han, Kyoungseok
    ELECTRONICS, 2021, 10 (08)
  • [38] Hierarchical Deep Reinforcement Learning for Multi-robot Cooperation in Partially Observable Environment
    Liang, Zhixuan
    Cao, Jiannong
    Lin, Wanyu
    Chen, Jinlin
    Xu, Huafeng
    2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 272 - 281
  • [39] Towards Self-Driving Optical Networking with Reinforcement Learning and Knowledge Transferring (Invited)
    Chen, Xiaoliang
    Proietti, Roberto
    Liu, Che-Yu
    Ben Yoo, S. J.
    2020 INTERNATIONAL CONFERENCE ON OPTICAL NETWORK DESIGN AND MODELING (ONDM), 2020,
  • [40] CIRL: Controllable Imitative Reinforcement Learning for Vision-Based Self-driving
    Liang, Xiaodan
    Wang, Tairui
    Yang, Luona
    Xing, Eric
    COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 604 - 620