Opposition-based Q(λ) algorithm

Cited by: 0
Authors
Shokri, Maryam [1]
Tizhoosh, Hamid R. [1]
Kamel, Mohamed [2]
Affiliations
[1] Univ Waterloo, Dept Syst Design Engn, Pattern Anal & Machine Intelligence Lab, 200 Univ Ave W, Waterloo, ON N2L 3G1, Canada
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The problem of delayed reward in reinforcement learning is usually tackled by implementing the mechanism of eligibility traces. In this paper we introduce an extension of eligibility traces to solve one of the challenging problems in reinforcement learning. The concept of opposition traces is proposed in this work to deal with large state space problems in reinforcement learning applications. We combine the idea of opposition and eligibility traces to construct the opposition-based Q(λ). The results are compared with the conventional Watkins' Q(λ) and reflect a remarkable performance increase.
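The abstract does not spell out the update rule, so the following is only a minimal sketch of how opposition traces might sit alongside Watkins' eligibility traces in a tabular setting. It is not the authors' algorithm. Everything specific here is an assumption made for illustration: the six-state chain environment, the `step` model, the definition of `opposite` as the reverse move, the use of a known model to evaluate the opposite action, and all parameter values.

```python
import random
from collections import defaultdict

N_STATES = 6        # hypothetical chain of states 0..5; state 5 is the goal
ACTIONS = (-1, +1)  # move left, move right


def opposite(action):
    """Assumed notion of opposition: the move in the reverse direction."""
    return -action


def step(state, action):
    """Hypothetical deterministic model: +1 at the goal, -0.01 per other move."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else -0.01), done


def best_value(Q, state):
    return max(Q[(state, a)] for a in ACTIONS)


def greedy(Q, state):
    return max(ACTIONS, key=lambda a: Q[(state, a)])


def td_error(Q, state, action, gamma):
    """One-step Q-learning error for (state, action), using the model above."""
    nxt, reward, done = step(state, action)
    target = reward + (0.0 if done else gamma * best_value(Q, nxt))
    return target - Q[(state, action)], nxt, done


def apply_traces(Q, traces, delta, alpha, decay):
    """Move every traced Q-value by alpha * delta * trace, then decay the trace."""
    for key in list(traces):
        Q[key] += alpha * delta * traces[key]
        traces[key] *= decay


def train(episodes=200, alpha=0.1, gamma=0.9, lam=0.8, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    Q = defaultdict(float)
    for _ in range(episodes):
        e = defaultdict(float)      # eligibility traces (actions taken)
        e_opp = defaultdict(float)  # opposition traces (opposite actions)
        state, done, steps = 0, False, 0
        while not done and steps < 1000:
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = greedy(Q, state)
            # Watkins' rule: an exploratory (non-greedy) action cuts all traces.
            if Q[(state, action)] < best_value(Q, state):
                e.clear()
                e_opp.clear()
            opp = opposite(action)
            delta, nxt, done = td_error(Q, state, action, gamma)
            # Assumption: the opposite action is evaluated with the known model.
            delta_opp, _, _ = td_error(Q, state, opp, gamma)
            e[(state, action)] += 1.0
            e_opp[(state, opp)] += 1.0
            apply_traces(Q, e, delta, alpha, gamma * lam)
            apply_traces(Q, e_opp, delta_opp, alpha, gamma * lam)
            state, steps = nxt, steps + 1
    return Q


if __name__ == "__main__":
    Q = train()
    # Greedy action learned for each non-terminal state.
    print({s: greedy(Q, s) for s in range(N_STATES - 1)})
```

In this sketch each real step updates two Q-values, one for the action taken and one for its assumed opposite. This is one plausible reading of how opposition could speed up learning in large state spaces.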
Pages: 254 / +
Page count: 2