Opposition-based Q(ℷ) algorithm

被引:0
|
作者
Shokri, Maryam [1 ]
Tizhooshl, Hamid R. [1 ]
Kamel, Mohamed [2 ]
机构
[1] Univ Waterloo, Dept Syst Design Engn, Pattern Anal & Machine Intelligence Lab, 200 Univ Ave W, Waterloo, ON N2L 3G1, Canada
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of delayed reward in reinforcement learning is usually tackled by implementing the mechanism of eligibility traces. In this paper we introduce an extension of eligibility traces to solve one of the challenging problems in reinforcement learning. The concept of opposition traces is proposed in this work to deal with large state space problems in reinforcement learning applications. We combine the idea of opposition and eligibility traces to construct the opposition-based Q(lambda). The results are compared with the conventional Watkins' Q(lambda) and reflect a remarkable performance increase.
引用
收藏
页码:254 / +
页数:2
相关论文
共 50 条
  • [31] Opposition-based quantum firework algorithm for continuous optimisation problems
    Gao, Hongyuan
    Li, Chenwan
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2015, 6 (03) : 256 - 265
  • [32] An improved Stud Genetic Algorithm using the Opposition-based Strategy
    Xu, Hongwei
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 32 - 37
  • [33] Improved grasshopper optimization algorithm using opposition-based learning
    Ewees, Ahmed A.
    Abd Elaziz, Mohamed
    Houssein, Essam H.
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 112 : 156 - 172
  • [34] Opposition-based Magnetic Optimization Algorithm with parameter adaptation strategy
    Aziz, Mahdi
    Tayarani-N, Mohammad-H.
    SWARM AND EVOLUTIONARY COMPUTATION, 2016, 26 : 97 - 119
  • [35] An Improved Opposition-based Disruption Operator in Gravitational Search Algorithm
    Liu, Hao
    Ding, Guiyan
    Sun, Huafei
    2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 2, 2012, : 123 - 126
  • [36] Opposition-based firefly algorithm for earth slope stability evaluation
    Khajehzadeh, Mohammad
    Taha, Mohd Raihan
    Eslami, Mahdiyeh
    CHINA OCEAN ENGINEERING, 2014, 28 (05) : 713 - 724
  • [37] An Adaptive Opposition-Based Learning Selection: The Case for Jaya Algorithm
    Nasser, Abdullah B.
    Zamli, Kamal Z.
    Hujainah, Fadhl
    Ghanem, Waheed Ali H. M.
    Saad, Abdul-Malik H. Y.
    Alduais, Nayef Abdulwahab Mohammed
    IEEE ACCESS, 2021, 9 : 55581 - 55594
  • [38] An Opposition-Based Chaotic Salp Swarm Algorithm for Global Optimization
    Zhao, Xiaoqiang
    Yang, Fan
    Han, Yazhou
    Cui, Yanpeng
    IEEE ACCESS, 2020, 8 : 36485 - 36501
  • [39] Opposition-Based Backtracking Search Algorithm for Numerical Optimization Problems
    Xu, Qingzheng
    Guo, Lemeng
    Wang, Na
    Xu, Li
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 223 - 234
  • [40] Enhancing firefly algorithm using generalized opposition-based learning
    Yu, Shuhao
    Zhu, Shenglong
    Ma, Yan
    Mao, Demei
    COMPUTING, 2015, 97 (07) : 741 - 754