Opposition-based Q(ℷ) algorithm

被引:0
|
作者
Shokri, Maryam [1 ]
Tizhooshl, Hamid R. [1 ]
Kamel, Mohamed [2 ]
机构
[1] Univ Waterloo, Dept Syst Design Engn, Pattern Anal & Machine Intelligence Lab, 200 Univ Ave W, Waterloo, ON N2L 3G1, Canada
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of delayed reward in reinforcement learning is usually tackled by implementing the mechanism of eligibility traces. In this paper we introduce an extension of eligibility traces to solve one of the challenging problems in reinforcement learning. The concept of opposition traces is proposed in this work to deal with large state space problems in reinforcement learning applications. We combine the idea of opposition and eligibility traces to construct the opposition-based Q(lambda). The results are compared with the conventional Watkins' Q(lambda) and reflect a remarkable performance increase.
引用
收藏
页码:254 / +
页数:2
相关论文
共 50 条
  • [1] Opposition-Based Adaptive Fireworks Algorithm
    Gong, Chibing
    ALGORITHMS, 2016, 9 (03):
  • [2] Opposition-based moth swarm algorithm
    Oliva, Diego
    Esquivel-Torres, Sara
    Hinojosa, Salvador
    Perez-Cisneros, Marco
    Osuna-Enciso, Valentin
    Ortega-Sanchez, Noe
    Dhiman, Gaurav
    Heidari, Ali Asghar
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
  • [3] Opposition-Based Whale Optimization Algorithm
    Alamri, Hammoudeh S.
    Alsariera, Yazan A.
    Zamli, Kamal Z.
    ADVANCED SCIENCE LETTERS, 2018, 24 (10) : 7461 - 7464
  • [4] An opposition-based algorithm for function optimization
    Seif, Z.
    Ahmadi, M. B.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 293 - 306
  • [5] The Opposition-based Harmony Search Algorithm
    Singh R.P.
    Mukherjee V.
    Ghoshal S.P.
    Mukherjee, V. (vivek_agamani@yahoo.com), 1600, Springer (94): : 247 - 256
  • [6] Opposition-Based Artificial Bee Colony Algorithm
    El-Abd, Mohammed
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 109 - 115
  • [7] Elite opposition-based flower pollination algorithm
    Zhou, Yongquan
    Wang, Rui
    Luo, Qifang
    NEUROCOMPUTING, 2016, 188 : 294 - 310
  • [8] Opposition-based Q(λ) with non-Markovian update
    Shokri, Maryam
    Tizhoosh, Hamid R.
    Kamel, Mohamed S.
    2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 288 - +
  • [9] Improved Clustering Algorithm with Adaptive Opposition-based Learning
    Meng, Qianqian
    Zhou, Lijuan
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 296 - 300
  • [10] Opposition-based learning in the shuffled differential evolution algorithm
    Morteza Alinia Ahandani
    Hosein Alavi-Rad
    Soft Computing, 2012, 16 : 1303 - 1337