Viewing classifier systems as model free learning in POMDPs

被引:0
|
作者
Hayashi, A [1 ]
Suematsu, N [1 ]
机构
[1] Hiroshima City Univ, Fac Informat Sci, Asaminami Ku, Hiroshima 7313194, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classifier systems are now viewed disappointing because of their problems such as the rule strength vs rule set performance problem and the credit assignment problem. In order to solve the problems, we have developed a hybrid classifier system: GLS (Generalization Learning System). In designing GLS, we view CSs as model free learning in POMDPs and take a hybrid approach to finding the best generalization, given the total number of rules. GLS uses the policy improvement procedure by Jaakkola et al. for an locally optimal stochastic policy when a set of rule conditions is given. GLS uses GA to search for the best set of rule conditions.
引用
收藏
页码:989 / 995
页数:7
相关论文
共 50 条
  • [1] ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
    Zhao, Xuanle
    Zhang, Duzhen
    Han, Liyuan
    Zhang, Tielin
    Xu, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] Model-based online learning of POMDPs
    Shani, G
    Brafman, RI
    Shimony, SE
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 353 - 364
  • [3] Dynamic trading strategy learning model using learning classifier systems
    Liao, PY
    Chen, JS
    PROCEEDINGS OF THE 2001 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2001, : 783 - 789
  • [4] Learning Classifier Systems
    Butz, Martin V.
    GECCO-2010 COMPANION PUBLICATION: PROCEEDINGS OF THE 12TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2010, : 2331 - 2352
  • [5] Learning Classifier Systems
    Larry Bull
    Pler Luca Lanzi
    Wolfgang Stolzmann
    Soft Computing, 2002, 6 (3) : 143 - 143
  • [6] Using Coverage as a Model Building Constraint in Learning Classifier Systems
    Greene, David Perry
    Smith, Stephen F.
    EVOLUTIONARY COMPUTATION, 1994, 2 (01) : 67 - 91
  • [7] Learning classifier systems: a survey
    Sigaud, Olivier
    Wilson, Stewart W.
    SOFT COMPUTING, 2007, 11 (11) : 1065 - 1078
  • [8] Learning classifier systems: then and now
    Lanzi, Pier Luca
    EVOLUTIONARY INTELLIGENCE, 2008, 1 (01) : 63 - 82
  • [9] Symbiogenesis in learning classifier systems
    Tomlinson, A
    Bull, L
    ARTIFICIAL LIFE, 2001, 7 (01) : 33 - 61
  • [10] Learning classifier systems resources
    T. Kovacs
    Soft Computing, 2002, 6 (3) : 240 - 243