Applying Expert Heuristic as an a Priori Knowledge for FRIQ-Learning

Cited by: 0
Authors
Tompa, Tamas [1]
Kovacs, Szilveszter [1]
Affiliations
[1] Univ Miskolc, Dept Informat Technol, H-3515 Miskolc, Miskolc, Hungary
Keywords
Reinforcement Learning; Heuristically Accelerated Reinforcement Learning; Fuzzy Rule Interpolation; Q-Learning; FRIQ-Learning
DOI
Not available
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Many Reinforcement Learning methods start the learning phase from an empty or randomly filled knowledge-base. When some a priori knowledge is available about how the studied system could be controlled, e.g. in the form of state-action control rules, the convergence speed of the learning process can be significantly improved. In this case, the learning stage can start from a sketch, that is, from a knowledge-base built upon the already existing knowledge. In this paper, the a priori (expert) knowledge is considered to be given in the form of state-action fuzzy control rules of a Fuzzy Rule Interpolation (FRI) reasoning model, and the studied reinforcement learning method is restricted to a Fuzzy Rule Interpolation-based Q-Learning (FRIQ-Learning) method. The main goal of this paper is to introduce a methodology suitable for merging the a priori state-action fuzzy control rule-base into the initial state-action-value function (Q-function) representation. To demonstrate the benefits of the suggested methodology, the a priori knowledge-base accelerated FRIQ-Learning solution of the "mountain car" benchmark is also discussed briefly in the paper.
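The general idea of starting Q-learning from expert state-action rules instead of an empty table can be illustrated with a minimal sketch. This is not the paper's method: it uses a plain tabular Q-function rather than a Fuzzy Rule Interpolation rule-base, and the names `init_q_from_rules`, `expert_rules`, and `prior_value`, as well as the toy state and action indices, are hypothetical choices made only for illustration.

```python
from typing import Dict, List


def init_q_from_rules(n_states: int, n_actions: int,
                      expert_rules: Dict[int, int],
                      prior_value: float = 1.0) -> List[List[float]]:
    """Build an initial Q-table that favours the expert's action.

    Assumption: states and actions are small discrete indices, and each
    expert rule maps one state to one preferred action.
    """
    q = [[0.0] * n_actions for _ in range(n_states)]
    for state, action in expert_rules.items():
        # Seed the expert's choice with a positive initial value.
        q[state][action] = prior_value
    return q


def greedy_action(q: List[List[float]], state: int) -> int:
    """Return the action with the highest Q-value (first one on ties)."""
    row = q[state]
    return max(range(len(row)), key=row.__getitem__)


def q_update(q: List[List[float]], s: int, a: int, reward: float,
             s_next: int, alpha: float = 0.1, gamma: float = 0.9) -> None:
    """Standard one-step Q-learning update, applied in place."""
    target = reward + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


# Hypothetical toy setup: 4 states, 3 actions, expert rules for 3 states.
expert_rules = {0: 2, 1: 2, 3: 0}
q = init_q_from_rules(4, 3, expert_rules)
```

With this initialisation, a greedy policy already follows the expert in the covered states, while uncovered states (state 2 here) start from zero and are learned through the usual updates.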
Pages: 27-45
Number of pages: 19
Related papers
50 in total
  • [1] The Pong game implementation with the FRIQ-learning reinforcement learning algorithm
    Tompa, Tamas
    Vincze, David
    Kovacs, Szilveszter
    2015 16TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2015, : 542 - 547
  • [2] Q-learning vs. FRIQ-learning in the Maze problem
    Tompa, Tamas
    Kovacs, Szilveszter
    2015 6TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2015, : 545 - 550
  • [3] Clustering-based fuzzy knowledgebase reduction in the FRIQ-learning
    Tompa, Tamas
    Kovacs, Szilveszter
    2017 IEEE 15TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2017, : 197 - 200
  • [4] Determining the minimally allowed rule-distance for the incremental rule-base construction phase of the FRIQ-learning
    Tompa, Tamas
    Kovacs, Szilveszter
    2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2018, : 480 - 483
  • [5] Incorporating A-priori expert knowledge in genetic algorithms
    AkbarzadehT, MR
    Jamshidi, M
    1997 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN ROBOTICS AND AUTOMATION - CIRA '97, PROCEEDINGS: TOWARDS NEW COMPUTATIONAL PRINCIPLES FOR ROBOTICS AND AUTOMATION, 1997, : 300 - 305
  • [6] ANALOGY AND DEEP KNOWLEDGE AS A HEURISTIC FOR AN EXPERT SYSTEM
    BRUNEAU, L
    ARTIFICIAL INTELLIGENCE IN SCIENTIFIC COMPUTATION : TOWARDS SECOND GENERATION SYSTEMS, 1989, 2 : 197 - 201
  • [7] LANGUAGE-LEARNING AND A PRIORI KNOWLEDGE
    EDIDIN, A
    AMERICAN PHILOSOPHICAL QUARTERLY, 1986, 23 (04) : 383 - 391
  • [8] Learning plans without a priori knowledge
    Sun, R
    Sessions, C
    ADAPTIVE BEHAVIOR, 2001, 8 (3-4) : 225 - 253
  • [9] Embedding a Priori Knowledge in Reinforcement Learning
    Carlos H. C. Ribeiro
    Journal of Intelligent and Robotic Systems, 1998, 21 : 51 - 71
  • [10] Embedding a priori knowledge in reinforcement learning
    Ribeiro, Carlos H.C.
    Journal of Intelligent and Robotic Systems: Theory and Applications, 1998, 21 (01) : 51 - 71