Applying Expert Heuristic as an a Priori Knowledge for FRIQ-Learning

被引:0
|
作者
Tompa, Tamas [1 ]
Kovacs, Szilveszter [1 ]
机构
[1] Univ Miskolc, Dept Informat Technol, H-3515 Miskolc, Miskolc, Hungary
关键词
Reinforcement Learning; Heuristically Accelerated Reinforcement Learning; Fuzzy Rule Interpolation; Q-Learning; FRIQ-Learning;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Many Reinforcement Learning methods start the learning phase from an empty, or randomly filled knowledge-base. Having some a priori knowledge about the way as the studied system could be controlled, e.g. in the form of some state-action control rules, the convergence speed of the learning process can be significantly improved. In this case, the learning stage could start from a sketch, from a knowledge-base formed based upon the already existing knowledge. In this paper. the a priori (expert) knowledge is considered to be given in the form state-action fuzzy control rules of a Fuzzy Rule Interpolation (FRI) reasoning model and the studied reinforcement learning method is restricted to be a Fuzzy Rule Interpolation-based Q-Learning (FRIQ-Learning) method. The main goal of this paper is the introduction of a methodology, which is suitable for merging the a priori state-action fuzzy control rule-base to the initial state-action-value function (Q-function) representation. For demonstrating the benefits of the suggested methodology, the a priori knowledge-base accelerated FRIQ-Learning solution of the "mountain car" benchmark is also discussed briefly in the paper.
引用
收藏
页码:27 / 45
页数:19
相关论文
共 50 条
  • [31] Q-learning with fuzzy priori knowledge and application in Robot Soccer
    Du, Chun-Xia
    Meng, Qing-Chun
    Gao, Yun
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2004, 21 (SUPPL.): : 68 - 72
  • [32] Iterative learning control design without a priori knowledge of control directions
    Xu, JX
    Yan, R
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 3661 - 3666
  • [33] Iterative learning control design without a priori knowledge of the control direction
    Xu, JX
    Yan, R
    AUTOMATICA, 2004, 40 (10) : 1803 - 1809
  • [34] ASK-the-Expert: Active Learning Based Knowledge Discovery Using the Expert
    Das, Kamalika
    Avrekh, Ilya
    Matthews, Bryan
    Sharma, Manali
    Oza, Nikunj
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT III, 2017, 10536 : 395 - 399
  • [35] Expert Based Learning (EXBL) Methodology for Developing Mobile Expert Learning Knowledge Management Software System
    Ebibi, Mirlinda
    Fetaji, Bekim
    Fetaji, Majlinda
    TECHNICS TECHNOLOGIES EDUCATION MANAGEMENT-TTEM, 2012, 7 (02): : 864 - 874
  • [36] Mapping adaptive capacity and smallholder agriculture: applying expert knowledge at the landscape scale
    Holland, Margaret Buck
    Shamer, Sierra Zaid
    Imbach, Pablo
    Carlos Zamora, Juan
    Medellin Moreno, Claudia
    Leguia Hidalgo, Efrain J.
    Donatti, Camila I.
    Martinez-Rodriguez, M. Ruth
    Harvey, Celia A.
    CLIMATIC CHANGE, 2017, 141 (01) : 139 - 153
  • [38] Mapping adaptive capacity and smallholder agriculture: applying expert knowledge at the landscape scale
    Margaret Buck Holland
    Sierra Zaid Shamer
    Pablo Imbach
    Juan Carlos Zamora
    Claudia Medellin Moreno
    Efraín J. Leguía Hidalgo
    Camila I. Donatti
    M. Ruth Martínez-Rodríguez
    Celia A. Harvey
    Climatic Change, 2017, 141 : 139 - 153
  • [39] Correcting flawed expert knowledge through reinforcement learning
    Aihe, David O.
    Gonzalez, Avelino J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (17-18) : 6457 - 6471
  • [40] Expert knowledge and Supervised learning of rules: Application to Echinoderms
    Ben Nasr, Ines
    Borgi, Amel
    Sellem, Feriel
    2013 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2013, : 300 - 305