Integrating guidance into relational reinforcement learning

Cited by: 55
Authors
Driessens, K
Dzeroski, S
Affiliations
[1] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Heverlee, Belgium
[2] Jozef Stefan Inst, Dept Intelligent Syst, SI-1000 Ljubljana, Slovenia
Keywords
reinforcement learning; relational learning; guided exploration
DOI
10.1023/B:MACH.0000039779.47329.3a
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning, and Q-learning in particular, encounter two major problems when dealing with large state spaces. First, learning the Q-function in tabular form may be infeasible because of the excessive amount of memory needed to store the table, and because the Q-function only converges after each state has been visited multiple times. Second, rewards in the state space may be so sparse that with random exploration they will only be discovered extremely slowly. The first problem is often solved by learning a generalization of the encountered examples (e.g., using a neural net or decision tree). Relational reinforcement learning (RRL) is such an approach; it makes Q-learning feasible in structural domains by incorporating a relational learner into Q-learning. The problem of sparse rewards has not been addressed for RRL. This paper presents a solution based on the use of "reasonable policies" to provide guidance. Different types of policies and different strategies to supply guidance through these policies are discussed and evaluated experimentally in several relational domains to show the merits of the approach.
Pages: 271-304
Page count: 34
Related papers
50 in total
  • [21] ADAPTIVE GUIDANCE WITH REINFORCEMENT META LEARNING
    Gaudet, Brian
    Linares, Richard
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 4091 - 4109
  • [22] Building relational world models for reinforcement learning
    Walker, Trevor
    Torrey, Lisa
    Shavlik, Jude
    Maclin, Richard
    INDUCTIVE LOGIC PROGRAMMING, 2008, 4894 : 280 - +
  • [23] Interactive relational reinforcement learning of concept semantics
    Nickles, Matthias
    Rettinger, Achim
    MACHINE LEARNING, 2014, 94 (02) : 169 - 204
  • [25] Guiding inference through relational reinforcement learning
    Asgharbeygi, N
    Nejati, N
    Langley, P
    Arai, S
    INDUCTIVE LOGIC PROGRAMMING, PROCEEDINGS, 2005, 3625 : 20 - 37
  • [26] Relational Reinforcement Learning for Planning with Exogenous Effects
    Martinez, David
    Alenya, Guillem
    Ribeiro, Tony
    Inoue, Katsumi
    Torras, Carme
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [27] Relational reinforcement learning for agents in worlds with objects
    Dzeroski, S
    ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS: ADAPTATION AND MULTI-AGENT LEARNING, 2003, 2636 : 306 - 322
  • [28] Reinforcement learning guidance law of Q-learning
    Zhang Q.
    Ao B.
    Zhang Q.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (02) : 414 - 419
  • [29] Integrating symbolic knowledge in reinforcement learning
    Hailu, G
    Sommer, G
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 1491 - 1496
  • [30] Relational Reinforcement Learning applied to Shared Attention
    da Silva, Renato R.
    Policastro, Claudio A.
    Romero, Roseli A. F.
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2009, : 1074 - 1080