Knowledge of opposite actions for reinforcement learning

被引:18
|
作者
Shokri, Maryam [1 ]
机构
[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada
关键词
Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight;
D O I
10.1016/j.asoc.2011.01.045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) is one of the machine intelligence techniques with several characteristics that make it suitable for solving real-world problems. However, RL agents generally face a very large state space in many applications. They must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of the off-policy, incremental, and model-free reinforcement learning with discrete state and action space. One of the components of RL agent is the action. For each action, its associate opposite action is defined. The actions and opposite actions are implemented in the framework of reinforcement learning to update the value function resulting in a faster convergence. The effects of opposite action on some of the reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:4097 / 4109
页数:13
相关论文
共 50 条
  • [21] Integrating symbolic knowledge in reinforcement learning
    Hailu, G
    Sommer, G
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 1491 - 1496
  • [22] From Reinforcement Learning to Knowledge of Nature
    V. G. Red’ko
    Pattern Recognition and Image Analysis, 2023, 33 : 478 - 482
  • [23] Knowledge Gradient for Online Reinforcement Learning
    Yahyaa, Saba
    Manderick, Bernard
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2014, 2015, 8946 : 103 - 118
  • [24] Embedding a Priori Knowledge in Reinforcement Learning
    Carlos H. C. Ribeiro
    Journal of Intelligent and Robotic Systems, 1998, 21 : 51 - 71
  • [25] From Reinforcement Learning to Knowledge of Nature
    Red'ko, V. G.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2023, 33 (03) : 478 - 482
  • [26] Embedding a priori knowledge in reinforcement learning
    Ribeiro, Carlos H.C.
    Journal of Intelligent and Robotic Systems: Theory and Applications, 1998, 21 (01): : 51 - 71
  • [27] Embedding a priori knowledge in reinforcement learning
    Ribeiro, CHC
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1998, 21 (01) : 51 - 71
  • [28] Reinforcement Learning Control With Knowledge Shaping
    Gao, Xiang
    Si, Jennie
    Huang, He
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3156 - 3167
  • [29] Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing
    Delarue, Arthur
    Anderson, Ross
    Tjandraatmadja, Christian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [30] Reinforcement Learning When All Actions Are Not Always Available
    Chandak, Yash
    Theocharous, Georgios
    Metevier, Blossom
    Thomas, Philip S.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3381 - 3388