Knowledge of opposite actions for reinforcement learning

被引:18
|
作者
Shokri, Maryam [1 ]
机构
[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada
关键词
Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight;
D O I
10.1016/j.asoc.2011.01.045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) is one of the machine intelligence techniques with several characteristics that make it suitable for solving real-world problems. However, RL agents generally face a very large state space in many applications. They must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of the off-policy, incremental, and model-free reinforcement learning with discrete state and action space. One of the components of RL agent is the action. For each action, its associate opposite action is defined. The actions and opposite actions are implemented in the framework of reinforcement learning to update the value function resulting in a faster convergence. The effects of opposite action on some of the reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:4097 / 4109
页数:13
相关论文
共 50 条
  • [31] Deep reinforcement learning with combinatorial actions spaces: An to maintenance
    Goby, Niklas
    Brandt, Tobias
    Neumann, Dirk
    COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 179
  • [32] Composing Synergistic Macro Actions for Reinforcement Learning Agents
    Chen, Yu-Ming
    Chang, Kaun-Yu
    Liu, Chien
    Hsiao, Tsu-Ching
    Hong, Zhang-Wei
    Lee, Chun-Yi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 7251 - 7258
  • [33] An acquiring method of macro-actions in reinforcement learning
    Yoshikawa, Takeshi
    Kurihara, Masahito
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4813 - +
  • [34] Acquiring various behaviors by isomorphism of actions in reinforcement learning
    Yamaguchi, T
    Nomura, Y
    Tanaka, Y
    Yachida, M
    INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 607 - 612
  • [35] Incremental Learning of Planning Actions in Model-Based Reinforcement Learning
    Ng, Jun Hao Alvin
    Petrick, Ronald P. A.
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3195 - 3201
  • [36] Dynamic Spectrum Access Based on Prior Knowledge Enabled Reinforcement Learning with Double Actions in Complex Electromagnetic Environment
    Linghui Zeng
    Fuqiang Yao
    Jianzhao Zhang
    Min Jia
    ChinaCommunications, 2022, 19 (07) : 13 - 24
  • [37] Dynamic spectrum access based on prior knowledge enabled reinforcement learning with double actions in complex electromagnetic environment
    Zeng, Linghui
    Yao, Fuqiang
    Zhang, Jianzhao
    Jia, Min
    CHINA COMMUNICATIONS, 2022, 19 (07) : 13 - 24
  • [38] Integrating Expert Knowledge into Fuzzy Reinforcement Learning
    Tompa, Tomas
    Kovacs, Szilveszter
    18TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS, SACI 2024, 2024, : 333 - 337
  • [39] Reinforcement Learning with Partial Parametric Model Knowledge
    Wang, Shuyuan
    Loewen, Philip D.
    Lawrence, Nathan P.
    Forbes, Michael G.
    Gopaluni, R. Bhushan
    IFAC PAPERSONLINE, 2023, 56 (02): : 8012 - 8017
  • [40] Knowledge guided fuzzy deep reinforcement learning
    Qin, Peng
    Zhao, Tao
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264