Knowledge of opposite actions for reinforcement learning

Cited by: 18
Authors
Shokri, Maryam [1]
Affiliations
[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada
Keywords
Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight
DOI
10.1016/j.asoc.2011.01.045
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Reinforcement learning (RL) is a machine intelligence technique with several characteristics that make it suitable for solving real-world problems. However, in many applications RL agents face a very large state space and must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of off-policy, incremental, model-free reinforcement learning with discrete state and action spaces. One of the components of an RL agent is the action. For each action, an associated opposite action is defined. Actions and opposite actions are used within the reinforcement learning framework to update the value function, resulting in faster convergence. The effects of opposite actions on several reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.
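The idea of updating the value function for both an action and its opposite can be illustrated with a small sketch. This is not the paper's OQ(lambda) or NOQ(lambda) algorithm, which the abstract does not specify; it is a plain one-step Q-learning update applied twice per visit. The corridor environment, the `OPPOSITE` map, and the function names `step`, `q_update`, and `opposition_update` are all hypothetical, and the sketch assumes the toy environment can be queried for the outcome of the opposite action, which a model-free agent would not generally be able to do.

```python
from collections import defaultdict

# Hypothetical 1-D corridor with states 0..4; moving right is "good".
ACTIONS = ("left", "right")
OPPOSITE = {"left": "right", "right": "left"}  # assumed opposite-action map
N_STATES = 5


def step(state, action):
    """Deterministic toy transition: +1 if the move advances right, else -1."""
    if action == "right":
        nxt = min(state + 1, N_STATES - 1)
    else:
        nxt = max(state - 1, 0)
    reward = 1.0 if nxt > state else -1.0
    return nxt, reward


def q_update(q, state, action, reward, nxt, alpha=0.5, gamma=0.9):
    """Standard one-step, off-policy Q-learning update."""
    best_next = max(q[(nxt, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


def opposition_update(q, state, action, alpha=0.5, gamma=0.9):
    """Update the taken action and its opposite from a single visit."""
    nxt, reward = step(state, action)
    q_update(q, state, action, reward, nxt, alpha, gamma)
    # Assumption: the opposite action's outcome is available from the toy
    # environment without the agent actually executing that action.
    opp = OPPOSITE[action]
    opp_nxt, opp_reward = step(state, opp)
    q_update(q, state, opp, opp_reward, opp_nxt, alpha, gamma)
    return nxt
```

Starting from an empty table `q = defaultdict(float)`, a single call `opposition_update(q, 2, "right")` fills in two entries for state 2 instead of one, which is the intuition behind the faster convergence the abstract describes.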
Pages: 4097 - 4109
Number of pages: 13
Related papers
50 in total
  • [1] Learning to Transform Service Instructions into Actions with Reinforcement Learning and Knowledge Base
    Zhang M.-Y.
    Tian G.-H.
    Li C.-C.
    Gong J.
    International Journal of Automation and Computing, 2018, 15 (5) : 582 - 592
  • [2] Learning to Transform Service Instructions into Actions with Reinforcement Learning and Knowledge Base (with video)
    MengYang Zhang
    GuoHui Tian
    CiCi Li
    Jing Gong
    International Journal of Automation and Computing, 2018, (05) : 582 - 592
  • [3] Reinforcement Learning with Multiple Actions
    Nishiyama, Riku
    Yamada, Satoshi
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES AND ENGINEERING SYSTEMS (ICITES2014), 2016, 345 : 207 - 213
  • [4] Reinforcement Learning with Parameterized Actions
    Masson, Warwick
    Ranchod, Pravesh
    Konidaris, George
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1934 - 1940
  • [5] Learning macro-actions in reinforcement learning
    Randlov, J
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 1045 - 1051
  • [6] Reinforcement learning using multiple actions
    Nakama, Hayato
    Asano, Tsubasa
    Yamada, Satoshi
    NEUROSCIENCE RESEARCH, 2010, 68 : E330 - E330
  • [7] Reinforcement learning with factored states and actions
    Sallans, B
    Hinton, GE
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 1063 - 1088
  • [8] Generalization to New Actions in Reinforcement Learning
    Jain, Ayush
    Szot, Andrew
    Lim, Joseph J.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [9] Using Combination of Actions in Reinforcement Learning
    Karanik, Marcelo J.
    Gramajo, Sergio D.
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2010, 10 (01): : 19 - 23
  • [10] Offline reinforcement learning with representations for actions
    Lou, Xingzhou
    Yin, Qiyue
    Zhang, Junge
    Yu, Chao
    He, Zhaofeng
    Cheng, Nengjie
    Huang, Kaiqi
    INFORMATION SCIENCES, 2022, 610 : 746 - 758