Knowledge of opposite actions for reinforcement learning

被引：18

作者：

Shokri, Maryam ^{[1
]}

机构：

[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada

来源：

APPLIED SOFT COMPUTING | 2011年 / 11卷 / 06期

关键词：

Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight;

D O I：

10.1016/j.asoc.2011.01.045

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) is one of the machine intelligence techniques with several characteristics that make it suitable for solving real-world problems. However, RL agents generally face a very large state space in many applications. They must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of the off-policy, incremental, and model-free reinforcement learning with discrete state and action space. One of the components of RL agent is the action. For each action, its associate opposite action is defined. The actions and opposite actions are implemented in the framework of reinforcement learning to update the value function resulting in a faster convergence. The effects of opposite action on some of the reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.

引用

页码：4097 / 4109

页数：13

共 50 条

[31] Deep reinforcement learning with combinatorial actions spaces: An to maintenance
Goby, Niklas
Brandt, Tobias
Neumann, Dirk
COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 179
[32] Composing Synergistic Macro Actions for Reinforcement Learning Agents
Chen, Yu-Ming
Chang, Kaun-Yu
Liu, Chien
Hsiao, Tsu-Ching
Hong, Zhang-Wei
Lee, Chun-Yi
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 7251 - 7258
[33] An acquiring method of macro-actions in reinforcement learning
Yoshikawa, Takeshi
Kurihara, Masahito
2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4813 - +
[34] Acquiring various behaviors by isomorphism of actions in reinforcement learning
Yamaguchi, T
Nomura, Y
Tanaka, Y
Yachida, M
INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 607 - 612
[35] Incremental Learning of Planning Actions in Model-Based Reinforcement Learning
Ng, Jun Hao Alvin
Petrick, Ronald P. A.
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3195 - 3201
[36] Dynamic Spectrum Access Based on Prior Knowledge Enabled Reinforcement Learning with Double Actions in Complex Electromagnetic Environment
Linghui Zeng
Fuqiang Yao
Jianzhao Zhang
Min Jia
ChinaCommunications, 2022, 19 (07) : 13 - 24
[37] Dynamic spectrum access based on prior knowledge enabled reinforcement learning with double actions in complex electromagnetic environment
Zeng, Linghui
Yao, Fuqiang
Zhang, Jianzhao
Jia, Min
CHINA COMMUNICATIONS, 2022, 19 (07) : 13 - 24
[38] Integrating Expert Knowledge into Fuzzy Reinforcement Learning
Tompa, Tomas
Kovacs, Szilveszter
18TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS, SACI 2024, 2024, : 333 - 337
[39] Reinforcement Learning with Partial Parametric Model Knowledge
Wang, Shuyuan
Loewen, Philip D.
Lawrence, Nathan P.
Forbes, Michael G.
Gopaluni, R. Bhushan
IFAC PAPERSONLINE, 2023, 56 (02): : 8012 - 8017
[40] Knowledge guided fuzzy deep reinforcement learning
Qin, Peng
Zhao, Tao
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264

← 1 2 3 4 5 →