Hierarchical reinforcement learning based on macro actions

Cited by: 0

Authors:

Hao Jiang [1]

Gongju Wang [2]

Shengze Li [1]

Jieyuan Zhang [1]

Long Yan [2]

Xinhai Xu [1]

Affiliations:

[1] Chinese Academy of Military Science, Data Intelligence Division

[2] China Unicom Digital Technology Co

Source:

Complex & Intelligent Systems | 2025 / Volume 11 / Issue 6

Keywords:

Hierarchical reinforcement learning; Macro action mapping model; Combat and non-combat macro actions; Rule-based execution logic;

DOI:

10.1007/s40747-025-01895-9

Abstract:

A large action space is a key challenge in reinforcement learning. Although hierarchical methods have proven effective in addressing this issue, they remain underexplored. This paper combines domain knowledge with hierarchical concepts to propose a novel Hierarchical Reinforcement Learning framework based on Macro Actions (HRL-MA). The framework includes a macro action mapping model that abstracts sequences of micro actions into macro actions, thereby simplifying the decision-making process. Macro actions are divided into two categories: combat macro actions (CMA) and non-combat macro actions (NO-CMA). NO-CMA are driven by decision-tree-based logical rules and establish the conditions under which CMA can be executed. CMA form the action space of the reinforcement learning algorithm, which dynamically selects actions based on the current state. Comprehensive tests on the StarCraft II maps Simple64 and AbyssalReefLE show that HRL-MA achieves higher win rates than baseline algorithms. In mini-game scenarios, HRL-MA also consistently outperforms baseline algorithms in reward scores. These findings highlight the effectiveness of integrating hierarchical structures and macro actions in reinforcement learning for managing complex decision-making tasks in environments with large action spaces.
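
The division of labor described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the macro and micro action names, the `State` fields, the resource thresholds, the `army_size > 0` precondition, and the epsilon-greedy selector over a Q-value table are all assumptions chosen for illustration. The sketch only shows the general shape: a mapping from macro actions to micro-action sequences, a hand-written rule tree for non-combat macro actions, and a learned choice restricted to combat macro actions.

```python
import random
from dataclasses import dataclass

# Hypothetical macro-to-micro mapping; every name here is illustrative,
# not taken from the paper.
MACRO_TO_MICRO = {
    "build_barracks": ["select_worker", "choose_barracks", "place_building"],
    "train_marine": ["select_barracks", "queue_marine"],
    "attack_enemy_base": ["select_army", "issue_attack_move"],
    "defend_base": ["select_army", "move_to_base"],
}

# Combat macro actions (CMA): the only actions the learner chooses between.
CMA = ["attack_enemy_base", "defend_base"]


@dataclass
class State:
    """Toy game state; the fields are assumptions for this sketch."""
    minerals: int
    barracks: int
    army_size: int


def non_combat_rule(state):
    """Hand-written rule tree picking a non-combat macro action, or None."""
    if state.barracks == 0:
        return "build_barracks" if state.minerals >= 150 else None
    if state.minerals >= 50:
        return "train_marine"
    return None


def select_combat_action(q_values, epsilon, rng):
    """Epsilon-greedy choice over combat macro actions only."""
    if rng.random() < epsilon:
        return rng.choice(CMA)
    return max(CMA, key=lambda a: q_values.get(a, 0.0))


def step(state, q_values, epsilon=0.1, rng=random):
    """Return the micro-action sequence for one decision step."""
    macros = []
    rule_action = non_combat_rule(state)
    if rule_action is not None:
        macros.append(rule_action)
    # Assumed precondition: combat actions are only available once an army exists.
    if state.army_size > 0:
        macros.append(select_combat_action(q_values, epsilon, rng))
    # Expand each macro action into its micro-action sequence.
    return [micro for m in macros for micro in MACRO_TO_MICRO[m]]
```

In this sketch the learner never sees the micro actions or the non-combat macro actions, so its action space has only two entries. That reduction is the point the abstract makes about macro actions, although the paper's actual action sets and learning algorithm may differ.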
