Faster MIL-based Subgoal Identification for Reinforcement Learning by Tuning Fewer Hyperparameters

Cited by: 0

Authors:

Sunel, Saim [1]

Cilden, Erkin [2]

Polat, Faruk [1]

Affiliations:

[1] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye

[2] STM Def Technol Engn & Trade Inc, RF & Simulat Syst Directorate, Ankara, Turkiye

Source:

ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS | 2024 / Vol. 19 / No. 02

Keywords:

Subgoal identification; expectation-maximization; diverse density; hyper-parameter search; multiple instance learning; reinforcement learning; DISCOVERY; FRAMEWORK;

DOI:

10.1145/3643852

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Various methods have been proposed in the literature for identifying subgoals in discrete reinforcement learning (RL) tasks. Once subgoals are discovered, task decomposition methods can be employed to improve the learning performance of agents. In this study, we classify prominent subgoal identification methods for discrete RL tasks in the literature into the following three categories: graph-based, statistics-based, and multi-instance learning (MIL)-based. As contributions, first, we introduce a new MIL-based subgoal identification algorithm called EMDD-RL and experimentally compare it with a previous MIL-based method. The previous approach adapts MIL's Diverse Density (DD) algorithm, whereas our method builds on Expectation-Maximization Diverse Density (EMDD). The advantage of EMDD over DD is that it can yield more accurate results with lower computational demand, thanks to the expectation-maximization algorithm. EMDD-RL modifies some of the algorithmic steps of EMDD to identify subgoals in discrete RL problems. Second, we evaluate the methods in several RL tasks for the hyperparameter tuning overhead they incur. Third, we propose a new RL problem called key-room and compare the methods for their subgoal identification performance in this new task. Experimental results show that MIL-based subgoal identification methods could be preferred to the algorithms of the other two categories in practice.

Pages: 29

