Faster MIL-based Subgoal Identification for Reinforcement Learning by Tuning Fewer Hyperparameters

被引:0
|
作者
Sunel, Saim [1 ]
Cilden, Erkin [2 ]
Polat, Faruk [1 ]
机构
[1] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye
[2] STM Def Technol Engn & Trade Inc, RF & Simulat Syst Directorate, Ankara, Turkiye
关键词
Subgoal identification; expectation-maximization; diverse density; hyper-parameter search; multiple instance learning; reinforcement learning; DISCOVERY; FRAMEWORK;
D O I
10.1145/3643852
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variousmethods have been proposed in the literature for identifying subgoals in discrete reinforcement learning (RL) tasks. Once subgoals are discovered, task decomposition methods can be employed to improve the learning performance of agents. In this study, we classify prominent subgoal identification methods for discrete RL tasks in the literature into the following three categories: graph-based, statistics-based, and multi-instance learning (MIL)-based. As contributions, first, we introduce a newMIL-based subgoal identification algorithm called EMDD-RL and experimentally compare it with a previous MIL-based method. The previous approach adapts MIL's Diverse Density (DD) algorithm, whereas our method considers Expected-Maximization Diverse Density (EMDD). The advantage of EMDD over DD is that it can yield more accurate results with less computation demand thanks to the expectation-maximization algorithm. EMDD-RL modifies some of the algorithmic steps of EMDD to identify subgoals in discrete RL problems. Second, we evaluate the methods in several RL tasks for the hyperparameter tuning overhead they incur. Third, we propose a new RL problem called key-room and compare the methods for their subgoal identification performances in this new task. Experiment results show that MIL-based subgoal identification methods could be preferred to the algorithms of the other two categories in practice.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] APT: Adaptive Perceptual quality based camera Tuning using reinforcement learning
    Paul, Sibendu
    Rao, Kunal
    Coviello, Giuseppe
    Sankaradas, Murugan
    Po, Oliver
    Hu, Y. Charlie
    Chakradhar, Srimat
    2022 9TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS: SYSTEMS, MANAGEMENT AND SECURITY, IOTSMS, 2022, : 176 - 184
  • [32] Automated performance tuning of distributed storage system based on deep reinforcement learning
    Wang, Lu
    Zhang, Wentao
    Cheng, Yaodong
    19TH INTERNATIONAL WORKSHOP ON ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH, 2020, 1525
  • [33] ProRLearn: boosting prompt tuning-based vulnerability detection by reinforcement learning
    Ren, Zilong
    Ju, Xiaolin
    Chen, Xiang
    Shen, Hao
    AUTOMATED SOFTWARE ENGINEERING, 2024, 31 (02)
  • [34] Parameters tuning of multi-model database based on deep reinforcement learning
    Feng Ye
    Yang Li
    Xiwen Wang
    Nadia Nedjah
    Peng Zhang
    Hong Shi
    Journal of Intelligent Information Systems, 2023, 61 : 167 - 190
  • [35] Automatic focal EEG identification based on deep reinforcement learning
    Liu, Xinyu
    Ding, Xin
    Liu, Jianping
    Nie, Weiwei
    Yuan, Qi
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 83
  • [36] Fault Identification in Power Network Based on Deep Reinforcement Learning
    Li, Mengshi
    Zhang, Huanming
    Ji, Tianyao
    Wu, Q. H.
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2022, 8 (03): : 721 - 731
  • [37] Combining system identification with reinforcement learning-based MPC
    Martinsen, Andreas B.
    Lekkas, Anastasios M.
    Gros, Sebastien
    IFAC PAPERSONLINE, 2020, 53 (02): : 8130 - 8135
  • [38] Deep Reinforcement Learning-Based Carrier Tuning Algorithm for Mobile Communication Networks
    Zhang, Weimin
    Zhao, Xinying
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (09) : 371 - 381
  • [39] Reinforcement Learning Based Parameter Adaptive Tuning for Electric Power Data Storage System
    Tu, Zijian
    Mao, Yingchi
    Wu, Mingbo
    Chen, Yu
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2022, 46 (04): : 112 - 122
  • [40] Workload-Aware Performance Tuning for Multimodel Databases Based on Deep Reinforcement Learning
    Sun, Jun
    Ye, Feng
    Nedjah, Nadia
    Zhang, Ming
    Xu, Dong
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023