Solving Continual Combinatorial Selection via Deep Reinforcement Learning

被引:0
|
作者
Song, Hyungseok [1 ]
Jang, Hyeryung [2 ]
Tran, Hai H. [1 ]
Yoon, Se-eun [1 ]
Son, Kyunghwan [1 ]
Yun, Donggyu [3 ]
Chung, Hyoju [3 ]
Yi, Yung [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea
[2] Kings Coll London, Informat, London, England
[3] Naver Corp, Seongnam, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the Markov Decision Process (MDP) of selecting a subset of items at each step, termed the Select-MDP (S-MDP). The large state and action spaces of S-MDPs make them intractable to solve with typical reinforcement learning (RL) algorithms especially when the number of items is huge. In this paper, we present a deep RL algorithm to solve this issue by adopting the following key ideas. First, we convert the original S-MDP into an Iterative Select-MDP (IS-MDP), which is equivalent to the S-MDP in terms of optimal actions. IS-MDP decomposes a joint action of selecting K items simultaneously into K iterative selections resulting in the decrease of actions at the expense of an exponential increase of states. Second, we overcome this state space explosion by exploiting a special symmetry in IS -MDPs with novel weight shared Q-networks, which provably maintain sufficient expressive power. Various experiments demonstrate that our approach works well even when the item space is large and that it scales to environments with item spaces different from those used in training.
引用
收藏
页码:3467 / 3474
页数:8
相关论文
共 50 条
  • [1] Continual portfolio selection in dynamic environments via incremental reinforcement learning
    Shu Liu
    Bo Wang
    Huaxiong Li
    Chunlin Chen
    Zhi Wang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 269 - 279
  • [2] Continual portfolio selection in dynamic environments via incremental reinforcement learning
    Liu, Shu
    Wang, Bo
    Li, Huaxiong
    Chen, Chunlin
    Wang, Zhi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (01) : 269 - 279
  • [3] A general deep reinforcement learning hyperheuristic framework for solving combinatorial optimization problems
    Kallestad, Jakob
    Hasibi, Ramin
    Hemmati, Ahmad
    Soerensen, Kenneth
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 309 (01) : 446 - 468
  • [4] LOSS OF PLASTICITY IN CONTINUAL DEEP REINFORCEMENT LEARNING
    Abbas, Zaheer
    Zhao, Rosie
    Modayil, Joseph
    White, Adam
    Machado, Marlos C.
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 620 - 636
  • [5] Solving the train dispatching problem via deep reinforcement learning
    Agasucci, Valerio
    Grani, Giorgio
    Lamorgese, Leonardo
    JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2023, 26
  • [6] Leveraging Transfer Learning in Deep Reinforcement Learning for Solving Combinatorial Optimization Problems Under Uncertainty
    Ezzahra Achamrah, Fatima
    IEEE ACCESS, 2024, 12 : 181477 - 181497
  • [7] Solving Maximal Stable Set Problem via Deep Reinforcement Learning
    Wang, Taiyi
    Shi, Jiahao
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 483 - 489
  • [8] Solving job shop scheduling problems via deep reinforcement learning
    Yuan, Erdong
    Cheng, Shuli
    Wang, Liejun
    Song, Shiji
    Wu, Fang
    APPLIED SOFT COMPUTING, 2023, 143
  • [9] MathDQN: Solving Arithmetic Word Problems via Deep Reinforcement Learning
    Wang, Lei
    Zhang, Dongxiang
    Gao, Lianli
    Song, Jingkuan
    Guo, Long
    Shen, Heng Tao
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5545 - 5552
  • [10] Solving combinatorial optimization problems over graphs with BERT-Based Deep Reinforcement Learning
    Wang, Qi
    Lai, Kenneth H.
    Tang, Chunlei
    INFORMATION SCIENCES, 2023, 619 : 930 - 946