Solving Continual Combinatorial Selection via Deep Reinforcement Learning

被引：0

作者：

Song, Hyungseok ^{[1
]}

Jang, Hyeryung ^{[2
]}

Tran, Hai H. ^{[1
]}

Yoon, Se-eun ^{[1
]}

Son, Kyunghwan ^{[1
]}

Yun, Donggyu ^{[3
]}

Chung, Hyoju ^{[3
]}

Yi, Yung ^{[1
]}

机构：

[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea

[2] Kings Coll London, Informat, London, England

[3] Naver Corp, Seongnam, South Korea

来源：

PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2019年

基金：

新加坡国家研究基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider the Markov Decision Process (MDP) of selecting a subset of items at each step, termed the Select-MDP (S-MDP). The large state and action spaces of S-MDPs make them intractable to solve with typical reinforcement learning (RL) algorithms especially when the number of items is huge. In this paper, we present a deep RL algorithm to solve this issue by adopting the following key ideas. First, we convert the original S-MDP into an Iterative Select-MDP (IS-MDP), which is equivalent to the S-MDP in terms of optimal actions. IS-MDP decomposes a joint action of selecting K items simultaneously into K iterative selections resulting in the decrease of actions at the expense of an exponential increase of states. Second, we overcome this state space explosion by exploiting a special symmetry in IS -MDPs with novel weight shared Q-networks, which provably maintain sufficient expressive power. Various experiments demonstrate that our approach works well even when the item space is large and that it scales to environments with item spaces different from those used in training.

引用

页码：3467 / 3474

页数：8

共 50 条

[1] Continual portfolio selection in dynamic environments via incremental reinforcement learning
Shu Liu
Bo Wang
Huaxiong Li
Chunlin Chen
Zhi Wang
International Journal of Machine Learning and Cybernetics, 2023, 14 : 269 - 279
[2] Continual portfolio selection in dynamic environments via incremental reinforcement learning
Liu, Shu
Wang, Bo
Li, Huaxiong
Chen, Chunlin
Wang, Zhi
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (01) : 269 - 279
[3] A general deep reinforcement learning hyperheuristic framework for solving combinatorial optimization problems
Kallestad, Jakob
Hasibi, Ramin
Hemmati, Ahmad
Soerensen, Kenneth
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 309 (01) : 446 - 468
[4] LOSS OF PLASTICITY IN CONTINUAL DEEP REINFORCEMENT LEARNING
Abbas, Zaheer
Zhao, Rosie
Modayil, Joseph
White, Adam
Machado, Marlos C.
CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 620 - 636
[5] Solving the train dispatching problem via deep reinforcement learning
Agasucci, Valerio
Grani, Giorgio
Lamorgese, Leonardo
JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2023, 26
[6] Leveraging Transfer Learning in Deep Reinforcement Learning for Solving Combinatorial Optimization Problems Under Uncertainty
Ezzahra Achamrah, Fatima
IEEE ACCESS, 2024, 12 : 181477 - 181497
[7] Solving Maximal Stable Set Problem via Deep Reinforcement Learning
Wang, Taiyi
Shi, Jiahao
ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 483 - 489
[8] Solving job shop scheduling problems via deep reinforcement learning
Yuan, Erdong
Cheng, Shuli
Wang, Liejun
Song, Shiji
Wu, Fang
APPLIED SOFT COMPUTING, 2023, 143
[9] MathDQN: Solving Arithmetic Word Problems via Deep Reinforcement Learning
Wang, Lei
Zhang, Dongxiang
Gao, Lianli
Song, Jingkuan
Guo, Long
Shen, Heng Tao
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5545 - 5552
[10] Solving combinatorial optimization problems over graphs with BERT-Based Deep Reinforcement Learning
Wang, Qi
Lai, Kenneth H.
Tang, Chunlei
INFORMATION SCIENCES, 2023, 619 : 930 - 946

← 1 2 3 4 5 →