Dynamic programming with meta-reinforcement learning: a novel approach for multi-objective optimization

被引：1

作者：

Wang, Qi ^{[1
]}

Zhang, Chengwei ^{[1
]}

Hu, Bin ^{[2
]}

机构：

[1] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian, Peoples R China

[2] Hangzhou Dianzi Univ, Sch Comp Sci, Hangzhou, Peoples R China

来源：

COMPLEX & INTELLIGENT SYSTEMS | 2024年 / 10卷 / 04期

关键词：

Combinatorial optimization; Meta-learning; Reinforcement learning; Dynamic programming; PERSON REIDENTIFICATION;

D O I：

10.1007/s40747-024-01469-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-objective optimization (MOO) endeavors to identify optimal solutions from a finite array of possibilities. In recent years, deep reinforcement learning (RL) has exhibited promise through its well-crafted heuristics in tackling NP-hard combinatorial optimization (CO) problems. Nonetheless, current methodologies grapple with two key challenges: (1) They primarily concentrate on single-objective optimization quandaries, rendering them less adaptable to the more prevalent MOO scenarios encountered in real-world applications. (2) These approaches furnish an approximate solution by imbibing heuristics, lacking a systematic means to enhance or substantiate optimality. Given these challenges, this study introduces an overarching hybrid strategy, dynamic programming with meta-reinforcement learning (DPML), to resolve MOO predicaments. The approach melds meta-learning into an RL framework, addressing multiple subproblems inherent to MOO. Furthermore, the precision of solutions is elevated by endowing exact dynamic programming with the prowess of meta-graph neural networks. Empirical results substantiate the supremacy of our methodology over previous RL and heuristics approaches, bridging the chasm between theoretical underpinnings and real-world applicability within this domain.

引用

页码：5743 / 5758

页数：16

共 50 条

[41] Multi-condition multi-objective optimization using deep reinforcement learning
Kim, Sejin
Kim, Innyoung
You, Donghyun
JOURNAL OF COMPUTATIONAL PHYSICS, 2022, 462
[42] Multiagent Meta-Reinforcement Learning for Adaptive Multipath Routing Optimization
Chen, Long
Hu, Bin
Guan, Zhi-Hong
Zhao, Lian
Shen, Xuemin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5374 - 5386
[43] Multi-objective Meta-return Reinforcement Learning for Sequential Recommendation
Yu, Yemin
Kuang, Kun
Yang, Jiangchao
Wang, Zeke
Jia, Kunyang
Lu, Weiming
Yang, Hongxia
Wu, Fei
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 95 - 111
[44] Meta-Reinforcement Learning Algorithm Based on Reward and Dynamic Inference
Chen, Jinhao
Zhang, Chunhong
Hu, Zheng
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT III, PAKDD 2024, 2024, 14647 : 223 - 234
[45] Combining a Multi-Objective Optimization Approach with Meta-Learning for SVM Parameter Selection
de Miranda, Pericles B. C.
Prudencio, Ricardo B. C.
de Carvalho, Andre Carlos P. L. F.
Soares, Carlos
PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2909 - 2914
[46] A Novel Multi-Objective Target Value Optimization Approach
Wenzel, S.
Straatmann, S.
Kwiatkowski, L.
Schmelzer, P.
Kunert, J.
CLASSIFICATION AS A TOOL FOR RESEARCH, 2010, : 801 - 809
[47] Deep reinforcement learning for multi-objective combinatorial optimization: A case study on multi-objective traveling salesman problem
Li, Shicheng
Wang, Feng
He, Qi
Wang, Xujie
SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
[48] Model-Based Meta-reinforcement Learning for Hyperparameter Optimization
Albrechts, Jeroen
Martin, Hugo M.
Tavakol, Maryam
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2024, PT I, 2025, 15346 : 27 - 39
[49] Solution to multi-objective fuzzy optimization dynamic programming with uncertain information
Jin, YW
Shen, H
Li, KQ
Chi, ZX
PDCAT 2005: SIXTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2005, : 979 - 984
[50] Meta-Reinforcement Learning in Non-Stationary and Dynamic Environments
Bing, Zhenshan
Lerch, David
Huang, Kai
Knoll, Alois
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3476 - 3491

← 1 2 3 4 5 →