Dynamic programming with meta-reinforcement learning: a novel approach for multi-objective optimization

被引:1
|
作者
Wang, Qi [1 ]
Zhang, Chengwei [1 ]
Hu, Bin [2 ]
机构
[1] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci, Hangzhou, Peoples R China
关键词
Combinatorial optimization; Meta-learning; Reinforcement learning; Dynamic programming; PERSON REIDENTIFICATION;
D O I
10.1007/s40747-024-01469-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-objective optimization (MOO) endeavors to identify optimal solutions from a finite array of possibilities. In recent years, deep reinforcement learning (RL) has exhibited promise through its well-crafted heuristics in tackling NP-hard combinatorial optimization (CO) problems. Nonetheless, current methodologies grapple with two key challenges: (1) They primarily concentrate on single-objective optimization quandaries, rendering them less adaptable to the more prevalent MOO scenarios encountered in real-world applications. (2) These approaches furnish an approximate solution by imbibing heuristics, lacking a systematic means to enhance or substantiate optimality. Given these challenges, this study introduces an overarching hybrid strategy, dynamic programming with meta-reinforcement learning (DPML), to resolve MOO predicaments. The approach melds meta-learning into an RL framework, addressing multiple subproblems inherent to MOO. Furthermore, the precision of solutions is elevated by endowing exact dynamic programming with the prowess of meta-graph neural networks. Empirical results substantiate the supremacy of our methodology over previous RL and heuristics approaches, bridging the chasm between theoretical underpinnings and real-world applicability within this domain.
引用
收藏
页码:5743 / 5758
页数:16
相关论文
共 50 条
  • [41] Multi-condition multi-objective optimization using deep reinforcement learning
    Kim, Sejin
    Kim, Innyoung
    You, Donghyun
    JOURNAL OF COMPUTATIONAL PHYSICS, 2022, 462
  • [42] Multiagent Meta-Reinforcement Learning for Adaptive Multipath Routing Optimization
    Chen, Long
    Hu, Bin
    Guan, Zhi-Hong
    Zhao, Lian
    Shen, Xuemin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5374 - 5386
  • [43] Multi-objective Meta-return Reinforcement Learning for Sequential Recommendation
    Yu, Yemin
    Kuang, Kun
    Yang, Jiangchao
    Wang, Zeke
    Jia, Kunyang
    Lu, Weiming
    Yang, Hongxia
    Wu, Fei
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 95 - 111
  • [44] Meta-Reinforcement Learning Algorithm Based on Reward and Dynamic Inference
    Chen, Jinhao
    Zhang, Chunhong
    Hu, Zheng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT III, PAKDD 2024, 2024, 14647 : 223 - 234
  • [45] Combining a Multi-Objective Optimization Approach with Meta-Learning for SVM Parameter Selection
    de Miranda, Pericles B. C.
    Prudencio, Ricardo B. C.
    de Carvalho, Andre Carlos P. L. F.
    Soares, Carlos
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2909 - 2914
  • [46] A Novel Multi-Objective Target Value Optimization Approach
    Wenzel, S.
    Straatmann, S.
    Kwiatkowski, L.
    Schmelzer, P.
    Kunert, J.
    CLASSIFICATION AS A TOOL FOR RESEARCH, 2010, : 801 - 809
  • [47] Deep reinforcement learning for multi-objective combinatorial optimization: A case study on multi-objective traveling salesman problem
    Li, Shicheng
    Wang, Feng
    He, Qi
    Wang, Xujie
    SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
  • [48] Model-Based Meta-reinforcement Learning for Hyperparameter Optimization
    Albrechts, Jeroen
    Martin, Hugo M.
    Tavakol, Maryam
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2024, PT I, 2025, 15346 : 27 - 39
  • [49] Solution to multi-objective fuzzy optimization dynamic programming with uncertain information
    Jin, YW
    Shen, H
    Li, KQ
    Chi, ZX
    PDCAT 2005: SIXTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2005, : 979 - 984
  • [50] Meta-Reinforcement Learning in Non-Stationary and Dynamic Environments
    Bing, Zhenshan
    Lerch, David
    Huang, Kai
    Knoll, Alois
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3476 - 3491