Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management

被引:0
|
作者
Liu, Xiaotian [1 ]
Hu, Ming [2 ]
Peng, Yijie [3 ]
Yang, Yaodong [4 ]
机构
[1] Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
[2] Univ Toronto, Rotman Sch Management, Toronto, ON M5S 3E6, Canada
[3] Peking Univ, PKU Wuhan Inst Artificial Intelligence, Guanghua Sch Management, Xiangjiang Lab, Beijing, Peoples R China
[4] Peking Univ, Inst Artificial Intelligence, PKU Wuhan Inst Artificial Intelligence, Beijing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 美国国家科学基金会;
关键词
Multi-Echelon Inventory Management; Multi-Agent Reinforcement Learning; Bullwhip Effect; OPTIMAL POLICIES; OPTIMALITY;
D O I
10.1177/10591478241305863
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
We apply heterogeneous-agent proximal policy optimization (HAPPO), a multi-agent deep reinforcement learning (MADRL) algorithm, to the decentralized multi-echelon inventory management problems in both a serial supply chain and a supply chain network. We also examine whether the upfront-only information-sharing mechanism used in MADRL helps alleviate the bullwhip effect. Our results show that policies constructed by HAPPO achieve lower overall costs than policies constructed by single-agent deep reinforcement learning and other heuristic policies. Also, the application of HAPPO results in a less significant bullwhip effect than policies constructed by single-agent deep reinforcement learning where information is not shared among actors. Somewhat surprisingly, compared to using the overall costs of the system as a minimization target for each actor, HAPPO achieves lower overall costs when the minimization target for each actor is a combination of its own costs and the overall costs of the system. Our results provide a new perspective on the benefit of information sharing inside the supply chain that helps alleviate the bullwhip effect and improve the overall performance of the system. Upfront information sharing and action coordination in model training among actors is essential, with the former even more essential, for improving a supply chain's overall performance when applying MADRL. Neither actors being fully self-interested nor actors being fully system-focused leads to the best practical performance of policies learned and constructed by MADRL. Our results also verify MADRL's potential in solving various multi-echelon inventory management problems with complex supply chain structures and in non-stationary market environments.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Multi-Agent Deep Reinforcement Learning for Walker Systems
    Park, Inhee
    Moh, Teng-Sheng
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 490 - 495
  • [42] Action Markets in Deep Multi-Agent Reinforcement Learning
    Schmid, Kyrill
    Belzner, Lenz
    Gabor, Thomas
    Phan, Thomy
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140 : 240 - 249
  • [43] Strategic Interaction Multi-Agent Deep Reinforcement Learning
    Zhou, Wenhong
    Li, Jie
    Chen, Yiting
    Shen, Lin-Cheng
    IEEE Access, 2020, 8 : 119000 - 119009
  • [44] Multi-Agent Deep Reinforcement Learning in Vehicular OCC
    Islam, Amirul
    Musavian, Leila
    Thomos, Nikolaos
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [45] Teaching on a Budget in Multi-Agent Deep Reinforcement Learning
    Ilhan, Ercument
    Gow, Jeremy
    Perez-Liebana, Diego
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [46] Research Progress of Multi-Agent Deep Reinforcement Learning
    Ding, Shi-Feiu
    Du, Weiu
    Zhang, Jianu
    Guo, Li-Liu
    Ding, Ding
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (07): : 1547 - 1567
  • [47] A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning
    Yi Liu
    Xiang Wu
    Yuming Bo
    Jiacun Wang
    Lifeng Ma
    IEEE/CAA Journal of Automatica Sinica, 2024, 11 (11) : 2346 - 2348
  • [48] A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning
    Liu, Yi
    Wu, Xiang
    Bo, Yuming
    Wang, Jiacun
    Ma, Lifeng
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (11) : 2346 - 2348
  • [49] Multi-Agent Reinforcement Learning
    Stankovic, Milos
    2016 13TH SYMPOSIUM ON NEURAL NETWORKS AND APPLICATIONS (NEUREL), 2016, : 43 - 43
  • [50] Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning
    Naderializadeh, Navid
    Sydir, Jaroslaw J.
    Simsek, Meryem
    Nikopour, Hosein
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (06) : 3507 - 3523