A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

被引:1
|
作者
Di, Yuanzhu [1 ]
Deng, Libao [1 ]
Zhang, Lili [2 ]
机构
[1] Harbin Inst Technol, Sch Informat Sci & Engn, Weihai 264209, Peoples R China
[2] Dublin City Univ, Sch Comp, Dublin, Ireland
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Multi-agent system; Reinforcement learning; Deep neural network; Collaborative learning; Distributed hybrid flow shop scheduling; problem; EVOLUTIONARY ALGORITHM; TARDINESS; MAKESPAN;
D O I
10.1016/j.swevo.2024.101764
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the increasing level of implementation of artificial intelligence technology in solving complex engineering optimization problems, various learning mechanisms, including deep learning (DL) and reinforcement learning (RL), have been developed for manufacturing scheduling. In this paper, a collaborative-learning multi-agent RL method (CL-MARL) is proposed for solving distributed hybrid flow-shop scheduling problem (DHFSP), minimizing both makespan and total energy consumption. First, the DHFSP is formulated as the Markov decision process, the features of machines and jobs are represented as state and observation matrixes according to their characteristics, the candidate operation set is used as action space, and a reward mechanism is designed based on the machine utilization. Next, a set of critic networks and actor networks, consist of recurrent neural networks and fully connected networks, are employed to map the states and observations into the output values. Then, a novel distance matching strategy is designed for each agent to select the most appropriate action at each scheduling step. Finally, the proposed CL-MARL model is trained through multi-agent deep deterministic policy gradient algorithm in collaborative-learning manner. The numerical results prove the effectiveness of the proposed multi-agent system, and the comparisons with existing algorithms demonstrate the high-potential of CL-MARL in solving DHFSP.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Distributed localization for IoT with multi-agent reinforcement learning
    Jia, Jie
    Yu, Ruoying
    Du, Zhenjun
    Chen, Jian
    Wang, Qinghu
    Wang, Xingwei
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (09): : 7227 - 7240
  • [32] Distributed Coordination Guidance in Multi-Agent Reinforcement Learning
    Lau, Qiangfeng Peter
    Lee, Mong Li
    Hsu, Wynne
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 456 - 463
  • [33] Distributed reinforcement learning in multi-agent decision systems
    Giráldez, JI
    Borrajo, D
    PROGRESS IN ARTIFICIAL INTELLIGENCE-IBERAMIA 98, 1998, 1484 : 148 - 159
  • [34] Distributed localization for IoT with multi-agent reinforcement learning
    Jie Jia
    Ruoying Yu
    Zhenjun Du
    Jian Chen
    Qinghu Wang
    Xingwei Wang
    Neural Computing and Applications, 2022, 34 : 7227 - 7240
  • [35] Collaborative Multi-Agent Tracking based on Distributed Learning
    Qiu, Xuyi
    Zhai, Yiwei
    Wan, Kaifang
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2588 - 2593
  • [36] A Multi-agent Reinforcement Learning Method for Swarm Robots in Space Collaborative Exploration
    Huang, Yixin
    Wu, Shufan
    Mu, Zhongcheng
    Long, Xiangyu
    Chu, Sunhao
    Zhao, Guohong
    2020 6TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2020, : 139 - 144
  • [37] An Optimization Method for Collaborative Radar Antijamming Based on Multi-Agent Reinforcement Learning
    Feng, Cheng
    Fu, Xiongjun
    Wang, Ziyi
    Dong, Jian
    Zhao, Zhichun
    Pan, Teng
    REMOTE SENSING, 2023, 15 (11)
  • [38] Distributed and Multi-Agent Reinforcement Learning Framework for Optimal Electric Vehicle Charging Scheduling
    Korkas, Christos D.
    Tsaknakis, Christos D.
    Kapoutsis, Athanasios Ch.
    Kosmatopoulos, Elias
    ENERGIES, 2024, 17 (15)
  • [39] Distributed Multi-Agent Reinforcement Learning for Collaborative Path Planning and Scheduling in Blockchain-Based Cognitive Internet of Vehicles
    Chang, Huigang
    Liu, Yiming
    Sheng, Zhengguo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (05) : 6301 - 6317
  • [40] Deep Reinforcement Learning for Distributed Flow Shop Scheduling with Flexible Maintenance
    Yan, Qi
    Wu, Wenbin
    Wang, Hongfeng
    MACHINES, 2022, 10 (03)