A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

被引：1

作者：

Di, Yuanzhu ^{[1
]}

Deng, Libao ^{[1
]}

Zhang, Lili ^{[2
]}

机构：

[1] Harbin Inst Technol, Sch Informat Sci & Engn, Weihai 264209, Peoples R China

[2] Dublin City Univ, Sch Comp, Dublin, Ireland

来源：

SWARM AND EVOLUTIONARY COMPUTATION | 2024年 / 91卷

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Multi-agent system; Reinforcement learning; Deep neural network; Collaborative learning; Distributed hybrid flow shop scheduling; problem; EVOLUTIONARY ALGORITHM; TARDINESS; MAKESPAN;

D O I：

10.1016/j.swevo.2024.101764

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As the increasing level of implementation of artificial intelligence technology in solving complex engineering optimization problems, various learning mechanisms, including deep learning (DL) and reinforcement learning (RL), have been developed for manufacturing scheduling. In this paper, a collaborative-learning multi-agent RL method (CL-MARL) is proposed for solving distributed hybrid flow-shop scheduling problem (DHFSP), minimizing both makespan and total energy consumption. First, the DHFSP is formulated as the Markov decision process, the features of machines and jobs are represented as state and observation matrixes according to their characteristics, the candidate operation set is used as action space, and a reward mechanism is designed based on the machine utilization. Next, a set of critic networks and actor networks, consist of recurrent neural networks and fully connected networks, are employed to map the states and observations into the output values. Then, a novel distance matching strategy is designed for each agent to select the most appropriate action at each scheduling step. Finally, the proposed CL-MARL model is trained through multi-agent deep deterministic policy gradient algorithm in collaborative-learning manner. The numerical results prove the effectiveness of the proposed multi-agent system, and the comparisons with existing algorithms demonstrate the high-potential of CL-MARL in solving DHFSP.

引用

页数：14

共 50 条

[31] Distributed localization for IoT with multi-agent reinforcement learning
Jia, Jie
Yu, Ruoying
Du, Zhenjun
Chen, Jian
Wang, Qinghu
Wang, Xingwei
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (09): : 7227 - 7240
[32] Distributed Coordination Guidance in Multi-Agent Reinforcement Learning
Lau, Qiangfeng Peter
Lee, Mong Li
Hsu, Wynne
2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 456 - 463
[33] Distributed reinforcement learning in multi-agent decision systems
Giráldez, JI
Borrajo, D
PROGRESS IN ARTIFICIAL INTELLIGENCE-IBERAMIA 98, 1998, 1484 : 148 - 159
[34] Distributed localization for IoT with multi-agent reinforcement learning
Jie Jia
Ruoying Yu
Zhenjun Du
Jian Chen
Qinghu Wang
Xingwei Wang
Neural Computing and Applications, 2022, 34 : 7227 - 7240
[35] Collaborative Multi-Agent Tracking based on Distributed Learning
Qiu, Xuyi
Zhai, Yiwei
Wan, Kaifang
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2588 - 2593
[36] A Multi-agent Reinforcement Learning Method for Swarm Robots in Space Collaborative Exploration
Huang, Yixin
Wu, Shufan
Mu, Zhongcheng
Long, Xiangyu
Chu, Sunhao
Zhao, Guohong
2020 6TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2020, : 139 - 144
[37] An Optimization Method for Collaborative Radar Antijamming Based on Multi-Agent Reinforcement Learning
Feng, Cheng
Fu, Xiongjun
Wang, Ziyi
Dong, Jian
Zhao, Zhichun
Pan, Teng
REMOTE SENSING, 2023, 15 (11)
[38] Distributed and Multi-Agent Reinforcement Learning Framework for Optimal Electric Vehicle Charging Scheduling
Korkas, Christos D.
Tsaknakis, Christos D.
Kapoutsis, Athanasios Ch.
Kosmatopoulos, Elias
ENERGIES, 2024, 17 (15)
[39] Distributed Multi-Agent Reinforcement Learning for Collaborative Path Planning and Scheduling in Blockchain-Based Cognitive Internet of Vehicles
Chang, Huigang
Liu, Yiming
Sheng, Zhengguo
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (05) : 6301 - 6317
[40] Deep Reinforcement Learning for Distributed Flow Shop Scheduling with Flexible Maintenance
Yan, Qi
Wu, Wenbin
Wang, Hongfeng
MACHINES, 2022, 10 (03)

← 1 2 3 4 5 →