Energy Constrained Multi-Agent Reinforcement Learning for Coverage Path Planning

被引：0

作者：

Zhao, Chenyang ^{[1
]}

Liu, Juan ^{[2
]}

Yoon, Suk-Un ^{[3
]}

Li, Xinde ^{[1
,4
]}

Li, Heqing ^{[1
]}

Zhang, Zhentong ^{[1
,4
]}

机构：

[1] Southeast Univ, Nanjing 210096, Peoples R China

[2] Samsung Elect China R&D Ctr, Nanjing 210012, Peoples R China

[3] Samsung Elect, Suwon 16677, Gyeonggi Do, South Korea

[4] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

中国国家自然科学基金;

关键词：

NAVIGATION;

D O I：

10.1109/IROS55552.2023.10341412

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For multi-agent area coverage path planning problem, existing researches regard it as a combination of Traveling Salesman Problem (TSP) and Coverage Path Planning (CPP). However, these approaches have disadvantages of poor observation ability in online phase and high computational cost in offline phase, making it difficult to be applied to energy-constrained Unmanned Aerial Vehicles (UAVs) and adjust strategy dynamically. In this paper, we decompose the task into two sub-problems: multi-agent path planning and sub-region CPP. We model the multi-agent path planning problem as a Collective Markov Decision Process (C-MDP), and design an Energy Constrained Multi-Agent Reinforcement Learning (ECMARL) algorithm based on the centralized training and distributed execution concept. Taking into account energy constraint of UAVs, the UAV propulsion power model is established to measure the energy consumption of UAVs, and load balancing strategy is applied to dynamically allocate target areas for each UAV. If the UAV is under energy-depleted situation, ECMARL can adjust the mission strategy in real time according to environmental information and energy storage conditions of other UAVs. When UAVs reach each sub-region of interest, Back-an-Forth Paths (BFPs) are adopted to solve CPP problem, which can ensure full coverage, optimality and complexity of the sub-problem. Comprehensive theoretical analysis and experiments demonstrate that ECMARL is superior to the traditional offline TSP-CPP strategy in terms of solution quality and computational time, and can effectively deal with the energy-constrained UAVs.

引用

页码：5590 / 5597

页数：8

共 50 条

[21] Multi-Agent Coverage Path Planning via Proximity Interaction and Cooperation
Jiao, Lei
Peng, Zhihong
Xi, Lele
Ding, Shuxin
Cui, Jinqiang
IEEE SENSORS JOURNAL, 2022, 22 (06) : 6196 - 6207
[22] Optimal Multi-Agent Coverage and Flight Time with Genetic Path Planning
Olson, Jacob M.
Bidstrup, Craig C.
Anderson, Brady K.
Parkinson, Alan R.
McLain, Timothy W.
2020 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS'20), 2020, : 228 - 237
[23] A Multi-agent Path Planning Algorithm Based on Hierarchical Reinforcement Learning and Artificial Potential Field
Zheng, Yanbin
Li, Bo
An, Deyu
Li, Na
2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 363 - 369
[24] Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning
Luviano Cruz, David
Yu, Wen
NEUROCOMPUTING, 2017, 233 : 34 - 42
[25] Constrained Motion Planning and Multi-Agent Path Finding on directed graphs☆
Ardizzoni, Stefano
Consolini, Luca
Locatelli, Marco
Saccani, Irene
AUTOMATICA, 2024, 165
[26] Multi-Agent Deep Reinforcement Learning-Based Multi-UAV Path Planning for Wireless Data Collection and Energy Transfer
Lee, Chungnyeong
Lee, Sangcheol
Kim, Taehoon
Bang, Inkyu
Lee, Jung Hoon
Chae, Seong Ho
2024 FIFTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS, ICUFN 2024, 2024, : 500 - 504
[27] Multi-agent hierarchical reinforcement learning for energy management
Jendoubi, Imen
Bouffard, Francois
APPLIED ENERGY, 2023, 332
[28] Area Coverage Maximization of Multi UAVs Using Multi-Agent Reinforcement Learning
Wijaya, Glenn B.
Tamba, Tua A.
2023 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND ARTIFICIAL INTELLIGENCE, RAAI 2023, 2023, : 1 - 4
[29] Network Maintenance Planning Via Multi-Agent Reinforcement Learning
Thomas, Jonathan
Hernandez, Marco Perez
Parlikad, Ajith Kumar
Piechocki, Robert
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2289 - 2295
[30] Multi-agent reinforcement learning for planning and scheduling multiple goals
Arai, S
Sycara, K
Payne, TR
FOURTH INTERNATIONAL CONFERENCE ON MULTIAGENT SYSTEMS, PROCEEDINGS, 2000, : 359 - 360

← 1 2 3 4 5 →