Energy Constrained Multi-Agent Reinforcement Learning for Coverage Path Planning

被引:0
|
作者
Zhao, Chenyang [1 ]
Liu, Juan [2 ]
Yoon, Suk-Un [3 ]
Li, Xinde [1 ,4 ]
Li, Heqing [1 ]
Zhang, Zhentong [1 ,4 ]
机构
[1] Southeast Univ, Nanjing 210096, Peoples R China
[2] Samsung Elect China R&D Ctr, Nanjing 210012, Peoples R China
[3] Samsung Elect, Suwon 16677, Gyeonggi Do, South Korea
[4] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China
基金
中国国家自然科学基金;
关键词
NAVIGATION;
D O I
10.1109/IROS55552.2023.10341412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For multi-agent area coverage path planning problem, existing researches regard it as a combination of Traveling Salesman Problem (TSP) and Coverage Path Planning (CPP). However, these approaches have disadvantages of poor observation ability in online phase and high computational cost in offline phase, making it difficult to be applied to energy-constrained Unmanned Aerial Vehicles (UAVs) and adjust strategy dynamically. In this paper, we decompose the task into two sub-problems: multi-agent path planning and sub-region CPP. We model the multi-agent path planning problem as a Collective Markov Decision Process (C-MDP), and design an Energy Constrained Multi-Agent Reinforcement Learning (ECMARL) algorithm based on the centralized training and distributed execution concept. Taking into account energy constraint of UAVs, the UAV propulsion power model is established to measure the energy consumption of UAVs, and load balancing strategy is applied to dynamically allocate target areas for each UAV. If the UAV is under energy-depleted situation, ECMARL can adjust the mission strategy in real time according to environmental information and energy storage conditions of other UAVs. When UAVs reach each sub-region of interest, Back-an-Forth Paths (BFPs) are adopted to solve CPP problem, which can ensure full coverage, optimality and complexity of the sub-problem. Comprehensive theoretical analysis and experiments demonstrate that ECMARL is superior to the traditional offline TSP-CPP strategy in terms of solution quality and computational time, and can effectively deal with the energy-constrained UAVs.
引用
收藏
页码:5590 / 5597
页数:8
相关论文
共 50 条
  • [21] Multi-Agent Coverage Path Planning via Proximity Interaction and Cooperation
    Jiao, Lei
    Peng, Zhihong
    Xi, Lele
    Ding, Shuxin
    Cui, Jinqiang
    IEEE SENSORS JOURNAL, 2022, 22 (06) : 6196 - 6207
  • [22] Optimal Multi-Agent Coverage and Flight Time with Genetic Path Planning
    Olson, Jacob M.
    Bidstrup, Craig C.
    Anderson, Brady K.
    Parkinson, Alan R.
    McLain, Timothy W.
    2020 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS'20), 2020, : 228 - 237
  • [23] A Multi-agent Path Planning Algorithm Based on Hierarchical Reinforcement Learning and Artificial Potential Field
    Zheng, Yanbin
    Li, Bo
    An, Deyu
    Li, Na
    2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 363 - 369
  • [24] Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning
    Luviano Cruz, David
    Yu, Wen
    NEUROCOMPUTING, 2017, 233 : 34 - 42
  • [25] Constrained Motion Planning and Multi-Agent Path Finding on directed graphs☆
    Ardizzoni, Stefano
    Consolini, Luca
    Locatelli, Marco
    Saccani, Irene
    AUTOMATICA, 2024, 165
  • [26] Multi-Agent Deep Reinforcement Learning-Based Multi-UAV Path Planning for Wireless Data Collection and Energy Transfer
    Lee, Chungnyeong
    Lee, Sangcheol
    Kim, Taehoon
    Bang, Inkyu
    Lee, Jung Hoon
    Chae, Seong Ho
    2024 FIFTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS, ICUFN 2024, 2024, : 500 - 504
  • [27] Multi-agent hierarchical reinforcement learning for energy management
    Jendoubi, Imen
    Bouffard, Francois
    APPLIED ENERGY, 2023, 332
  • [28] Area Coverage Maximization of Multi UAVs Using Multi-Agent Reinforcement Learning
    Wijaya, Glenn B.
    Tamba, Tua A.
    2023 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND ARTIFICIAL INTELLIGENCE, RAAI 2023, 2023, : 1 - 4
  • [29] Network Maintenance Planning Via Multi-Agent Reinforcement Learning
    Thomas, Jonathan
    Hernandez, Marco Perez
    Parlikad, Ajith Kumar
    Piechocki, Robert
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2289 - 2295
  • [30] Multi-agent reinforcement learning for planning and scheduling multiple goals
    Arai, S
    Sycara, K
    Payne, TR
    FOURTH INTERNATIONAL CONFERENCE ON MULTIAGENT SYSTEMS, PROCEEDINGS, 2000, : 359 - 360