Energy Constrained Multi-Agent Reinforcement Learning for Coverage Path Planning

Citations: 0
Authors
Zhao, Chenyang [1]
Liu, Juan [2]
Yoon, Suk-Un [3]
Li, Xinde [1, 4]
Li, Heqing [1]
Zhang, Zhentong [1, 4]
Affiliations
[1] Southeast Univ, Nanjing 210096, Peoples R China
[2] Samsung Elect China R&D Ctr, Nanjing 210012, Peoples R China
[3] Samsung Elect, Suwon 16677, Gyeonggi Do, South Korea
[4] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
NAVIGATION;
DOI
10.1109/IROS55552.2023.10341412
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
For the multi-agent area coverage path planning problem, existing research treats the task as a combination of the Traveling Salesman Problem (TSP) and Coverage Path Planning (CPP). However, these approaches suffer from poor observation ability in the online phase and high computational cost in the offline phase, which makes them difficult to apply to energy-constrained Unmanned Aerial Vehicles (UAVs) and leaves them unable to adjust their strategy dynamically. In this paper, we decompose the task into two sub-problems: multi-agent path planning and sub-region CPP. We model the multi-agent path planning problem as a Collective Markov Decision Process (C-MDP) and design an Energy Constrained Multi-Agent Reinforcement Learning (ECMARL) algorithm based on the concept of centralized training and distributed execution. To account for the energy constraints of UAVs, a UAV propulsion power model is established to measure energy consumption, and a load balancing strategy is applied to dynamically allocate target areas to each UAV. If a UAV becomes energy-depleted, ECMARL can adjust the mission strategy in real time according to environmental information and the energy storage conditions of the other UAVs. When the UAVs reach each sub-region of interest, Back-and-Forth Paths (BFPs) are adopted to solve the CPP problem, which provides guarantees on the full coverage, optimality, and complexity of the sub-problem. Comprehensive theoretical analysis and experiments demonstrate that ECMARL outperforms the traditional offline TSP-CPP strategy in solution quality and computation time, and that it deals effectively with energy-constrained UAVs.
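The back-and-forth sweep mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes the sub-region is an obstacle-free rectangular grid of cells, and the function names `back_and_forth_path` and `path_length` are hypothetical, introduced here only for illustration.

```python
from typing import List, Tuple

Cell = Tuple[int, int]


def back_and_forth_path(rows: int, cols: int) -> List[Cell]:
    """Return a sweep that visits every cell of a rows x cols grid once.

    Assumption (not from the source): the sub-region is a rectangular,
    obstacle-free grid and the agent moves between adjacent cells.
    """
    if rows < 0 or cols < 0:
        raise ValueError("rows and cols must be non-negative")
    path: List[Cell] = []
    for r in range(rows):
        # Even rows run left to right, odd rows right to left, so the
        # end of one row is adjacent to the start of the next.
        if r % 2 == 0:
            col_order = range(cols)
        else:
            col_order = range(cols - 1, -1, -1)
        path.extend((r, c) for c in col_order)
    return path


def path_length(path: List[Cell]) -> int:
    """Total Manhattan distance travelled along the path, in cell steps."""
    return sum(
        abs(r1 - r0) + abs(c1 - c0)
        for (r0, c0), (r1, c1) in zip(path, path[1:])
    )


if __name__ == "__main__":
    sweep = back_and_forth_path(2, 3)
    print(sweep)
    print(path_length(sweep))
```

In this simplified setting every cell is visited exactly once and each move goes to an adjacent cell, so the sweep uses rows * cols - 1 steps, the minimum possible for visiting every cell of the grid.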
Pages: 5590-5597
Number of pages: 8