Optimizing UAV-UGV coalition operations: A hybrid clustering and multi-agent reinforcement learning approach for path planning in obstructed environment

被引:5
|
作者
Brotee, Shamyo [1 ]
Kabir, Farhan [1 ]
Razzaque, Md. Abdur [1 ]
Roy, Palash [2 ]
Mamun-Or-Rashid, Md. [1 ]
Hassan, Md. Rafiul [3 ]
Hassan, Mohammad Mehedi [4 ]
机构
[1] Univ Dhaka, Green Networking Res Grp, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Green Univ Bangladesh, Dept Comp Sci & Engn, Dhaka, Bangladesh
[3] Univ Maine Presque Isle, Coll Arts & Sci, Presque Isle, ME 04769 USA
[4] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh, Saudi Arabia
关键词
UAV-UGV coalition; Path planning; Multi-agent deep reinforcement learning; Mean-shift clustering; Obstructed environment;
D O I
10.1016/j.adhoc.2024.103519
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the most critical applications undertaken by Unmanned Aerial Vehicles (UAVs) and Unmanned Ground Vehicles (UGVs) is reaching predefined targets by following the most time-efficient routes while avoiding collisions. Unfortunately, UAVs are hampered by limited battery life, and UGVs face challenges in reachability due to obstacles and elevation variations, which is why a coalition of UAVs and UGVs can be highly effective. Existing literature primarily focuses on one-to-one coalitions, which constrains the efficiency of reaching targets. In this work, we introduce a novel approach for a UAV-UGV coalition with a variable number of vehicles, employing a modified mean-shift clustering algorithm (MEANCRFT) to segment targets into multiple zones. This approach of assigning targets to various circular zones based on density and range significantly reduces the time required to reach these targets. Moreover, introducing variability in the number of UAVs and UGVs in a coalition enhances task efficiency by enabling simultaneous multi-target engagement. In our approach, each vehicle of the coalitions is trained using two advanced deep reinforcement learning algorithms in two separate experiments, namely Multi-agent Deep Deterministic Policy Gradient (MADDPG) and Multi- agent Proximal Policy Optimization (MAPPO). The results of our experimental evaluation demonstrate that the proposed MEANCRFT-MADDPG method substantially surpasses current state-of-the-art techniques, nearly doubling efficiency in terms of target navigation time and task completion rate.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Proficiency Constrained Multi-Agent Reinforcement Learning for Environment-Adaptive Multi UAV-UGV Teaming
    Yu, Qifei
    Shen, Zhexin
    Pang, Yijiang
    Liu, Rui
    2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 2114 - 2118
  • [2] Cooperative Multi-Agent Planning Framework for Fuel Constrained UAV-UGV Routing Problem
    Mondal, Md Safwan
    Ramasamy, Subramanian
    Humann, James D.
    Dotterweich, James M.
    Reddinger, Jean-Paul F.
    Childers, Marshal A.
    Bhounsule, Pranav A.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2025, 111 (01)
  • [3] Multi-agent robotic system (MARS) for UAV-UGV path planning and automatic sensory data collection in cluttered environments
    Hu, Difeng
    Gan, Vincent J. L.
    Wang, Tao
    Ma, Ling
    BUILDING AND ENVIRONMENT, 2022, 221
  • [4] Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning
    Zhao, Xiaoru
    Yang, Rennong
    Zhong, Liangsheng
    Hou, Zhiwei
    DRONES, 2024, 8 (01)
  • [5] Optimizing Fuel-Constrained UAV-UGV Routes for Large Scale Coverage: Bilevel Planning in Heterogeneous Multi-Agent Systems
    Mondal, Md Safwan
    Ramasamy, Subramanian
    Humann, James D.
    Reddinger, Jean-Paul F.
    Dotterweich, James M.
    Childers, Marshal A.
    Bhounsule, Pranav
    2023 INTERNATIONAL SYMPOSIUM ON MULTI-ROBOT AND MULTI-AGENT SYSTEMS, MRS, 2023, : 114 - 120
  • [6] Multi-Agent Path Planning in Unknown Environment with Reinforcement Learning and Neural Network
    Luviano Cruz, David
    Yu, Wen
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3458 - 3463
  • [7] Path Planning in Unknown Environment with Kernel Smoothing and Reinforcement Learning for Multi-Agent Systems
    Luviano Cruz, David
    Yu, Wen
    2015 12TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2015), 2015,
  • [8] Iterative Planning for Multi-Agent Systems: An Application in Energy-Aware UAV-UGV Cooperative Task Site Assignments
    Thelasingha, Neelanga
    Julius, A. Agung
    Humann, James
    Reddinger, Jean-Paul
    Dotterweich, James
    Childers, Marshal
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 3685 - 3703
  • [9] Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning
    Qie, Han
    Shi, Dianxi
    Shen, Tianlong
    Xu, Xinhai
    Li, Yuan
    Wang, Liujing
    IEEE ACCESS, 2019, 7 : 146264 - 146272
  • [10] Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning
    Luviano Cruz, David
    Yu, Wen
    NEUROCOMPUTING, 2017, 233 : 34 - 42