Optimizing UAV-UGV coalition operations: A hybrid clustering and multi-agent reinforcement learning approach for path planning in obstructed environment

被引：5

作者：

Brotee, Shamyo ^{[1
]}

Kabir, Farhan ^{[1
]}

Razzaque, Md. Abdur ^{[1
]}

Roy, Palash ^{[2
]}

Mamun-Or-Rashid, Md. ^{[1
]}

Hassan, Md. Rafiul ^{[3
]}

Hassan, Mohammad Mehedi ^{[4
]}

机构：

[1] Univ Dhaka, Green Networking Res Grp, Dept Comp Sci & Engn, Dhaka, Bangladesh

[2] Green Univ Bangladesh, Dept Comp Sci & Engn, Dhaka, Bangladesh

[3] Univ Maine Presque Isle, Coll Arts & Sci, Presque Isle, ME 04769 USA

[4] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh, Saudi Arabia

来源：

AD HOC NETWORKS | 2024年 / 160卷

关键词：

UAV-UGV coalition; Path planning; Multi-agent deep reinforcement learning; Mean-shift clustering; Obstructed environment;

D O I：

10.1016/j.adhoc.2024.103519

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

One of the most critical applications undertaken by Unmanned Aerial Vehicles (UAVs) and Unmanned Ground Vehicles (UGVs) is reaching predefined targets by following the most time-efficient routes while avoiding collisions. Unfortunately, UAVs are hampered by limited battery life, and UGVs face challenges in reachability due to obstacles and elevation variations, which is why a coalition of UAVs and UGVs can be highly effective. Existing literature primarily focuses on one-to-one coalitions, which constrains the efficiency of reaching targets. In this work, we introduce a novel approach for a UAV-UGV coalition with a variable number of vehicles, employing a modified mean-shift clustering algorithm (MEANCRFT) to segment targets into multiple zones. This approach of assigning targets to various circular zones based on density and range significantly reduces the time required to reach these targets. Moreover, introducing variability in the number of UAVs and UGVs in a coalition enhances task efficiency by enabling simultaneous multi-target engagement. In our approach, each vehicle of the coalitions is trained using two advanced deep reinforcement learning algorithms in two separate experiments, namely Multi-agent Deep Deterministic Policy Gradient (MADDPG) and Multi- agent Proximal Policy Optimization (MAPPO). The results of our experimental evaluation demonstrate that the proposed MEANCRFT-MADDPG method substantially surpasses current state-of-the-art techniques, nearly doubling efficiency in terms of target navigation time and task completion rate.

引用

页数：16

共 50 条

[1] Proficiency Constrained Multi-Agent Reinforcement Learning for Environment-Adaptive Multi UAV-UGV Teaming
Yu, Qifei
Shen, Zhexin
Pang, Yijiang
Liu, Rui
2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 2114 - 2118
[2] Cooperative Multi-Agent Planning Framework for Fuel Constrained UAV-UGV Routing Problem
Mondal, Md Safwan
Ramasamy, Subramanian
Humann, James D.
Dotterweich, James M.
Reddinger, Jean-Paul F.
Childers, Marshal A.
Bhounsule, Pranav A.
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2025, 111 (01)
[3] Multi-agent robotic system (MARS) for UAV-UGV path planning and automatic sensory data collection in cluttered environments
Hu, Difeng
Gan, Vincent J. L.
Wang, Tao
Ma, Ling
BUILDING AND ENVIRONMENT, 2022, 221
[4] Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning
Zhao, Xiaoru
Yang, Rennong
Zhong, Liangsheng
Hou, Zhiwei
DRONES, 2024, 8 (01)
[5] Optimizing Fuel-Constrained UAV-UGV Routes for Large Scale Coverage: Bilevel Planning in Heterogeneous Multi-Agent Systems
Mondal, Md Safwan
Ramasamy, Subramanian
Humann, James D.
Reddinger, Jean-Paul F.
Dotterweich, James M.
Childers, Marshal A.
Bhounsule, Pranav
2023 INTERNATIONAL SYMPOSIUM ON MULTI-ROBOT AND MULTI-AGENT SYSTEMS, MRS, 2023, : 114 - 120
[6] Multi-Agent Path Planning in Unknown Environment with Reinforcement Learning and Neural Network
Luviano Cruz, David
Yu, Wen
2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3458 - 3463
[7] Path Planning in Unknown Environment with Kernel Smoothing and Reinforcement Learning for Multi-Agent Systems
Luviano Cruz, David
Yu, Wen
2015 12TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE 2015), 2015,
[8] Iterative Planning for Multi-Agent Systems: An Application in Energy-Aware UAV-UGV Cooperative Task Site Assignments
Thelasingha, Neelanga
Julius, A. Agung
Humann, James
Reddinger, Jean-Paul
Dotterweich, James
Childers, Marshal
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 3685 - 3703
[9] Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning
Qie, Han
Shi, Dianxi
Shen, Tianlong
Xu, Xinhai
Li, Yuan
Wang, Liujing
IEEE ACCESS, 2019, 7 : 146264 - 146272
[10] Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning
Luviano Cruz, David
Yu, Wen
NEUROCOMPUTING, 2017, 233 : 34 - 42

← 1 2 3 4 5 →