Multi-Drone Collaborative Shepherding Through Multi-Task Reinforcement Learning

被引：0

作者：

Wang, Guanghui ^{[1
]}

Peng, Junkun ^{[2
]}

Guan, Chenyang ^{[1
]}

Chen, Jinhua ^{[2
]}

Guo, Bing ^{[1
]}

机构：

[1] Qinghai Univ, Xining 810016, Qinghai, Peoples R China

[2] Tsinghua Univ, Beijing 100084, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 11期

关键词：

Drones; Collaboration; Adaptation models; Biological system modeling; Multitasking; Deep reinforcement learning; Heuristic algorithms; Path planning for multiple mobile robots or agents; reinforcement learning; collaboration; shepherding;

D O I：

10.1109/LRA.2024.3468155

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Robotic shepherding has become indispensable in animal husbandry and crowd management, offering a modern solution to traditional challenges. Drone Automated Shepherding leverages advanced maneuverability and an extensive field of view to improve the efficiency of these tasks, which are typically labor-intensive and time-consuming. Existing methods for managing large herds face significant challenges due to insufficient coordination among multiple drones and the complexities involved in simultaneously executing diverse shepherding tasks. This paper aims to enhance the execution of multiple shepherding tasks by optimizing drone coordination, designing optimal flight paths, and reducing flight time. To harness the potential of reinforcement learning, we develop a multi-drone collaborative shepherding environment that facilitates efficient drone training using a dense reward system. Additionally, we employ a multi-task deep reinforcement learning algorithm that enhances the sample efficiency and reward performance by leveraging shared information across tasks within this environment. Two specific tasks, driving and collecting, are used to assess the performance of our methodology. The effectiveness of our approach is measured against a classical solution named CTRL, examining metrics such as success rate, completion time, and flight path length. Results indicate that our approach significantly outperforms the CTRL in all measured metrics. Visualization of drone trajectories provides further evidence of our enhanced collaboration and efficiency in shepherding operations. Real-world experiments are conducted on a square of 300 m(3) , where two drones utilize our method to guide four small autonomous vehicles from the starting area to the goal area within 30 seconds.

引用

页码：10311 / 10318

页数：8

共 50 条

[41] Options in Multi-task Reinforcement Learning - Transfer via Reflection
Denis, Nicholas
Fraser, Maia
ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 225 - 237
[42] Curriculum-Based Asymmetric Multi-Task Reinforcement Learning
Huang, Hanchi
Ye, Deheng
Shen, Li
Liu, Wei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7258 - 7269
[43] Multi-Task Deep Reinforcement Learning for Continuous Action Control
Yang, Zhaoyang
Merrick, Kathryn
Abbass, Hussein
Jin, Lianwen
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3301 - 3307
[44] Multi-Task Reinforcement Learning with Context-based Representations
Sodhani, Shagun
Zhang, Amy
Pineau, Joelle
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[45] Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
Lan, Siming
Zhang, Rui
Yi, Qi
Guo, Jiaming
Peng, Shaohui
Gao, Yunkai
Wu, Fan
Chen, Ruizhi
Du, Zidong
Hu, Xing
Zhang, Xishan
Li, Ling
Chen, Yunji
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[46] Multi-task reinforcement learning in partially observable stochastic environments
Li, Hui
Liao, Xuejun
Carin, Lawrence
Journal of Machine Learning Research, 2009, 10 : 1131 - 1186
[47] Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Yu, Tianhe
Kumar, Aviral
Chebotar, Yevgen
Hausman, Karol
Levine, Sergey
Finn, Chelsea
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[48] Discovering Synergies for Robot Manipulation with Multi-Task Reinforcement Learning
He, Zhanpeng
Ciocarlie, Matei
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2714 - 2721
[49] PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction
Bai, Fengshuo
Zhang, Hongming
Tao, Tianyang
Wu, Zhiheng
Wang, Yanna
Xu, Bo
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6728 - 6736
[50] Efficient Design Space Exploration with Multi-Task Reinforcement Learning
Hoffmann, Patrick
Gorelik, Kirill
Ivanov, Valentin
2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, AIM 2024, 2024, : 1378 - 1385

← 1 2 3 4 5 →