Multi-Drone Collaborative Shepherding Through Multi-Task Reinforcement Learning

被引:0
|
作者
Wang, Guanghui [1 ]
Peng, Junkun [2 ]
Guan, Chenyang [1 ]
Chen, Jinhua [2 ]
Guo, Bing [1 ]
机构
[1] Qinghai Univ, Xining 810016, Qinghai, Peoples R China
[2] Tsinghua Univ, Beijing 100084, Peoples R China
来源
关键词
Drones; Collaboration; Adaptation models; Biological system modeling; Multitasking; Deep reinforcement learning; Heuristic algorithms; Path planning for multiple mobile robots or agents; reinforcement learning; collaboration; shepherding;
D O I
10.1109/LRA.2024.3468155
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Robotic shepherding has become indispensable in animal husbandry and crowd management, offering a modern solution to traditional challenges. Drone Automated Shepherding leverages advanced maneuverability and an extensive field of view to improve the efficiency of these tasks, which are typically labor-intensive and time-consuming. Existing methods for managing large herds face significant challenges due to insufficient coordination among multiple drones and the complexities involved in simultaneously executing diverse shepherding tasks. This paper aims to enhance the execution of multiple shepherding tasks by optimizing drone coordination, designing optimal flight paths, and reducing flight time. To harness the potential of reinforcement learning, we develop a multi-drone collaborative shepherding environment that facilitates efficient drone training using a dense reward system. Additionally, we employ a multi-task deep reinforcement learning algorithm that enhances the sample efficiency and reward performance by leveraging shared information across tasks within this environment. Two specific tasks, driving and collecting, are used to assess the performance of our methodology. The effectiveness of our approach is measured against a classical solution named CTRL, examining metrics such as success rate, completion time, and flight path length. Results indicate that our approach significantly outperforms the CTRL in all measured metrics. Visualization of drone trajectories provides further evidence of our enhanced collaboration and efficiency in shepherding operations. Real-world experiments are conducted on a square of 300 m(3) , where two drones utilize our method to guide four small autonomous vehicles from the starting area to the goal area within 30 seconds.
引用
收藏
页码:10311 / 10318
页数:8
相关论文
共 50 条
  • [41] Options in Multi-task Reinforcement Learning - Transfer via Reflection
    Denis, Nicholas
    Fraser, Maia
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 225 - 237
  • [42] Curriculum-Based Asymmetric Multi-Task Reinforcement Learning
    Huang, Hanchi
    Ye, Deheng
    Shen, Li
    Liu, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7258 - 7269
  • [43] Multi-Task Deep Reinforcement Learning for Continuous Action Control
    Yang, Zhaoyang
    Merrick, Kathryn
    Abbass, Hussein
    Jin, Lianwen
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3301 - 3307
  • [44] Multi-Task Reinforcement Learning with Context-based Representations
    Sodhani, Shagun
    Zhang, Amy
    Pineau, Joelle
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [45] Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
    Lan, Siming
    Zhang, Rui
    Yi, Qi
    Guo, Jiaming
    Peng, Shaohui
    Gao, Yunkai
    Wu, Fan
    Chen, Ruizhi
    Du, Zidong
    Hu, Xing
    Zhang, Xishan
    Li, Ling
    Chen, Yunji
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [46] Multi-task reinforcement learning in partially observable stochastic environments
    Li, Hui
    Liao, Xuejun
    Carin, Lawrence
    Journal of Machine Learning Research, 2009, 10 : 1131 - 1186
  • [47] Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
    Yu, Tianhe
    Kumar, Aviral
    Chebotar, Yevgen
    Hausman, Karol
    Levine, Sergey
    Finn, Chelsea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [48] Discovering Synergies for Robot Manipulation with Multi-Task Reinforcement Learning
    He, Zhanpeng
    Ciocarlie, Matei
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2714 - 2721
  • [49] PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction
    Bai, Fengshuo
    Zhang, Hongming
    Tao, Tianyang
    Wu, Zhiheng
    Wang, Yanna
    Xu, Bo
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6728 - 6736
  • [50] Efficient Design Space Exploration with Multi-Task Reinforcement Learning
    Hoffmann, Patrick
    Gorelik, Kirill
    Ivanov, Valentin
    2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, AIM 2024, 2024, : 1378 - 1385