Reinforcement Learning Mission Supervisor Design for Behavior-based Differential Drive Robots

被引:0
|
作者
Zhang, Zhenyi [1 ,2 ]
Huang, Jie [1 ,2 ]
机构
[1] School of Electrical Engineering and Automation, Fuzhou University, Fuzhou,350108, China
[2] 5G+ Industrial Internet Institute of Fuzhou University, Fuzhou,350108, China
来源
Jiqiren/Robot | 2024年 / 46卷 / 04期
关键词
Reinforcement learning;
D O I
10.13973/j.cnki.robot.230148
中图分类号
学科分类号
摘要
A multi-agent reinforcement learning mission supervisor (MARLMS) is designed for differential drive robots using trial-and-error learning. The proposed MARLMS addresses the challenge inherent in behavior-based multi-agent systems, wherein the design of switching rules to determine behavior priorities relies heavily on human intelligence. Building upon the null-space-based behavioral control (NSBC) framework, a differential model is introduced to replace the particle model. Consequently, a paradigm of NSBC with nonholonomic constraints is presented for the first time, enhancing the system robustness to the minimum extremum state. Subsequently, a joint policy is developed to dynamically and intelligently determine behavior priorities by modeling the behavior priority switching problem as a cooperative Markov game. The proposed MARLMS not only eliminates the need for manual design of switching rules but also reduces the computational and storage burdens during online operations. Simulation results demonstrate the superior behavior priority switching performance of the proposed MARLMS. Furthermore, successful implementation on AgileX Limo robots validates the practicality of the proposed MARLMS. © 2024 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:397 / 416
相关论文
共 50 条
  • [1] Measuring the effectiveness of reinforcement learning for behavior-based robots
    Shackleton, J
    Gini, M
    ADAPTIVE BEHAVIOR, 1997, 5 (3-4) : 365 - 390
  • [2] AUTOMATIC PROGRAMMING OF BEHAVIOR-BASED ROBOTS USING REINFORCEMENT LEARNING
    MAHADEVAN, S
    CONNELL, J
    ARTIFICIAL INTELLIGENCE, 1992, 55 (2-3) : 311 - 365
  • [3] An architecture for behavior-based reinforcement learning
    Konidaris, GD
    Hayes, GM
    ADAPTIVE BEHAVIOR, 2005, 13 (01) : 5 - 32
  • [4] Design and implementation of a behavior-based control and learning architecture for mobile robots
    Suh, IH
    Lee, S
    Kim, BO
    Yi, BJ
    Oh, SR
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS, 2003, : 4142 - 4147
  • [5] A Behavior-Based Reinforcement Learning Approach to Control Walking Bipedal Robots Under Unknown Disturbances
    Beranek, Richard
    Karimi, Masoud
    Ahmadi, Mojtaba
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (05) : 2710 - 2720
  • [6] Learning to ground fact symbols in behavior-based robots
    Hertzberg, J
    Jaeger, H
    Schönherr, F
    ECAI 2002: 15TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 77 : 708 - 712
  • [7] Behavior-based learning fuzzy rules for mobile robots
    Thongchai, S
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 995 - 1000
  • [8] An application of behavior-based architecture for mobile robots design
    Uribe-Gutierrez, S
    Martinez-Alfaro, H
    MICAI 2000: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, 1793 : 136 - 147
  • [9] Behavior-based Reinforcement Learning Control for Robotic Rehabilitation Training
    Meng, Fancheng
    Fan, Keyan
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4330 - 4334
  • [10] Intelligent Mission Supervisor Design for Null-space-based Behavioral Control System: A Reinforcement Learning Approach
    Huang, Jie
    Mei, Hengquan
    Zhang, Zhenyi
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5861 - 5866