Reinforcement Learning Mission Supervisor Design for Behavior-based Differential Drive Robots

Cited by: 0

Authors:

Zhang, Zhenyi [1,2]

Huang, Jie [1,2]

Affiliations:

[1] School of Electrical Engineering and Automation, Fuzhou University, Fuzhou 350108, China

[2] 5G+ Industrial Internet Institute of Fuzhou University, Fuzhou 350108, China

Source:

Jiqiren/Robot | 2024, Vol. 46, No. 04

Keywords:

Reinforcement learning

DOI:

10.13973/j.cnki.robot.230148

Abstract:

A multi-agent reinforcement learning mission supervisor (MARLMS) is designed for differential drive robots using trial-and-error learning. The proposed MARLMS addresses a challenge inherent in behavior-based multi-agent systems: the switching rules that determine behavior priorities are usually designed by hand and rely heavily on human intelligence. Building upon the null-space-based behavioral control (NSBC) framework, a differential model is introduced to replace the particle model. Consequently, a paradigm of NSBC with nonholonomic constraints is presented for the first time, enhancing the system's robustness to local-minimum states. Subsequently, the behavior priority switching problem is modeled as a cooperative Markov game, and a joint policy is developed to determine behavior priorities dynamically and intelligently. The proposed MARLMS not only eliminates the need for manual design of switching rules but also reduces the computational and storage burdens during online operation. Simulation results demonstrate the superior behavior priority switching performance of the proposed MARLMS. Furthermore, successful implementation on AgileX Limo robots validates the practicality of the proposed MARLMS. © 2024 Chinese Academy of Sciences. All rights reserved.

Pages: 397-416
