rSoccer: A Framework for Studying Reinforcement Learning in Small and Very Small Size Robot Soccer

Cited by: 2
Authors
Martins, Felipe B. [1 ]
Machado, Mateus G. [1 ]
Bassani, Hansenclever F. [1 ]
Braga, Pedro H. M. [1 ]
Barros, Edna S. [1 ]
Affiliations
[1] Univ Fed Pernambuco, Ctr Informat, Av Jornalista Anibal Fernandes S-N, BR-50740560 Recife, PE, Brazil
Source
Keywords
Reinforcement learning; OpenAI Gym; Continuous control; Robot soccer; Simulation;
DOI
10.1007/978-3-030-98682-7_14
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Reinforcement learning is an active research area with a vast number of applications in robotics, and the RoboCup competition is an interesting environment for studying and evaluating reinforcement learning methods. A known difficulty in applying reinforcement learning to robotics is the high number of experience samples required; a viable path is to train the agents in simulated environments and then apply transfer learning to the real world (sim-to-real). This article introduces an open-source simulator for the IEEE Very Small Size Soccer and the Small Size League, optimized for reinforcement learning experiments. We also propose a framework for creating OpenAI Gym environments with a set of benchmark tasks for evaluating single-agent and multi-agent robot soccer skills. We then demonstrate the learning capabilities of two state-of-the-art reinforcement learning methods, as well as their limitations in certain scenarios introduced in this framework. We believe this will make it easier for more teams to compete in these categories using end-to-end reinforcement learning approaches and will further develop this research area.
Pages: 165-176
Number of pages: 12