HIERARCHICAL NEURO-FUZZY MODELS BASED ON REINFORCEMENT LEARNING FOR AUTONOMOUS AGENTS

被引:0
|
作者
Figueiredo, Karla [1 ,2 ]
Vellasco, Marley [1 ]
Pacheco, Marco [1 ]
de Souza, Flavio Joaquim [3 ]
机构
[1] Pontif Catholic Univ Rio de Janeiro, Dept Elect Engn, Rua Marques de Sao Vicente 225, BR-224:3190 Rio De Janeiro, Brazil
[2] Univ Estadual Zona Oeste, Dept Appl Math & Computat Sci, BR-23070200 Rio De Janeiro, Brazil
[3] Univ Estado Rio de Janeiro, Dept Syst & Comp Engn, Rio De Janeiro, Brazil
关键词
Reinforcement learning; Autonomous agents; Hybrid neuro-fuzzy; Hierarchical partitioning; Robotics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work introduces a new class of neuro-uzzy systems for intelligent agents, called ReinfolrenteUt Learning - Hierarchical Neuro-Puzzy Systent. This new class combines a hierarchical partitioning method of the input space with a Reinforcement Learning algorithm to achieve the following important characteristics: automatic creation of the model's structure; self-adjustment of the pammeters; autonomous learning of the actions; capacity to deal with a greater number of inputs; and automatic generation of linguistic fuzzy rules. The proposed model was devised to overcome limitations of traditional reinforcement learning methods based on lookup tables, particularly in applications involving continuous environments and/or environments considered to he high dimensional. The paper details the hierarchical neuro-fuzzy architecture, its basic cell, and the learning algorithm. The performance of the proposed system was evaluated in four benchmark applications the Mountain Car Problem, the Cart-Centering Problem. the Inverted Pendulum and the Khepera Robot Control. The results obtained demonstrate the capacity of the novel hierarchical neuro-fuzzy system to automatically extract knowledge from the agent's direct interaction with large and/or continuous ClItliVOTIMCIttS. This knowledge is in the form of fuzzy linguistic rules, with no prior definition of the number and position of the fuzzy sets.
引用
收藏
页码:1471 / 1494
页数:24
相关论文
共 50 条
  • [21] Mobile Robot Control Based on Hybrid Neuro-Fuzzy Value Gradient Reinforcement Learning
    Al-Dabooni, Seaar
    Wunsch, Donald
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2820 - 2827
  • [22] A hierarchical recurrent neuro-fuzzy system
    Nürnberger, A
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 1407 - 1412
  • [23] How an Adaptive Learning Rate Benefits Neuro-Fuzzy Reinforcement Learning Systems
    Kuremoto, Takashi
    Obayashi, Masanao
    Kobayashi, Kunikazu
    Mabu, Shingo
    ADVANCES IN SWARM INTELLIGENCE, PT1, 2014, 8794 : 324 - 331
  • [24] How an adaptive learning rate benefits neuro-fuzzy reinforcement learning systems
    Kuremoto, Takashi (wu@yamaguchi-u.ac.jp), 1600, Springer Verlag (8794):
  • [25] Artificial Bee Colony Based Learning of Local Linear Neuro-Fuzzy Models
    Nikookar, Alireza
    Lucas, Caro
    Pedram, Mir Mohsen
    2013 13TH IRANIAN CONFERENCE ON FUZZY SYSTEMS (IFSC), 2013,
  • [26] Neuro-fuzzy identification models
    Matko, D
    Karba, R
    Zupancic, B
    PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY 2000, VOLS 1 AND 2, 2000, : 650 - 655
  • [27] Simplification of Neuro-Fuzzy Models
    Siminski, Krzysztof
    MAN-MACHINE INTERACTIONS, 2009, 59 : 265 - 272
  • [28] Human fall detection using neuro-fuzzy models based on ensemble learning
    Kordnoori, Shirin
    Sharifi, Arash
    Shah-Hosseini, Hamed
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2022, 11 (03) : 219 - 232
  • [29] Human fall detection using neuro-fuzzy models based on ensemble learning
    Shirin Kordnoori
    Arash Sharifi
    Hamed Shah-Hosseini
    Progress in Artificial Intelligence, 2022, 11 : 219 - 232
  • [30] Neuro-Fuzzy Based Autonomous Mobile Robot Navigation System
    Joshi, Maulin M.
    Zaveri, Mukesh A.
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 384 - 389