A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking without Exploration

被引:4
|
作者
Valiollahi, S. [1 ]
Ghaderi, R. [1 ]
Ebrahimzadeh, A. [1 ]
机构
[1] Babol Univ Technol, Dept Elect & Comp Engn, Babol Sar 7414871167, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2012年 / 25卷 / 04期
关键词
Autonomous Navigation; Wall Tracking; Fuzzy Q-learning; Khepera Robot;
D O I
10.5829/idosi.ije.2012.25.04a.07
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A simple and easy to implement is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. The robot summerizes the obtained information from the world into a set of fuzzy states. For each fuzzy state, there are some suggested actions. States are related to their corresponding actions via simple fuzzy if-then rules, designed by human reasoning. The robot selects the most encouraged action for each state by Q-learning and through online experiences. The objective is to design a wall tracking algorithm which can efficiently adapt itself to different wall shapes in completely unknown environments. Q-learning is applied without any exploration phase, i.e. no training environment is considered. Experimental results on simulated Khepera robot validate that the proposed method efficiently deals with various wall contours from simple straight shape to complex concave, convex, or polygon shapes. The robot successfully keeps track of walls while staying within predefined margins.
引用
收藏
页码:355 / 366
页数:12
相关论文
共 50 条
  • [31] Graph Exploration for Effective Multiagent Q-Learning
    Zhaikhan, Ainur
    Sayed, Ali H.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
  • [32] Learning of Keepaway Task for RoboCup Soccer Agent Based on Fuzzy Q-Learning
    Sawa, Toru
    Watanabe, Toshihiko
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 250 - 256
  • [33] An improved Q-learning algorithm based on exploration region expansion strategy
    Gao, Qingji
    Hong, Bingong
    He, Zhendong
    Liu, Jie
    Niu, Guochen
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4167 - +
  • [34] Agent learning in simulated soccer by fuzzy Q-learning
    Takahashi, K
    Ueda, H
    Miyahara, T
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : B338 - B341
  • [35] A New Criterion of Human Comfort Assessment for Wheelchair Robots by Q-Learning Based Accompanist Tracking Fuzzy Controller
    Bing-Fei Wu
    Po-Yen Chen
    Chun-Hsien Lin
    International Journal of Fuzzy Systems, 2016, 18 : 1039 - 1053
  • [36] A New Criterion of Human Comfort Assessment for Wheelchair Robots by Q-Learning Based Accompanist Tracking Fuzzy Controller
    Wu, Bing-Fei
    Chen, Po-Yen
    Lin, Chun-Hsien
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2016, 18 (06) : 1039 - 1053
  • [37] Improved Fuzzy Q-Learning with Replay Memory
    Li, Xin
    Cohen, Kelly
    FUZZY INFORMATION PROCESSING 2020, 2022, 1337 : 13 - 23
  • [38] Parameter specification for fuzzy clustering by Q-learning
    Oh, CH
    Ikeda, E
    Honda, K
    Ichihashi, H
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL IV, 2000, : 9 - 12
  • [39] Fuzzy Q-learning Control for Temperature Systems
    Chen, Yeong-Chin
    Hung, Lon-Chen
    Syamsudin, Mariana
    22ND IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2021-FALL), 2021, : 148 - 151
  • [40] Decoupled Visual Servoing With Fuzzy Q-Learning
    Shi, Haobin
    Li, Xuesi
    Hwang, Kao-Shing
    Pan, Wei
    Xu, Genjiu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (01) : 241 - 252