A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking without Exploration

被引:4
|
作者
Valiollahi, S. [1 ]
Ghaderi, R. [1 ]
Ebrahimzadeh, A. [1 ]
机构
[1] Babol Univ Technol, Dept Elect & Comp Engn, Babol Sar 7414871167, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2012年 / 25卷 / 04期
关键词
Autonomous Navigation; Wall Tracking; Fuzzy Q-learning; Khepera Robot;
D O I
10.5829/idosi.ije.2012.25.04a.07
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A simple and easy to implement is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. The robot summerizes the obtained information from the world into a set of fuzzy states. For each fuzzy state, there are some suggested actions. States are related to their corresponding actions via simple fuzzy if-then rules, designed by human reasoning. The robot selects the most encouraged action for each state by Q-learning and through online experiences. The objective is to design a wall tracking algorithm which can efficiently adapt itself to different wall shapes in completely unknown environments. Q-learning is applied without any exploration phase, i.e. no training environment is considered. Experimental results on simulated Khepera robot validate that the proposed method efficiently deals with various wall contours from simple straight shape to complex concave, convex, or polygon shapes. The robot successfully keeps track of walls while staying within predefined margins.
引用
收藏
页码:355 / 366
页数:12
相关论文
共 50 条
  • [21] ENHANCEMENTS OF FUZZY Q-LEARNING ALGORITHM
    Glowaty, Grzegorz
    COMPUTER SCIENCE-AGH, 2005, 7 : 77 - 87
  • [22] Fuzzy Q-Learning with an Adaptive Representation
    Waldock, A.
    Carse, B.
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 720 - +
  • [23] Continuous-action Q-learning
    Millán, JDR
    Posenato, D
    Dedieu, E
    MACHINE LEARNING, 2002, 49 (2-3) : 247 - 265
  • [24] Continuous-Action Q-Learning
    José del R. Millán
    Daniele Posenato
    Eric Dedieu
    Machine Learning, 2002, 49 : 247 - 265
  • [25] Research on intelligence robot formation based on fuzzy Q-Learning
    Zhang, RB
    Shi, Y
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1936 - 1941
  • [26] Modified fuzzy Q-learning based wind speed prediction
    Sharma, Rajneesh
    Shikhola, Tushar
    Kohli, Jaspreet Kaur
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 2020, 206 (206)
  • [27] NAO robot obstacle avoidance based on fuzzy Q-learning
    Wen, Shuhuan
    Hu, Xueheng
    Li, Zhen
    Lam, Hak Keung
    Sun, Fuchun
    Fang, Bin
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2020, 47 (06): : 801 - 811
  • [28] Continuous Deep Q-Learning with Model-based Acceleration
    Gu, Shixiang
    Lillicrap, Timothy
    Sutskever, Ilya
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [29] Entropy-based tuning approach for Q-learning in an unstructured environment
    Chen, Yu-Jen
    Jiang, Wei-Cheng
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2025, 187
  • [30] SEM: Safe exploration mask for q-learning
    Xuan, Chengbin
    Zhang, Feng
    Lam, Hak-Keung
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 111