A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking without Exploration

被引：4

作者：

Valiollahi, S. ^{[1
]}

Ghaderi, R. ^{[1
]}

Ebrahimzadeh, A. ^{[1
]}

机构：

[1] Babol Univ Technol, Dept Elect & Comp Engn, Babol Sar 7414871167, Iran

来源：

INTERNATIONAL JOURNAL OF ENGINEERING | 2012年 / 25卷 / 04期

关键词：

Autonomous Navigation; Wall Tracking; Fuzzy Q-learning; Khepera Robot;

D O I：

10.5829/idosi.ije.2012.25.04a.07

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

A simple and easy to implement is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. The robot summerizes the obtained information from the world into a set of fuzzy states. For each fuzzy state, there are some suggested actions. States are related to their corresponding actions via simple fuzzy if-then rules, designed by human reasoning. The robot selects the most encouraged action for each state by Q-learning and through online experiences. The objective is to design a wall tracking algorithm which can efficiently adapt itself to different wall shapes in completely unknown environments. Q-learning is applied without any exploration phase, i.e. no training environment is considered. Experimental results on simulated Khepera robot validate that the proposed method efficiently deals with various wall contours from simple straight shape to complex concave, convex, or polygon shapes. The robot successfully keeps track of walls while staying within predefined margins.

引用

页码：355 / 366

页数：12

共 50 条

[21] ENHANCEMENTS OF FUZZY Q-LEARNING ALGORITHM
Glowaty, Grzegorz
COMPUTER SCIENCE-AGH, 2005, 7 : 77 - 87
[22] Fuzzy Q-Learning with an Adaptive Representation
Waldock, A.
Carse, B.
2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 720 - +
[23] Continuous-action Q-learning
Millán, JDR
Posenato, D
Dedieu, E
MACHINE LEARNING, 2002, 49 (2-3) : 247 - 265
[24] Continuous-Action Q-Learning
José del R. Millán
Daniele Posenato
Eric Dedieu
Machine Learning, 2002, 49 : 247 - 265
[25] Research on intelligence robot formation based on fuzzy Q-Learning
Zhang, RB
Shi, Y
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1936 - 1941
[26] Modified fuzzy Q-learning based wind speed prediction
Sharma, Rajneesh
Shikhola, Tushar
Kohli, Jaspreet Kaur
JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 2020, 206 (206)
[27] NAO robot obstacle avoidance based on fuzzy Q-learning
Wen, Shuhuan
Hu, Xueheng
Li, Zhen
Lam, Hak Keung
Sun, Fuchun
Fang, Bin
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2020, 47 (06): : 801 - 811
[28] Continuous Deep Q-Learning with Model-based Acceleration
Gu, Shixiang
Lillicrap, Timothy
Sutskever, Ilya
Levine, Sergey
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[29] Entropy-based tuning approach for Q-learning in an unstructured environment
Chen, Yu-Jen
Jiang, Wei-Cheng
ROBOTICS AND AUTONOMOUS SYSTEMS, 2025, 187
[30] SEM: Safe exploration mask for q-learning
Xuan, Chengbin
Zhang, Feng
Lam, Hak-Keung
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 111

← 1 2 3 4 5 →