A comparison of learning performance in two-dimensional Q-learning by the difference of Q-values alignment

Cited by: 0
Authors
Kathy Thi Aung
Takayasu Fuchida
Affiliations
[1] Kagoshima University, Department of Information and Computer Science, Graduate School of Science and Engineering
Keywords
Q-learning; Q-value; Voronoi Q-value element; Single agent; State space
DOI
10.1007/s10015-011-0961-5
Abstract
In this article, we examine the learning performance of various strategies under different conditions using reward-based Voronoi Q-value elements (VQEs) in a single-agent environment, where the VQEs determine how the agent acts in a given state. To test our hypotheses, we performed computational experiments under several conditions: various rotation angles of VQEs arranged in a lattice structure, various rotation angles of the agent's four action directions, and a random arrangement of VQEs, in order to correctly evaluate the optimal Q-values for state-action pairs when dealing with continuous-valued inputs. The results show that learning performance changes when the rotation angle of the VQEs and the rotation angle of the actions are varied relative to each other.
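The abstract gives no implementation details, so the following is only a minimal sketch of the general idea it describes, under stated assumptions: VQE centers placed on a square lattice rotated by a chosen angle, four movement actions whose directions can likewise be rotated, nearest-neighbor assignment of a continuous 2D position to a VQE, and a standard one-step Q-learning update over (VQE, action) pairs. Function names such as make_vqe_lattice and nearest_vqe are illustrative, not taken from the paper.

```python
import numpy as np

def rotation_matrix(angle_deg):
    """2D rotation matrix for the given angle in degrees."""
    theta = np.deg2rad(angle_deg)
    return np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]])

def make_vqe_lattice(n, spacing, angle_deg):
    """Return an (n*n, 2) array of VQE centers on a lattice rotated by angle_deg."""
    xs, ys = np.meshgrid(np.arange(n), np.arange(n))
    pts = np.stack([xs.ravel(), ys.ravel()], axis=1).astype(float) * spacing
    return pts @ rotation_matrix(angle_deg).T

def make_actions(step, angle_deg):
    """Four movement vectors (right/up/left/down) rotated by angle_deg."""
    base = np.array([[1, 0], [0, 1], [-1, 0], [0, -1]], dtype=float) * step
    return base @ rotation_matrix(angle_deg).T

def nearest_vqe(pos, centers):
    """Index of the VQE whose Voronoi cell contains the continuous position."""
    return int(np.argmin(np.linalg.norm(centers - pos, axis=1)))

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on the VQE-discretized state."""
    Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])

# Example of a single (hypothetical) learning step in a continuous 2D world:
centers = make_vqe_lattice(n=10, spacing=1.0, angle_deg=15.0)   # rotated VQE lattice
actions = make_actions(step=0.5, angle_deg=0.0)                 # action-direction rotation
Q = np.zeros((len(centers), len(actions)))
pos = np.array([3.2, 4.7])                                      # continuous-valued input
s = nearest_vqe(pos, centers)
a = int(np.argmax(Q[s]))                                        # greedy action for this VQE
new_pos = pos + actions[a]
q_update(Q, s, a, r=0.0, s_next=nearest_vqe(new_pos, centers))
```

Under these assumptions, the experiments described above amount to sweeping the lattice rotation angle and the action-direction rotation angle (and, separately, randomizing the VQE centers) and comparing how quickly the Q-values converge in each configuration.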
Pages: 473–477
Page count: 4