Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

Cited by: 0

Authors:

Selçuk Şenkul

Faruk Polat

Affiliation:

[1] Middle East Technical University, Computer Engineering Department

Source:

Artificial Intelligence Review | 2002 / Vol. 18

Keywords:

agent learning; multi-agent systems; Q-learning; reinforcement learning

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Individual learning in an environment where more than one agent exists is a challenging task. In this paper, a single learning agent situated in an environment where multiple agents exist is modeled based on reinforcement learning. The environment is non-stationary and partially accessible from an agent's point of view. Therefore, the learning activities of an agent are influenced by the actions of other cooperative or competitive agents in the environment. A prey-hunter capture game that has the above characteristics is defined and used in experiments to simulate the learning process of individual agents. Experimental results show that there are no strict rules for reinforcement learning. We suggest two new methods to improve the performance of agents. These methods decrease the number of states while keeping as much state as necessary.


Pages: 97-115

Number of pages: 18

50 items in total

[1] Learning intelligent behavior in a non-stationary and partially observable environment
Senkul, S
Polat, F
ARTIFICIAL INTELLIGENCE REVIEW, 2002, 18 (02) : 97 - 115
[2] Learning Contextual Bandits in a Non-stationary Environment
Wu, Qingyun
Iyer, Naveen
Wang, Hongning
ACM/SIGIR PROCEEDINGS 2018, 2018, : 495 - 504
[3] The Parzen Kernel Approach to Learning in Non-stationary Environment
Pietruczuk, Lena
Rutkowski, Leszek
Jaworski, Maciej
Duda, Piotr
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3319 - 3323
[4] Bilevel Online Deep Learning in Non-stationary Environment
Han, Ya-nan
Liu, Jian-wei
Xiao, Bing-biao
Wang, Xin-Tan
Luo, Xiong-lin
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 347 - 358
[5] Learning Optimal Behavior in Environments with Non-stationary Observations
Boone, Ilio
Rens, Gavin
ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 729 - 736
[6] Bargaining in a non-stationary environment
Coles, MG
Muthoo, A
JOURNAL OF ECONOMIC THEORY, 2003, 109 (01) : 70 - 89
[7] Distributed recurrent self-organization for tracking the state of non-stationary partially observable dynamical systems
Khouzam, Bassem
Frezza-Buet, Herve
BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, 2013, 3 : 87 - 104
[8] Models of Forecasting of Enterprise's Behavior in Non-Stationary External Environment
Rayevnyeva, Olena
Touzani, Tarik
ESTUDIOS DE ECONOMIA APLICADA, 2020, 38 (04):
[9] Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model
Li, Chang
de Rijke, Maarten
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2859 - 2865
[10] INVARIANT STRUCTURE IN NON-STATIONARY BEHAVIOR
TREVINO, G
JOURNAL OF SOUND AND VIBRATION, 1988, 125 (03) : 503 - 510
