Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

Authors
Selçuk Şenkul
Faruk Polat
Affiliation
[1] Middle East Technical University, Computer Engineering Department
Source
Artificial Intelligence Review, 2002, 18 (02)
Keywords
agent learning; multi-agent systems; Q-learning; reinforcement learning
DOI
Not available
Abstract
Individual learning in an environment where more than one agent exists is a challenging task. In this paper, a single learning agent situated in an environment where multiple agents exist is modeled based on reinforcement learning. The environment is non-stationary and partially accessible from an agent's point of view. Therefore, the learning activities of an agent are influenced by the actions of other cooperative or competitive agents in the environment. A prey-hunter capture game that has the above characteristics is defined and used in experiments to simulate the learning process of individual agents. Experimental results show that there are no strict rules for reinforcement learning. We suggest two new methods to improve the performance of agents. These methods decrease the number of states while keeping as much state as necessary.
Pages: 97-115
Number of pages: 18
Related papers
  • [1] Learning intelligent behavior in a non-stationary and partially observable environment
    Senkul, S
    Polat, F
    ARTIFICIAL INTELLIGENCE REVIEW, 2002, 18 (02) : 97 - 115
  • [2] Learning Contextual Bandits in a Non-stationary Environment
    Wu, Qingyun
    Iyer, Naveen
    Wang, Hongning
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 495 - 504
  • [3] The Parzen Kernel Approach to Learning in Non-stationary Environment
    Pietruczuk, Lena
    Rutkowski, Leszek
    Jaworski, Maciej
    Duda, Piotr
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3319 - 3323
  • [4] Bilevel Online Deep Learning in Non-stationary Environment
    Han, Ya-nan
    Liu, Jian-wei
    Xiao, Bing-biao
    Wang, Xin-Tan
    Luo, Xiong-lin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 347 - 358
  • [5] Learning Optimal Behavior in Environments with Non-stationary Observations
    Boone, Ilio
    Rens, Gavin
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 729 - 736
  • [6] Bargaining in a non-stationary environment
    Coles, MG
    Muthoo, A
    JOURNAL OF ECONOMIC THEORY, 2003, 109 (01) : 70 - 89
  • [7] Distributed recurrent self-organization for tracking the state of non-stationary partially observable dynamical systems
    Khouzam, Bassem
    Frezza-Buet, Herve
    BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, 2013, 3 : 87 - 104
  • [8] Models of Forecasting of Enterprise's Behavior in Non-Stationary External Environment
    Rayevnyeva, Olena
    Touzani, Tarik
    ESTUDIOS DE ECONOMIA APLICADA, 2020, 38 (04):
  • [9] Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model
    Li, Chang
    de Rijke, Maarten
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2859 - 2865
  • [10] INVARIANT STRUCTURE IN NON-STATIONARY BEHAVIOR
    TREVINO, G
    JOURNAL OF SOUND AND VIBRATION, 1988, 125 (03) : 503 - 510