Modeling Biological Agents Beyond the Reinforcement-Learning Paradigm

被引:5
|
作者
Georgeon, Olivier L. [1 ]
Casado, Remi C. [1 ]
Matignon, Laetitia A. [1 ]
机构
[1] Univ Lyon 1, LIRIS, UMR5205, F-69622 Villeurbanne, France
关键词
D O I
10.1016/j.procs.2015.12.179
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
It is widely acknowledged that biological beings (animals) are not Markov: modelers generally do not model them as agents receiving a complete representation of their environment's state in input (except perhaps in simple controlled tasks). In this paper, we claim that biological beings generally cannot recognize rewarding Markov states of their environment either. Therefore, we model them as agents trying to perform rewarding interactions with their environment (interaction-driven tasks), but not as agents trying to reach rewarding states (state-driven tasks). We review two interaction-driven tasks: the AB and AABB task, and implement a non-Markov Reinforcement-Learning (RL) algorithm based upon historical sequences and Q-learning. Results show that this RL algorithm takes significantly longer than a constructivist algorithm implemented previously by Georgeon, Ritter, & Haynes (2009). This is because the constructivist algorithm directly learns and repeats hierarchical sequences of interactions, whereas the RL algorithm spends time learning Q-values. Along with theoretical arguments, these results support the constructivist paradigm for modeling biological agents.
引用
收藏
页码:17 / 22
页数:6
相关论文
共 50 条
  • [21] From recurrent choice to skill learning: A reinforcement-learning model
    Fu, Wai-Tat
    Anderson, John R.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2006, 135 (02) : 184 - 206
  • [22] A Reinforcement-Learning Approach to Proactive Caching in Wireless Networks
    Somuyiwa, Samuel O.
    Gyorgy, Andras
    Gunduz, Deniz
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (06) : 1331 - 1344
  • [23] Reinforcement-Learning Based Preload Strategy for Short Video
    Ren, Zhicheng
    Shan, Yongxin
    Jiang, Wanchun
    Shan, Yijing
    Shan, Danfeng
    Wang, Jianxin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 327 - 339
  • [24] Modulating Reinforcement-Learning Parameters using Agent Emotions
    von Haugwitz, Rickard
    Kitamura, Yoshifumi
    Takashima, Kazuki
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1281 - 1285
  • [25] Enforcing ethical goals over reinforcement-learning policies
    Emery A. Neufeld
    Ezio Bartocci
    Agata Ciabattoni
    Guido Governatori
    Ethics and Information Technology, 2022, 24
  • [26] Orbitofrontal Circuits Control Multiple Reinforcement-Learning Processes
    Groman, Stephanie M.
    Keistler, Colby
    Keip, Alex J.
    Hammarlund, Emma
    DiLeone, Ralph J.
    Pittenger, Christopher
    Lee, Daeyeol
    Taylor, Jane R.
    NEURON, 2019, 103 (04) : 734 - +
  • [27] Enforcing ethical goals over reinforcement-learning policies
    Neufeld, Emery A.
    Bartocci, Ezio
    Ciabattoni, Agata
    Governatori, Guido
    ETHICS AND INFORMATION TECHNOLOGY, 2022, 24 (04)
  • [28] A Reinforcement-Learning Style Algorithm for Black Box Automata
    Cohen, Itay
    Fogler, Roi
    Peled, Doron
    2022 20TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2022,
  • [29] Reinforcement-Learning Based Fault-Tolerant Control
    Zhang, Dapeng
    Lin, Zhiling
    Gao, Zhiwei
    2017 IEEE 15TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2017, : 671 - 676
  • [30] A reinforcement-learning approach to failure-detection scheduling
    Zeng, Fancong
    USIC 2007: PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE, 2007, : 161 - 170