Probability matching and reinforcement learning

被引:5
|
作者
Rivas, Javier [1 ]
机构
[1] Univ Leicester, Dept Econ, Leicester LE1 7RH, Leics, England
关键词
Probability matching; Reinforcement learning; DECISION-MAKING; FORM GAMES;
D O I
10.1016/j.jmateco.2012.09.004
中图分类号
F [经济];
学科分类号
02 ;
摘要
Probability matching occurs when an action is chosen with a frequency equivalent to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologists and experimental economists. We provide an evolutionary foundation for this phenomenon by showing that learning by reinforcement can lead to probability matching and, if the learning occurs sufficiently slowly, probability matching does not only occur in choice frequencies but also in choice probabilities. Our results are completed by proving that there exists no quasi-linear reinforcement learning specification such that the behavior is optimal for all environments where counterfactuals are observed. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:17 / 21
页数:5
相关论文
共 50 条
  • [31] Latent Structure Matching for Knowledge Transfer in Reinforcement Learning
    Zhou, Yi
    Yang, Fenglei
    FUTURE INTERNET, 2020, 12 (02)
  • [32] Adaptive Pattern Matching with Reinforcement Learning for Dynamic Graphs
    Kanezashi, Hiroki
    Suzumura, Toyotaro
    Garcia-Gasulla, Dario
    Oh, Min-hwan
    Matsuoka, Satoshi
    2018 IEEE 25TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2018, : 92 - 101
  • [33] Deep reinforcement learning approach for ontology matching problem
    Touati, Chahira
    Kemmar, Amina
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 18 (01) : 97 - 112
  • [34] Multiagent Reinforcement Learning with Regret Matching for Robot Soccer
    Liu, Qiang
    Ma, Jiachen
    Xie, Wei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [35] English Language Learning Pattern Matching Based on Distributed Reinforcement Learning
    Zhao, Hua
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [36] COMBINATION OF SELF-REINFORCEMENT AND EXTERNAL-REINFORCEMENT ON MATCHING-TO-SAMPLE LEARNING
    KAWAMOTO, H
    FUKUSHIMA, O
    JAPANESE JOURNAL OF PSYCHOLOGY, 1989, 60 (04): : 231 - 236
  • [37] CONSTANT AND VARIABLE DELAY OF REINFORCEMENT EFFECTS ON PROBABILITY LEARNING BY PIGEONS
    TOPPING, JS
    JOURNAL OF COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1970, 70 (01): : 141 - &
  • [38] Using Reinforcement Learning to Minimize the Probability of Delay Occurrence in Transportation
    Cao, Zhiguang
    Guo, Hongliang
    Song, Wen
    Gao, Kaizhou
    Chen, Zhenghua
    Zhang, Le
    Zhang, Xuexi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) : 2424 - 2436
  • [39] Quantum reinforcement learning control based on entropy and unequal probability
    Zhang, Yu-Yao
    Kuang, Sen
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (12): : 2277 - 2285
  • [40] REINFORCEMENT LEARNING FOR ROBOT CONTROL USING PROBABILITY DENSITY ESTIMATIONS
    Agostini, Alejandro
    Celaya, Enric
    ICINCO 2010: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 1, 2010, : 160 - 168