Probability matching and reinforcement learning

Cited by: 5

Authors:

Rivas, Javier [1]

Affiliations:

[1] Univ Leicester, Dept Econ, Leicester LE1 7RH, Leics, England

Source:

JOURNAL OF MATHEMATICAL ECONOMICS | 2013 / Vol. 49 / Issue 01

Keywords:

Probability matching; Reinforcement learning; DECISION-MAKING; FORM GAMES

DOI:

10.1016/j.jmateco.2012.09.004

Chinese Library Classification (CLC) code:

F [Economics]

Discipline classification code:

02

Abstract:

Probability matching occurs when an action is chosen with a frequency equivalent to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologists and experimental economists. We provide an evolutionary foundation for this phenomenon by showing that learning by reinforcement can lead to probability matching and that, if the learning occurs sufficiently slowly, probability matching occurs not only in choice frequencies but also in choice probabilities. We complete our results by proving that there exists no quasi-linear reinforcement learning specification such that the behavior is optimal for all environments where counterfactuals are observed. (C) 2012 Elsevier B.V. All rights reserved.
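
To see why the abstract calls probability matching sub-optimal, consider a minimal sketch with two actions, where action A is the best choice with probability p and the decision maker's choice is independent of which action turns out to be best. This two-action setting, the independence assumption, and the function names below are illustrative assumptions, not the paper's model. Under matching, A is chosen with probability p, so the best action is picked with probability p^2 + (1 - p)^2; always choosing the more likely action succeeds with probability max(p, 1 - p), which is at least as large.

```python
def matching_success(p: float) -> float:
    """Chance of picking the best action under probability matching.

    Assumes two actions; A is best with probability p and is chosen
    with probability p, independently of which action is best.
    """
    return p * p + (1.0 - p) * (1.0 - p)


def maximizing_success(p: float) -> float:
    """Chance of picking the best action when always choosing the
    action that is more likely to be best."""
    return max(p, 1.0 - p)


if __name__ == "__main__":
    # Hypothetical values of p chosen only for illustration.
    for p in (0.5, 0.6, 0.7, 0.9):
        print(p, matching_success(p), maximizing_success(p))
```

The two rules coincide only at p = 0.5, p = 0 and p = 1; for any other p, matching picks the best action strictly less often.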


Pages: 17-21

Page count: 5
