Exploration and recency as the main proximate causes of probability matching: a reinforcement learning analysis

被引:0
|
作者
Carolina Feher da Silva
Camila Gomes Victorino
Nestor Caticha
Marcus Vinícius Chrysóstomo Baldo
机构
[1] Department of General Physics,
[2] Institute of Physics,undefined
[3] University of São Paulo,undefined
[4] Rua do Matão Nr. 1371,undefined
[5] Cidade Universitária,undefined
[6] Department of Physiology and Biophysics,undefined
[7] Institute of Biomedical Sciences,undefined
[8] University of São Paulo,undefined
[9] Av. Prof. Lineu Prestes,undefined
[10] 1524,undefined
[11] ICB-I,undefined
[12] Cidade Universitária,undefined
[13] Department of General Physics,undefined
[14] Institute of Physics,undefined
[15] University of São Paulo,undefined
[16] Rua do Matão Nr. 1371,undefined
[17] Cidade Universitária,undefined
[18] Department of Physiology and Biophysics,undefined
[19] Institute of Biomedical Sciences,undefined
[20] University of São Paulo,undefined
[21] Av. Prof. Lineu Prestes,undefined
[22] 1524,undefined
[23] ICB-I,undefined
[24] Cidade Universitária,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Research has not yet reached a consensus on why humans match probabilities instead of maximise in a probability learning task. The most influential explanation is that they search for patterns in the random sequence of outcomes. Other explanations, such as expectation matching, are plausible, but do not consider how reinforcement learning shapes people’s choices. We aimed to quantify how human performance in a probability learning task is affected by pattern search and reinforcement learning. We collected behavioural data from 84 young adult participants who performed a probability learning task wherein the majority outcome was rewarded with 0.7 probability, and analysed the data using a reinforcement learning model that searches for patterns. Model simulations indicated that pattern search, exploration, recency (discounting early experiences), and forgetting may impair performance. Our analysis estimated that 85% (95% HDI [76, 94]) of participants searched for patterns and believed that each trial outcome depended on one or two previous ones. The estimated impact of pattern search on performance was, however, only 6%, while those of exploration and recency were 19% and 13% respectively. This suggests that probability matching is caused by uncertainty about how outcomes are generated, which leads to pattern search, exploration, and recency.
引用
收藏
相关论文
共 7 条