Risk preference as an outcome of evolutionarily adaptive learning mechanisms: An evolutionary simulation under diverse risky environments

被引:1
|
作者
Homma, Shogo [1 ,2 ,3 ]
Takezawa, Masanori [1 ,4 ,5 ]
机构
[1] Hokkaido Univ, Grad Sch Humanities & Human Sci, Dept Behav Sci, Sapporo, Hokkaido, Japan
[2] Japan Soc Promot Sci, Tokyo, Japan
[3] Nagoya Univ, Grad Sch Informat, Dept Cognit & Psychol Sci, Nagoya, Aichi, Japan
[4] Hokkaido Univ, Ctr Expt Res Social Sci, Sapporo, Hokkaido, Japan
[5] Hokkaido Univ, Ctr Human Nat Artificial Intelligence & Neurosci, Sapporo, Hokkaido, Japan
来源
PLOS ONE | 2024年 / 19卷 / 08期
关键词
DECISION; SENSITIVITY; INFORMATION; SELECTION; RATES;
D O I
10.1371/journal.pone.0307991
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The optimization of cognitive and learning mechanisms can reveal complicated behavioral phenomena. In this study, we focused on reinforcement learning, which uses different learning rules for positive and negative reward prediction errors. We attempted to relate the evolved learning bias to the complex features of risk preference such as domain-specific behavior manifests and the relatively stable domain-general factor underlying behaviors. The simulations of the evolution of the two learning rates under diverse risky environments showed that the positive learning rate evolved on average to be higher than the negative one, when agents experienced both tasks where risk aversion was more rewarding and risk seeking was more rewarding. This evolution enabled agents to flexibly choose more reward behaviors depending on the task type. The evolved agents also demonstrated behavioral patterns described by the prospect theory. Our simulations captured two aspects of the evolution of risk preference: the domain-specific aspect, behavior acquired through learning in a specific context; and the implicit domain-general aspect, corresponding to the learning rates shaped through evolution to adaptively behave in a wide range of environments. These results imply that our framework of learning under the innate constraint may be useful in understanding the complicated behavioral phenomena.
引用
收藏
页数:22
相关论文
empty
未找到相关数据