Evolutionary instability of selfish learning in repeated games

被引:9
|
作者
McAvoy, Alex [1 ,2 ]
Kates-Harbeck, Julian [3 ]
Chatterjee, Krishnendu [4 ]
Hilbe, Christian [5 ]
机构
[1] Univ Penn, Dept Math, Philadelphia, PA 19104 USA
[2] Univ Penn, Ctr Math Biol, Philadelphia, PA 19104 USA
[3] Harvard Univ, Dept Phys, Cambridge, MA USA
[4] IST Austria, Klosterneuburg, Austria
[5] Max Planck Inst Evolutionary Biol, Max Planck Res Grp Dynam Social Behav, Plon, Germany
来源
PNAS NEXUS | 2022年 / 1卷 / 04期
基金
欧洲研究理事会;
关键词
INFINITELY REPEATED GAMES; SOCIAL PREFERENCES; RANDOM SEARCH; COOPERATION; DYNAMICS; STRATEGIES; FAIRNESS; EFFICIENCY; EXTORTION; ANSWER;
D O I
10.1093/pnasnexus/pgac141
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Across many domains of interaction, both natural and artificial, individuals use past experience to shape future behaviors. The results of such learning processes depend on what individuals wish to maximize. A natural objective is one's own success. However, when two such "selfish" learners interact with each other, the outcome can be detrimental to both, especially when there are conflicts of interest. Here, we explore how a learner can align incentives with a selfish opponent. Moreover, we consider the dynamics that arise when learning rules themselves are subject to evolutionary pressure. By combining extensive simulations and analytical techniques, we demonstrate that selfish learning is unstable in most classical two-player repeated games. If evolution operates on the level of long-run payoffs, selection instead favors learning rules that incorporate social (other-regarding) preferences. To further corroborate these results, we analyze data from a repeated prisoner's dilemma experiment. We find that selfish learning is insufficient to explain human behavior when there is a trade-off between payoff maximization and fairness.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] URN MODELS FOR EVOLUTIONARY LEARNING GAMES
    TRACY, ND
    SEAMAN, JW
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1992, 36 (02) : 278 - 282
  • [22] Evolutionary stability in repeated extensive games played by finite automata
    Andreozzi, Luciano
    GAMES AND ECONOMIC BEHAVIOR, 2013, 79 : 67 - 74
  • [23] Characterizing the Dynamics of Learning in Repeated Reference Games
    Hawkins, Robert D.
    Frank, Michael C.
    Goodman, Noah D.
    COGNITIVE SCIENCE, 2020, 44 (06)
  • [24] Modeling opponent learning in multiagent repeated games
    Hu, Yudong
    Han, Congying
    Li, Haoran
    Guo, Tiande
    APPLIED INTELLIGENCE, 2023, 53 (13) : 17194 - 17210
  • [25] Individual Learning and Cooperation in Noisy Repeated Games
    Yamamoto, Yuichi
    REVIEW OF ECONOMIC STUDIES, 2014, 81 (01): : 473 - 500
  • [26] Modeling opponent learning in multiagent repeated games
    Yudong Hu
    Congying Han
    Haoran Li
    Tiande Guo
    Applied Intelligence, 2023, 53 : 17194 - 17210
  • [27] Bayesian learning in repeated games of incomplete information
    Nachbar, JH
    SOCIAL CHOICE AND WELFARE, 2001, 18 (02) : 303 - 326
  • [28] A generalized approach to belief learning in repeated games
    Ioannou, Christos A.
    Romero, Julian
    GAMES AND ECONOMIC BEHAVIOR, 2014, 87 : 178 - 203
  • [29] Bayesian learning in repeated games of incomplete information
    John H. Nachbar
    Social Choice and Welfare, 2001, 18 : 303 - 326
  • [30] Learning in repeated stochastic network aggregative games
    Meigs, Emily
    Parise, Francesca
    Ozdaglar, Asuman
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 6918 - 6923