Learning Pareto-optimal solutions in 2x2 conflict games

Cited by: 1
Authors
Airiau, Stephane [1]
Sen, Sandip [1]
Affiliation
[1] Univ Tulsa, Dept Math & Comp Sci, Tulsa, OK 74104 USA
DOI
10.1007/11691839_4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Multiagent learning literature has investigated iterated two-player games to develop mechanisms that allow agents to learn to converge on Nash Equilibrium strategy profiles. Such equilibrium configurations imply that no player has the motivation to unilaterally change its strategy. Often, in general sum games, a higher payoff can be obtained by both players if one chooses not to respond myopically to the other player. By developing mutual trust, agents can avoid immediate best responses that will lead to a Nash Equilibrium with lesser payoff. In this paper we experiment with agents who select actions based on expected utility calculations that incorporate the observed frequencies of the actions of the opponent(s). We augment these stochastically greedy agents with an interesting action revelation strategy that involves strategic declaration of one's commitment to an action to avoid worst-case, pessimistic moves. We argue that in certain situations, such apparently risky action revelation can indeed produce better payoffs than a nonrevealing approach. In particular, it is possible to obtain Pareto-optimal Nash Equilibrium outcomes. We improve on the outcome efficiency of a previous algorithm and present results over the set of structurally distinct two-person two-action conflict games where the players' preferences form a total order over the possible outcomes. We also present results on a large number of randomly generated payoff matrices of varying sizes and compare the payoffs of strategically revealing learners to payoffs at Nash equilibrium.
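The notions the abstract relies on (expected utility computed from observed opponent action frequencies, pure-strategy Nash equilibria, and Pareto optimality) can be illustrated with a minimal Python sketch. This is not the authors' algorithm and it does not implement the action revelation strategy; the payoff table is a hypothetical Prisoner's Dilemma with ordinal payoffs 1 to 4, and all function names are assumptions made for illustration.

```python
from itertools import product

# Hypothetical 2x2 game (illustrative, not data from the paper):
# PAYOFFS[(row_action, col_action)] = (row_payoff, col_payoff).
# Each player ranks the four outcomes 1..4, i.e. a total order over outcomes.
PAYOFFS = {
    (0, 0): (3, 3),
    (0, 1): (1, 4),
    (1, 0): (4, 1),
    (1, 1): (2, 2),
}


def expected_utilities(payoffs, opponent_counts):
    """Row player's expected utility for each action, given observed
    counts of how often the column player chose action 0 and action 1."""
    total = sum(opponent_counts)
    freqs = [count / total for count in opponent_counts]
    return [
        sum(freqs[b] * payoffs[(a, b)][0] for b in (0, 1))
        for a in (0, 1)
    ]


def pure_nash(payoffs):
    """Outcomes from which neither player gains by deviating unilaterally."""
    equilibria = []
    for a, b in product((0, 1), repeat=2):
        row_ok = payoffs[(a, b)][0] >= payoffs[(1 - a, b)][0]
        col_ok = payoffs[(a, b)][1] >= payoffs[(a, 1 - b)][1]
        if row_ok and col_ok:
            equilibria.append((a, b))
    return equilibria


def pareto_optimal(payoffs):
    """Outcomes not dominated by any other outcome (no outcome is at least
    as good for both players and strictly better for one)."""
    optimal = []
    for outcome, (u1, u2) in payoffs.items():
        dominated = any(
            v1 >= u1 and v2 >= u2 and (v1 > u1 or v2 > u2)
            for v1, v2 in payoffs.values()
        )
        if not dominated:
            optimal.append(outcome)
    return optimal


if __name__ == "__main__":
    # The opponent was observed playing action 0 three times and action 1 once.
    print(expected_utilities(PAYOFFS, [3, 1]))
    print(pure_nash(PAYOFFS))
    print(pareto_optimal(PAYOFFS))
```

In this hypothetical game the only pure-strategy Nash equilibrium, (1, 1), is not Pareto-optimal, since both players do better at (0, 0). This is the kind of situation the abstract describes, where myopic best responses lead to an equilibrium with a lower payoff.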
Pages: 86-99
Number of pages: 14