Learning pareto-optimal solutions in 2x2 conflict games

被引:1
|
作者
Airiau, Stephane [1 ]
Sen, Sandip [1 ]
机构
[1] Univ Tulsa, Dept Math & Comp Sci, Tulsa, OK 74104 USA
关键词
D O I
10.1007/11691839_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiagent learning literature has investigated iterated two-player games to develop mechanisms that allow agents to learn to converge on Nash Equilibrium strategy profiles. Such equilibrium configurations imply that no player has the motivation to unilaterally change its strategy. Often, in general sum games, a higher payoff can be obtained by both players if one chooses not to respond myopically to the other player. By developing mutual trust, agents can avoid immediate best responses that will lead to a Nash Equilibrium with lesser payoff. In this paper we experiment with agents who select actions based on expected utility calculations that incorporate the observed frequencies of the actions of the opponent(s). We augment these stochastically greedy agents with an interesting action revelation strategy that involves strategic declaration of one's commitment to an action to avoid worst-case, pessimistic moves. We argue that in certain situations, such apparently risky action revelation can indeed produce better payoffs than a nonrevealing approach. In particular, it is possible to obtain Pareto-optimal Nash Equilibrium outcomes. We improve on the outcome efficiency of a previous algorithm and present results over the set of structurally distinct two-person two-action conflict games where the players' preferences form a total order over the possible outcomes. We also present results on a large number of randomly generated payoff matrices of varying sizes and compare the payoffs of strategically revealing learners to payoffs at Nash equilibrium.
引用
收藏
页码:86 / 99
页数:14
相关论文
共 50 条
  • [41] Reinforcement Learning of Pareto-Optimal Multiobjective Policies Using Steering
    Vamplew, Peter
    Issabekov, Rustam
    Dazeley, Richard
    Foale, Cameron
    AI 2015: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2015, 9457 : 596 - 608
  • [42] The payoff matrix of repeated asymmetric 2x2 games
    Elshobaky, Entisarat M.
    El-Seidy, Essam
    Taha, Mohammad A.
    APPLIED MATHEMATICS AND COMPUTATION, 2016, 274 : 393 - 402
  • [43] ENTRAPMENT IN 2X2 GAMES WITH FORCE VULNERABLE EQUILIBRIA
    STEVENSON, MB
    PHILLIPS, JL
    BEHAVIORAL SCIENCE, 1972, 17 (04): : 361 - +
  • [44] INTROSPECTION AND EQUILIBRIUM SELECTION IN 2X2 MATRIX GAMES
    OLCINA, G
    URBANO, A
    INTERNATIONAL JOURNAL OF GAME THEORY, 1994, 23 (03) : 183 - 206
  • [45] LIMITED-MOVE EQUILIBRIA IN 2X2 GAMES
    ZAGARE, FC
    THEORY AND DECISION, 1984, 16 (01) : 1 - 19
  • [46] Stationary Concepts for Experimental 2x2 Games: Reply
    Selten, Reinhard
    Chmura, Thorsten
    Goerg, Sebastian J.
    AMERICAN ECONOMIC REVIEW, 2011, 101 (02): : 1041 - 1044
  • [47] 2X2 GAMES OF COMMITMENT IN WORLD-POLITICS
    REISINGER, WM
    INTERNATIONAL STUDIES QUARTERLY, 1989, 33 (01) : 111 - 118
  • [48] Searching for robust Pareto-optimal solutions in multi-objective optimization
    Deb, K
    Gupta, H
    EVOLUTIONARY MULTI-CRITERION OPTIMIZATION, 2005, 3410 : 150 - 164
  • [49] Post-hoc Selection of Pareto-Optimal Solutions in Search and Recommendation
    Paparella, Vincenzo
    Anelli, Vito Walter
    Nardini, Franco Maria
    Perego, Raffaele
    Di Noia, Tommaso
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2013 - 2023