Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

Cited by: 1
Authors
Qiu, Zizhang [1 ]
Wang, Shouguang [1 ]
You, Dan [1 ]
Zhou, MengChu [1 ]
Affiliations
[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China
Keywords
Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;
DOI
10.1109/JAS.2024.124488
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Contract Bridge, a four-player imperfect-information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding remains challenging because it requires exchanging information with one's partner while interfering with the opponents' communication. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge program that has won multiple world championships, by 0.98 IMPs (international match points) per deal over 10 000 deals, with a much more cost-effective approach. This performance significantly surpasses the previous state of the art (0.85 IMPs per deal). Note that 0.1 IMPs per deal is a significant improvement in Bridge bidding.
Pages: 2111 - 2122
Number of pages: 12
Related papers
50 in total
  • [31] A Deep Reinforcement Learning Bidding Algorithm on Electricity Market
    JIA Shuai
    GAN Zhongxue
    XI Yugeng
    LI Dewei
    XUE Shibei
    WANG Limin
    Journal of Thermal Science, 2020, 29 (05) : 1125 - 1134
  • [32] Deep Reinforcement Learning for Strategic Bidding in Electricity Markets
    Ye, Yujian
    Qiu, Dawei
    Sun, Mingyang
    Papadaskalopoulos, Dimitrios
    Strbac, Goran
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [33] A Deep Reinforcement Learning Bidding Algorithm on Electricity Market
    Shuai Jia
    Zhongxue Gan
    Yugeng Xi
    Dewei Li
    Shibei Xue
    Limin Wang
    Journal of Thermal Science, 2020, 29 : 1125 - 1134
  • [34] Deep Reinforcement Learning for Virtual Bidding in Electricity Markets
    Han D.
    Huang W.
    Yan Z.
    Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2022, 42 (04): : 1443 - 1454
  • [35] Adaptive Design of Alloys for CO2 Activation and Methanation via Reinforcement Learning Monte Carlo Tree Search Algorithm
    Song, Zhilong
    Zhou, Qionghua
    Lu, Shuaihua
    Dieb, Sae
    Ling, Chongyi
    Wang, Jinlan
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2023, 14 (14): : 3594 - 3601
  • [36] Adaptive Design of Alloys for CO2 Activation and Methanation via Reinforcement Learning Monte Carlo Tree Search Algorithm
    Song, Zhilong
    Zhou, Qionghua
    Lu, Shuaihua
    Dieb, Sae
    Ling, Chongyi
    Wang, Jinlan
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2023, : 3594 - 3601
  • [37] Bidding Strategy for Periodic Double Auctions Using Monte Carlo Tree Search
    Chowdhury, Moinul Morshed Porag
    Kiekintveld, Christopher
    Tran Cao Son
    Yeoh, William
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1897 - 1899
  • [38] Maximum Entropy Inverse Reinforcement Learning Using Monte Carlo Tree Search for Autonomous Driving
    da Silva, Junior Anderson Rodrigues
    Grassi Jr, Valdir
    Wolf, Denis Fernando
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) : 11552 - 11562
  • [39] Towards efficient discovery of green synthetic pathways with Monte Carlo tree search and reinforcement learning
    Wang, Xiaoxue
    Qian, Yujie
    Gao, Hanyu
    Coley, Connor W.
    Mo, Yiming
    Barzilay, Regina
    Jensen, Klavs F.
    CHEMICAL SCIENCE, 2020, 11 (40) : 10959 - 10972
  • [40] Adaptive Playouts in Monte-Carlo Tree Search with Policy-Gradient Reinforcement Learning
    Graf, Tobias
    Platzner, Marco
    ADVANCES IN COMPUTER GAMES, ACG 2015, 2015, 9525 : 1 - 11