Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

Cited by: 1
Authors
Qiu, Zizhang [1]
Wang, Shouguang [1]
You, Dan [1]
Zhou, MengChu [1]
Affiliations
[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China
Keywords
Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;
DOI
10.1109/JAS.2024.124488
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Contract Bridge, a four-player imperfect-information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding remains challenging because it requires exchanging information with one's partner while interfering with the opponents' communication. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge program that has won multiple world championships, by 0.98 IMPs (international match points) per deal over 10 000 deals, with a much more cost-effective approach. This performance significantly surpasses the previous state of the art (0.85 IMPs per deal). Note that 0.1 IMPs per deal is a significant improvement in Bridge bidding.
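The abstract reports results in IMPs per deal. As a rough illustration of that metric only, the sketch below converts a per-deal score difference into IMPs using the standard duplicate Bridge IMP scale and averages over deals. The function names and the sample scores are hypothetical, and the record does not describe the paper's own evaluation pipeline, so this should not be read as the authors' procedure.

```python
from bisect import bisect_right

# Lower bound of the score-difference band for 1, 2, ..., 24 IMPs
# (standard duplicate Bridge IMP scale; differences of 0-10 points score 0 IMPs).
_IMP_THRESHOLDS = [
    20, 50, 90, 130, 170, 220, 270, 320, 370, 430, 500, 600,
    750, 900, 1100, 1300, 1500, 1750, 2000, 2250, 2500, 3000, 3500, 4000,
]


def score_diff_to_imps(diff):
    """Convert a signed score difference on one deal into signed IMPs."""
    magnitude = bisect_right(_IMP_THRESHOLDS, abs(diff))
    return magnitude if diff >= 0 else -magnitude


def mean_imps_per_deal(our_scores, their_scores):
    """Average IMPs per deal, given both sides' scores on the same deals.

    Hypothetical helper: each pair of scores is assumed to come from the
    same deal played from the same seats by the two agents being compared.
    """
    if len(our_scores) != len(their_scores) or not our_scores:
        raise ValueError("score lists must be non-empty and of equal length")
    imps = [score_diff_to_imps(a - b) for a, b in zip(our_scores, their_scores)]
    return sum(imps) / len(imps)


if __name__ == "__main__":
    # Made-up scores for two deals, for illustration only.
    print(mean_imps_per_deal([620, 100], [170, 150]))
```

Because the IMP scale compresses large swings, an average margin of a fraction of an IMP per deal over thousands of deals can still reflect a consistent edge, which is why the abstract treats 0.1 IMPs per deal as meaningful.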
Pages: 2111-2122
Page count: 12
Related papers
50 in total
  • [1] Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search
    Zizhang Qiu
    Shouguang Wang
    Dan You
    MengChu Zhou
    IEEE/CAA Journal of Automatica Sinica, 2024, 11 (10) : 2111 - 2122
  • [2] Automatic Bridge Bidding Using Deep Reinforcement Learning
    Yeh, Chih-Kuan
    Lin, Hsuan-Tien
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1362 - 1369
  • [3] Automatic Bridge Bidding Using Deep Reinforcement Learning
    Yeh, Chih-Kuan
    Hsieh, Cheng-Yu
    Lin, Hsuan-Tien
    IEEE TRANSACTIONS ON GAMES, 2018, 10 (04) : 365 - 377
  • [4] On Monte Carlo Tree Search and Reinforcement Learning
    Vodopivec, Tom
    Samothrakis, Spyridon
    Ster, Branko
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 60 : 881 - 936
  • [5] Deep Reinforcement Learning Using Optimized Monte Carlo Tree Search in EWN
    Zhang, Yixian
    Li, Zhuoxuan
    Cao, Yiding
    Zhao, Xuan
    Cao, Jinde
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (03) : 544 - 555
  • [6] Monte Carlo Tree Search for Bayesian Reinforcement Learning
    Vien, Ngo Anh
    Ertel, Wolfgang
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 138 - 143
  • [7] DeepMCTS: Deep Reinforcement Learning Assisted Monte Carlo Tree Search for MIMO Detection
    Mo, Tz-Wei
    Chang, Ronald Y.
    Kan, Te-Yi
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [8] Reinforcement Learning - A Bridge Between Numerical Methods and Monte Carlo
    Borkar, Vivek S.
    PERSPECTIVES IN MATHEMATICAL SCIENCES I: PROBABILITY AND STATISTICS, 2009, 7 : 71 - 91
  • [9] MetroZero: Deep Reinforcement Learning and Monte Carlo Tree Search for Optimized Metro Network Expansion
    Alkilane, Khaled
    Lee, Der-Horng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 810 - 823
  • [10] Monte Carlo Tree Search With Reinforcement Learning for Motion Planning
    Weingertner, Philippe
    Ho, Minnie
    Timofeev, Andrey
    Aubert, Sebastien
    Pita-Gil, Guillermo
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,