Bridge Bidding via Deep Reinforcement Learning and Belief Monte Carlo Search

Cited by: 1

Authors:

Qiu, Zizhang [1]

Wang, Shouguang [1]

You, Dan [1]

Zhou, MengChu [1]

Affiliations:

[1] Zhejiang Gongshang Univ, Sch Informat & Elect Engn, Hangzhou 310018, Peoples R China

Source:

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2024 / Vol. 11 / No. 10

Keywords:

Bridges; Monte Carlo methods; Supervised learning; Interference; Games; Deep reinforcement learning; Software; Contract Bridge; reinforcement learning; search; GO; ALGORITHM; GAME;

DOI:

10.1109/JAS.2024.124488

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Contract Bridge, a four-player imperfect-information game, comprises two phases: bidding and playing. While computer programs excel at playing, bidding remains challenging because it requires exchanging information with one's partner while interfering with the opponents' communication. In this work, we introduce a Bridge bidding agent that combines supervised learning, deep reinforcement learning via self-play, and a test-time search approach. Our experiments demonstrate that our agent outperforms WBridge5, a highly regarded computer Bridge program that has won multiple world championships, by 0.98 IMPs (international match points) per deal over 10 000 deals, with a much more cost-effective approach. This performance significantly surpasses the previous state of the art (0.85 IMPs per deal). Note that 0.1 IMPs per deal is a significant improvement in Bridge bidding.


Pages: 2111-2122

Page count: 12
