Monte-Carlo Simulation Balancing Revisited

Cited by: 0

Authors:

Graf, Tobias [1]

Platzner, Marco [2]

Affiliations:

[1] Univ Paderborn, Int Grad Sch Dynam Intelligent Syst, Paderborn, Germany

[2] Univ Paderborn, Paderborn, Germany

Source:

2016 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG) | 2016

Keywords:

GAME;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Simulation Balancing is an optimization algorithm to automatically tune the parameters of a playout policy used inside a Monte Carlo Tree Search. The algorithm fits a policy so that the expected result of a policy matches given target values of the training set. Up to now it has been successfully applied to Computer Go on small 9 x 9 boards but failed for larger board sizes like 19 x 19. On these large boards apprenticeship learning, which fits a policy so that it closely follows an expert, continues to be the algorithm of choice. In this paper we introduce several improvements to the original simulation balancing algorithm and test their effectiveness in Computer Go. The proposed additions remove the necessity to generate target values by deep searches, optimize faster and make the algorithm less prone to overfitting. The experiments show that simulation balancing improves the playing strength of a Go program using apprenticeship learning by more than 200 Elo on the large board size 19 x 19.

Pages: 7
