Evaluation Function Based Monte-Carlo LOA

被引:22
|
作者
Winands, Mark H. M. [1 ]
Bjornsson, Yngvi [2 ]
机构
[1] Maastricht Univ, Fac Humanities & Sci, Dept Knowledge Engn, Games & AI Grp, Maastricht, Netherlands
[2] Reykjavik Univ, Sch Comp Sci, Reykjavik, Iceland
来源
ADVANCES IN COMPUTER GAMES | 2010年 / 6048卷
关键词
D O I
10.1007/978-3-642-12993-3_4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, Monte-Carlo Tree Search (MCTS) has advanced the field of computer Go substantially. Also in the game of Lines of Action (LOA), which has been dominated so far by alpha beta, MCTS is making an inroad. In this paper we investigate how to use a positional evaluation function in a Monte-Carlo simulation-based LOA program (MC-LOA). Four different simulation strategies are designed, called Evaluation Cut-Off, Corrective, Greedy, and Mixed. They use an evaluation function in several ways. Experimental results reveal that the Mixed strategy is the best among them. This strategy draws the moves randomly based on their transition probabilities in the first part of a simulation, but selects them based on their evaluation score in the second part of a simulation. Using this simulation strategy the MC-LOA program plays at the same level as the alpha beta program MIA, the best LOA-playing entity in the world.
引用
收藏
页码:33 / +
页数:3
相关论文
共 50 条