Parameter-Free Sampled Fictitious Play for Solving Deterministic Dynamic Programming Problems

被引：1

作者：

Dolinskaya, Irina S. ^{[1
]}

Epelman, Marina A. ^{[2
]}

Sir, Esra Sisikoglu ^{[3
]}

Smith, Robert L. ^{[2
]}

机构：

[1] Northwestern Univ, Dept Ind Engn & Management Sci, Evanston, IL 60208 USA

[2] Univ Michigan, Dept Ind & Operat Engn, Ann Arbor, MI 48109 USA

[3] Mayo Clin, Off Access Management, Rochester, MN 55905 USA

来源：

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS | 2016年 / 169卷 / 02期

关键词：

Sampled fictitious play; Dynamic programming; Maritime navigation; ALGORITHMS; PATHS; WAVES; CURVATURE;

D O I：

10.1007/s10957-015-0798-5

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

In this paper, we present a parameter-free variation of the Sampled Fictitious Play algorithm that facilitates fast solution of deterministic dynamic programming problems. Its random tie-breaking procedure imparts a natural randomness to the algorithm which prevents it from "getting stuck" at a local optimal solution and allows the discovery of an optimal path in a finite number of iterations. Furthermore, we illustrate through an application to maritime navigation that, in practice, a parameter-free Sampled Fictitious Play algorithm finds a high-quality solution after only a few iterations, in contrast with traditional methods.

引用

页码：631 / 655

页数：25

共 50 条

[21] Parameter-free, Dynamic, and Strongly-Adaptive Online Learning
Cutkosky, Ashok
25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
[22] APPROXIMATE METHOD FOR SOLVING DYNAMIC PROGRAMMING PROBLEMS
ALEKSEYE.OG
ENGINEERING CYBERNETICS, 1971, 9 (03): : 447 - &
[23] NeuroGenetic Approach for Solving Dynamic Programming Problems
Pires, Matheus Giovanni
da Silva, Ivan Nunes
Bertoni, Fabiana Cristina
2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 2143 - +
[24] Neurogenetic Approach for Solving Dynamic Programming Problems
Pires, Matheus Giovanni
da Silva, Ivan Nunes
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II, 2010, 6114 : 72 - +
[25] Solving Dynamic Programming Problems on a Computational Grid
Yongyang Cai
Kenneth L. Judd
Greg Thain
Stephen J. Wright
Computational Economics, 2015, 45 : 261 - 284
[26] Solving Dynamic Programming Problems on a Computational Grid
Cai, Yongyang
Judd, Kenneth L.
Thain, Greg
Wright, Stephen J.
COMPUTATIONAL ECONOMICS, 2015, 45 (02) : 261 - 284
[27] Parameter-Free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Saglam, Baturay
Mutlu, Furkan Burak
Cicek, Dogan Can
Kozat, Suleyman Serdar
NEURAL PROCESSING LETTERS, 2024, 56 (02)
[28] Parameter-Free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Baturay Saglam
Furkan Burak Mutlu
Dogan Can Cicek
Suleyman Serdar Kozat
Neural Processing Letters, 56
[29] On solving optimal control problems with free initial condition using iterative dynamic programming
Mekarapiruk, W
Luus, R
CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2001, 79 (05): : 777 - 784
[30] Solving dynamic portfolio problems using stochastic programming
Consigli, G
Dempster, MAH
ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1997, 77 : S535 - S536

← 1 2 3 4 5 →