Parameter-Free Sampled Fictitious Play for Solving Deterministic Dynamic Programming Problems

被引:1
|
作者
Dolinskaya, Irina S. [1 ]
Epelman, Marina A. [2 ]
Sir, Esra Sisikoglu [3 ]
Smith, Robert L. [2 ]
机构
[1] Northwestern Univ, Dept Ind Engn & Management Sci, Evanston, IL 60208 USA
[2] Univ Michigan, Dept Ind & Operat Engn, Ann Arbor, MI 48109 USA
[3] Mayo Clin, Off Access Management, Rochester, MN 55905 USA
关键词
Sampled fictitious play; Dynamic programming; Maritime navigation; ALGORITHMS; PATHS; WAVES; CURVATURE;
D O I
10.1007/s10957-015-0798-5
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper, we present a parameter-free variation of the Sampled Fictitious Play algorithm that facilitates fast solution of deterministic dynamic programming problems. Its random tie-breaking procedure imparts a natural randomness to the algorithm which prevents it from "getting stuck" at a local optimal solution and allows the discovery of an optimal path in a finite number of iterations. Furthermore, we illustrate through an application to maritime navigation that, in practice, a parameter-free Sampled Fictitious Play algorithm finds a high-quality solution after only a few iterations, in contrast with traditional methods.
引用
收藏
页码:631 / 655
页数:25
相关论文
共 50 条
  • [21] Parameter-free, Dynamic, and Strongly-Adaptive Online Learning
    Cutkosky, Ashok
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [22] APPROXIMATE METHOD FOR SOLVING DYNAMIC PROGRAMMING PROBLEMS
    ALEKSEYE.OG
    ENGINEERING CYBERNETICS, 1971, 9 (03): : 447 - &
  • [23] NeuroGenetic Approach for Solving Dynamic Programming Problems
    Pires, Matheus Giovanni
    da Silva, Ivan Nunes
    Bertoni, Fabiana Cristina
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 2143 - +
  • [24] Neurogenetic Approach for Solving Dynamic Programming Problems
    Pires, Matheus Giovanni
    da Silva, Ivan Nunes
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II, 2010, 6114 : 72 - +
  • [25] Solving Dynamic Programming Problems on a Computational Grid
    Yongyang Cai
    Kenneth L. Judd
    Greg Thain
    Stephen J. Wright
    Computational Economics, 2015, 45 : 261 - 284
  • [26] Solving Dynamic Programming Problems on a Computational Grid
    Cai, Yongyang
    Judd, Kenneth L.
    Thain, Greg
    Wright, Stephen J.
    COMPUTATIONAL ECONOMICS, 2015, 45 (02) : 261 - 284
  • [27] Parameter-Free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
    Saglam, Baturay
    Mutlu, Furkan Burak
    Cicek, Dogan Can
    Kozat, Suleyman Serdar
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [28] Parameter-Free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
    Baturay Saglam
    Furkan Burak Mutlu
    Dogan Can Cicek
    Suleyman Serdar Kozat
    Neural Processing Letters, 56
  • [29] On solving optimal control problems with free initial condition using iterative dynamic programming
    Mekarapiruk, W
    Luus, R
    CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2001, 79 (05): : 777 - 784
  • [30] Solving dynamic portfolio problems using stochastic programming
    Consigli, G
    Dempster, MAH
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1997, 77 : S535 - S536