Approximate dynamic programming with a fuzzy parameterization

被引：48

作者：

Busoniu, Lucian ^{[1
]}

Ernst, Damien ^{[2
]}

De Schutter, Bart ^{[1
]}

Babuska, Robert ^{[1
]}

机构：

[1] Delft Univ Technol, Delft Ctr Syst &Control, NL-2628 CD Delft, Netherlands

[2] Univ Liege, Inst Montefiore, FNRS, B-4000 Liege, Belgium

来源：

AUTOMATICA | 2010年 / 46卷 / 05期

关键词：

Approximate dynamic programming; Fuzzy approximation; Value iteration; Convergence analysis; ALGORITHM;

D O I：

10.1016/j.automatica.2010.02.006

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Dynamic programming (DP) is a powerful paradigm for general, nonlinear optimal control. Computing exact DP solutions is in general only possible when the process states and the control actions take values in a small discrete set. In practice, it is necessary to approximate the solutions. Therefore, we propose an algorithm for approximate DP that relies on a fuzzy partition of the state space, and on a discretization of the action space. This fuzzy Q-iteration algorithm works for deterministic processes, under the discounted return criterion. We prove that fuzzy Q-iteration asymptotically converges to a solution that lies within a bound of the optimal solution. A bound on the suboptimality of the solution obtained in a finite number of iterations is also derived. Under continuity assumptions on the dynamics and on the reward function, we show that fuzzy Q-iteration is consistent, i.e., that it asymptotically obtains the optimal solution as the approximation accuracy increases. These properties hold both when the parameters of the approximator are updated in a synchronous fashion, and when they are updated asynchronously. The asynchronous algorithm is proven to converge at least as fast as the synchronous one. The performance of fuzzy Q-iteration is illustrated in a two-link manipulator control problem. (C) 2010 Elsevier Ltd. All rights reserved.

引用

页码：804 / 814

页数：11

共 50 条

[41] APPROXIMATE SOLUTION OF THE EQUATIONS OF DYNAMIC-PROGRAMMING
BISTRITSKAS, VB
USSR COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 1985, 25 (04): : 107 - 113
[42] A New Optimal Stepsize for Approximate Dynamic Programming
Ryzhov, Ilya O.
Frazier, Peter I.
Powell, Warren B.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (03) : 743 - 758
[43] Markdown Optimization via Approximate Dynamic Programming
Özlem Coşgun
Ufuk Kula
Cengiz Kahraman
International Journal of Computational Intelligence Systems, 2013, 6 : 64 - 78
[44] Revisiting Approximate Dynamic Programming and its Convergence
Heydari, Ali
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (12) : 2733 - 2743
[45] APPROXIMATE METHOD FOR SOLVING DYNAMIC PROGRAMMING PROBLEMS
ALEKSEYE.OG
ENGINEERING CYBERNETICS, 1971, 9 (03): : 447 - &
[46] Explicit MPC based on Approximate Dynamic Programming
Bakarac, Peter
Holaza, Juraj
Kaluz, Martin
Klauco, Martin
Lofberg, Johan
Kvasnica, Michal
2018 EUROPEAN CONTROL CONFERENCE (ECC), 2018, : 1172 - 1177
[47] Approximate theory for functional equations in dynamic programming
Basu, Manjusri, 1600, (12):
[48] Separable dynamic programming and approximate decomposition methods
Bertsekas, Dimitri P.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (05) : 911 - 916
[49] Approximate Dynamic Programming for Output Feedback Control
Jiang Yu
Jiang Zhong-Ping
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5815 - 5820
[50] Approximate dynamic programming approach for process control
Lee, Jay H.
INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 459 - 464

← 1 2 3 4 5 →