Online Iterative Adaptive Dynamic Programming Approach for Solving the Zero-Sum Game for Nonlinear Continuous-Time Systems with Partially Unknown Dynamics

被引:1
|
作者
Fu, Bin [1 ]
Sun, Bo [2 ]
Guo, Hang [1 ]
Yang, Tao [1 ]
Fu, Wenxing [1 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Shanxi, Peoples R China
[2] Delft Univ Technol, Fac Aerosp Engn, NL-2629 HS Delft, Netherlands
关键词
Approximation dynamic programming; Zero-sum game; Integral reinforcement learning; Online learning; Value iteration; REINFORCEMENT; DESIGN;
D O I
10.1007/978-981-99-0479-2_262
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The current study presents an online iterative adaptive dynamic programming approach to resolve the zero-sum game (ZSG) for nonlinear continuous-time (CT) systems containing a partially unknown dynamic. The Hamilton-Jacobian-Issacs (HJI) equation is solved along the state trajectory according to the value function approximation and the policy improvement online. Relaxed dynamic programming is utilized to ensure the algorithm's convergence. Model and costate networks were established to conduct the method. Computational simulations are performed to present the efficiency of the algorithm.
引用
收藏
页码:2833 / 2842
页数:10
相关论文
共 50 条
  • [1] Event-Triggered Adaptive Dynamic Programming for Zero-Sum Game of Partially Unknown Continuous-Time Nonlinear Systems
    Xue, Shan
    Luo, Biao
    Liu, Derong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09): : 3189 - 3199
  • [2] Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data
    Zhu, Yuanheng
    Zhao, Dongbin
    Li, Xiangjun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 714 - 725
  • [3] A Single-NN Iterative Adaptive Dynamic Programming Algorithm for Continuous-Time Nonlinear Zero-Sum Games
    Song, Ruizhuo
    Li, Junsong
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2848 - 2853
  • [4] Approximate dynamic programming for two-player zero-sum game related to H∞ control of unknown nonlinear continuous-time systems
    Sholeh Yasini
    Mohammad Bagher Naghibi Sistani
    Ali Karimpour
    International Journal of Control, Automation and Systems, 2015, 13 : 99 - 109
  • [5] Approximate Dynamic Programming for Two-player Zero-sum Game Related to H∞ Control of Unknown Nonlinear Continuous-time Systems
    Yasini, Sholeh
    Bagher, Mohammad
    Sistani, Naghibi
    Karimpour, Ali
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2015, 13 (01) : 99 - 109
  • [6] Event-Triggered Adaptive Dynamic Programming for Continuous-Time Nonlinear Two-Player Zero-Sum Game
    Xue, Shan
    Luo, Biao
    Liu, Derong
    Li, Yueheng
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VII, 2018, 11307 : 15 - 25
  • [7] Online Solution of Two-Player Zero-Sum Games for Continuous-Time Nonlinear Systems With Completely Unknown Dynamics
    Fu, Yue
    Chai, Tianyou
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (12) : 2577 - 2587
  • [8] An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    Zhang, Huaguang
    Wei, Qinglai
    Liu, Derong
    AUTOMATICA, 2011, 47 (01) : 207 - 214
  • [9] Adaptive dynamic programming for online solution of a zero-sum differential game
    Vrabie D.
    Lewis F.
    Journal of Control Theory and Applications, 2011, 9 (03): : 353 - 360
  • [10] Adaptive dynamic programming for online solution of a zero-sum differential game
    Draguna VRABIE
    Frank LEWIS
    JournalofControlTheoryandApplications, 2011, 9 (03) : 353 - 360