Adaptive Dynamic Programming and Optimal Control of Unknown Multiplayer Systems Based on Game Theory

被引:4
|
作者
Zhao, Jingang [1 ,2 ]
机构
[1] Weifang Univ, Sch Machinery & Automat, Weifang 261061, Shandong, Peoples R China
[2] Weifang Univ, Inst Intelligent Percept & Control Complex Syst, Weifang 261061, Shandong, Peoples R China
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Games; Mathematical models; Heuristic algorithms; Approximation algorithms; System dynamics; Vehicle dynamics; Optimal control; Adaptive dynamic programming; nonzero-sum (NZS) games; multi-player systems; coupled Hamilton-Jacobi (HJ) equations; neural network (NN); ZERO-SUM GAMES; UNCERTAIN NONLINEAR-SYSTEMS; OPTIMAL TRACKING CONTROL; HORIZON OPTIMAL-CONTROL; TIME LINEAR-SYSTEMS; DESIGN;
D O I
10.1109/ACCESS.2022.3193505
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a new adaptive dynamic programming (ADP) scheme to solve the optimal control problem of multi-player systems with unknown dynamics from the perspective of nonzero-sum (NZS) games. In the presented scheme, a new iterative equation is given. On the basis of the given iterative equation, the control policy and corresponding value function for each player can be learned by using the state and input data, which does not need to identify the system dynamics. To overcome the difficulty of unknown system dynamics, neural network (NN)-based function approximation techniques are employed in the implementation. Based on the given iterative equation and NN-based function approximation techniques, a new non-model-based ADP algorithm is developed. The convergence of the developed non-model-based ADP algorithm is rigorously analyzed and proved. Finally, two numerical simulation examples are provided to demonstrate the performance of the developed non-model-based ADP algorithm.
引用
收藏
页码:77695 / 77706
页数:12
相关论文
共 50 条
  • [1] Stability and optimal control of a multiplayer dynamic game
    Scheffran, J
    OPERATIONS RESEARCH PROCEEDINGS 2000, 2001, : 14 - 19
  • [2] Extended adaptive optimal control of linear systems with unknown dynamics using adaptive dynamic programming
    Gan, Minggang
    Zhao, Jingang
    Zhang, Chi
    ASIAN JOURNAL OF CONTROL, 2021, 23 (02) : 1097 - 1106
  • [3] Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    Wang, Ding
    Liu, Derong
    Wei, Qinglai
    Zhao, Dongbin
    Jin, Ning
    AUTOMATICA, 2012, 48 (08) : 1825 - 1832
  • [4] An Unknown Multiplayer Nonzero-Sum Game: Prescribed-Time Dynamic Event-Triggered Control via Adaptive Dynamic Programming
    Zhang, Kun
    Zhang, Zhi-Xuan
    Xie, Xiang Peng
    Rubio, Jose de Jesus
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [5] A novel optimal control design for unknown nonlinear systems based on adaptive dynamic programming and nonlinear model predictive control
    Hu, Wei
    Zhang, Guoshan
    Zheng, Yuqing
    ASIAN JOURNAL OF CONTROL, 2022, 24 (04) : 1638 - 1649
  • [6] Adaptive Dynamic Programming-based Optimal Control of Unknown Affine Nonlinear Discrete-time Systems
    Dierks, Travis
    Thumati, Balaje T.
    Jagannathan, S.
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 1368 - 1373
  • [7] Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming
    Liu, Derong
    Huang, Yuzhu
    Wang, Ding
    Wei, Qinglai
    INTERNATIONAL JOURNAL OF CONTROL, 2013, 86 (09) : 1554 - 1566
  • [8] Adaptive Dynamic Programming for Optimal Tracking Control of Unknown Nonlinear Systems With Application to Coal Gasification
    Wei, Qinglai
    Liu, Derong
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (04) : 1020 - 1036
  • [9] Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game
    Wei, Qinglai
    Zhu, Liao
    Song, Ruizhuo
    Zhang, Pinjia
    Liu, Derong
    Xiao, Jun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) : 879 - 892
  • [10] Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming☆
    Xie, Kedi
    Zheng, Yiwei
    Jiang, Yi
    Lan, Weiyao
    Yu, Xiao
    AUTOMATICA, 2024, 163