Learning-based control for discrete-time constrained nonzero-sum games

被引:6
|
作者
Mu, Chaoxu [1 ]
Peng, Jiangwen [1 ]
Tang, Yufei [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
[2] Florida Atlantic Univ, Dept Comp Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
基金
中国国家自然科学基金;
关键词
EXPERIENCE REPLAY; SYSTEMS;
D O I
10.1049/cit2.12015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A generalized policy-iteration-based solution to a class of discrete-time multi-player non-zero-sum games concerning the control constraints was proposed. Based on initial admissible control policies, the iterative value function of each player converges to the optimum approximately, which is structured by the iterative control policies satisfying the Nash equilibrium. Afterwards, the stability analysis is shown to illustrate that the iterative control policies can stabilize the system and minimize the performance index function of each player. Meanwhile, neural networks are implemented to approximate the iterative control policies and value functions with the impact of control constraints. Finally, two numerical simulations of the discrete-time two-player non-zero-sum games for linear and non-linear systems are shown to illustrate the effectiveness of the proposed scheme.
引用
收藏
页码:203 / 213
页数:11
相关论文
共 50 条
  • [1] Nonzero-sum constrained discrete-time Markov games: the case of unbounded costs
    Wenzhao Zhang
    Yonghui Huang
    Xianping Guo
    TOP, 2014, 22 : 1074 - 1102
  • [2] Nonzero-sum constrained discrete-time Markov games: the case of unbounded costs
    Zhang, Wenzhao
    Huang, Yonghui
    Guo, Xianping
    TOP, 2014, 22 (03) : 1074 - 1102
  • [3] NONZERO-SUM EXPECTED AVERAGE DISCRETE-TIME STOCHASTIC GAMES: THE CASE OF UNCOUNTABLE SPACES
    Wei, Qingda
    Chen, Xian
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2019, 57 (06) : 4099 - 4124
  • [4] The Nonzero-sum Games of Discrete -time Stochastic Singular Systems
    Zhou, Haiying
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 1510 - 1514
  • [5] Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system
    Wen, Yinlei
    Zhang, Huaguang
    Ren, He
    Zhang, Kun
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (12): : 8059 - 8081
  • [6] Two-player nonzero-sum stopping games in discrete time
    Shmaya, E
    Solan, E
    ANNALS OF PROBABILITY, 2004, 32 (3B): : 2733 - 2764
  • [7] Integral Reinforcement Learning-Based Optimal Control for Nonzero-Sum Games of Multi-Player Input-Constrained Nonlinear Systems
    Wu, Qiuye
    Zhao, Bo
    Liu, Derong
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 59 - 63
  • [8] Multi-event-triggered adaptive critic control with guaranteed cost for discrete-time nonlinear nonzero-sum games
    Wang, Ding
    Hu, Lingzhi
    Qiao, Junfei
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (18) : 10292 - 10308
  • [9] Learning in Nonzero-Sum Stochastic Games with Potentials
    Mguni, David
    Wu, Yutong
    Du, Yali
    Yang, Yaodong
    Wang, Ziyi
    Li, Minne
    Wen, Ying
    Jennings, Joel
    Wang, Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms
    Zhang, Huaguang
    Jiang, He
    Luo, Chaomin
    Xiao, Geyang
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3331 - 3340