Learning-based control for discrete-time constrained nonzero-sum games

Cited by: 6
Authors
Mu, Chaoxu [1]
Peng, Jiangwen [1]
Tang, Yufei [2]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
[2] Florida Atlantic Univ, Dept Comp Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
Funding
National Natural Science Foundation of China;
Keywords
EXPERIENCE REPLAY; SYSTEMS;
DOI
10.1049/cit2.12015
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A generalized policy-iteration-based solution is proposed for a class of discrete-time multi-player non-zero-sum games with control constraints. Starting from initial admissible control policies, the iterative value function of each player converges approximately to the optimum, which is determined by iterative control policies that satisfy the Nash equilibrium. A stability analysis then shows that the iterative control policies stabilize the system and minimize the performance index function of each player. Neural networks are used to approximate the iterative control policies and value functions under the control constraints. Finally, two numerical simulations of discrete-time two-player non-zero-sum games for linear and non-linear systems illustrate the effectiveness of the proposed scheme.
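The policy-iteration idea in the abstract can be illustrated on a much simpler problem. The sketch below is not the paper's algorithm: it assumes a scalar linear system x[k+1] = a*x + b1*u1 + b2*u2, quadratic stage costs q_i*x^2 + r_i*u_i^2, linear feedback policies u_i = -k_i*x, and no control constraints or neural networks. The function name `policy_iteration` and all numerical values are hypothetical choices made for illustration only.

```python
def policy_iteration(a, b, q, r, k0=(0.0, 0.0), iters=200):
    """Policy iteration for a scalar two-player nonzero-sum LQ game.

    Assumed (illustrative) setup, not taken from the paper:
      dynamics      x[k+1] = a*x + b[0]*u1 + b[1]*u2
      policies      u_i = -k[i] * x
      stage cost i  q[i]*x**2 + r[i]*u_i**2
    Returns the feedback gains k and value coefficients p, V_i(x) = p[i]*x**2.
    """
    k = list(k0)
    p = [0.0, 0.0]
    for _ in range(iters):
        a_cl = a - b[0] * k[0] - b[1] * k[1]  # closed-loop dynamics
        if abs(a_cl) >= 1.0:
            # The current policies must be admissible (stabilizing).
            raise ValueError("policies are not admissible")
        # Policy evaluation: p_i = q_i + r_i*k_i^2 + p_i*a_cl^2.
        p = [(q[i] + r[i] * k[i] ** 2) / (1.0 - a_cl ** 2) for i in range(2)]
        # Policy improvement: each player best-responds to the other
        # player's previous gain (simultaneous update).
        k = [
            p[i] * b[i] * (a - b[1 - i] * k[1 - i]) / (r[i] + p[i] * b[i] ** 2)
            for i in range(2)
        ]
    return k, p


if __name__ == "__main__":
    gains, values = policy_iteration(0.9, (0.5, 0.3), (1.0, 1.0), (1.0, 1.0))
    print("feedback gains:", gains)
    print("value coefficients:", values)
```

The zero initial gains are admissible here because |a| < 1. When the iteration converges, neither player can lower its own cost by changing its gain alone, which is the Nash condition for this restricted class of linear policies. Convergence of this simple scheme is not guaranteed for arbitrary parameters.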
Pages: 203-213
Number of pages: 11
Related papers
50 in total
  • [21] MARKOV APPROXIMATIONS OF NONZERO-SUM DIFFERENTIAL GAMES
    Averboukh, Yu, V
    VESTNIK UDMURTSKOGO UNIVERSITETA-MATEMATIKA MEKHANIKA KOMPYUTERNYE NAUKI, 2020, 30 (01): : 3 - 17
  • [22] Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics
    Zhang, Qichao
    Zhao, Dongbin
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (08) : 2874 - 2885
  • [23] Nonzero-Sum Differential Games with Bargaining Solutions
    Liu, P. T.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1973, 11 (03) : 284 - 292
  • [24] Nonzero-sum impulse games with regime switching
    Lv, Siyu
    Xiong, Jie
    AUTOMATICA, 2022, 145
  • [25] Nonzero-Sum Stochastic Games with Probability Criteria
    Huang, Xiangxiang
    Guo, Xianping
    DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (02) : 509 - 527
  • [26] Nonzero-Sum Stochastic Games with Probability Criteria
    Xiangxiang Huang
    Xianping Guo
    Dynamic Games and Applications, 2020, 10 : 509 - 527
  • [27] Nonzero-sum Adversarial Hypothesis Testing Games
    Yasodharan, Sarath
    Loiseau, Patrick
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [28] Infinite Time Nonzero-Sum Linear Quadratic Stochastic Differential Games
    Sun Huiying
    Li Meng
    Zhang Weihai
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 1081 - 1084
  • [29] Parallel Control for Nonzero-Sum Games With Completely Unknown Nonlinear Dynamics via Reinforcement Learning
    Lu, Jingwei
    Wei, Qinglai
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025,
  • [30] Event-Triggered Intelligent Critic Design for Constrained Nonaffine Nonzero-Sum Games
    Hu, Lingzhi
    Wang, Ding
    Gao, Ning
    Zhao, Mingming
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 76 - 81