Learning-based control for discrete-time constrained nonzero-sum games

Cited by: 6
Authors
Mu, Chaoxu [1]
Peng, Jiangwen [1]
Tang, Yufei [2]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
[2] Florida Atlantic Univ, Dept Comp Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
Funding
National Natural Science Foundation of China;
Keywords
EXPERIENCE REPLAY; SYSTEMS;
DOI
10.1049/cit2.12015
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
A generalized policy-iteration-based solution is proposed for a class of discrete-time multi-player non-zero-sum games with control constraints. Starting from initial admissible control policies, the iterative value function of each player converges approximately to the optimum, which is attained by iterative control policies satisfying the Nash equilibrium. A stability analysis then shows that the iterative control policies stabilize the system and minimize the performance index function of each player. Neural networks are used to approximate the iterative control policies and value functions under the control constraints. Finally, two numerical simulations of discrete-time two-player non-zero-sum games for linear and non-linear systems illustrate the effectiveness of the proposed scheme.
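The abstract does not say how the neural networks account for the control constraints. As a rough illustration only, and not the authors' implementation, one common way to keep an approximated policy inside a symmetric bound is to pass the network output through a scaled tanh. The sketch below assumes a single hidden layer, random untrained weights, and hypothetical names (`make_actor`, `u_max`) that do not come from the paper.

```python
import math
import random


def make_actor(n_state, n_hidden, u_max, seed=0):
    """Build a single-hidden-layer actor with output in [-u_max, u_max].

    All names and sizes here are illustrative assumptions, not taken
    from the paper. The weights are random and untrained.
    """
    rng = random.Random(seed)
    # Hidden-layer weights: one row of n_state weights per hidden unit.
    w1 = [[rng.uniform(-1.0, 1.0) for _ in range(n_state)]
          for _ in range(n_hidden)]
    # Output-layer weights: one weight per hidden unit.
    w2 = [rng.uniform(-1.0, 1.0) for _ in range(n_hidden)]

    def actor(x):
        # tanh hidden activations
        hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)))
                  for row in w1]
        pre_activation = sum(w * h for w, h in zip(w2, hidden))
        # Scaled tanh keeps the control within the assumed bound.
        return u_max * math.tanh(pre_activation)

    return actor


# Two players, each with its own assumed control bound.
actor_1 = make_actor(n_state=2, n_hidden=8, u_max=0.5, seed=1)
actor_2 = make_actor(n_state=2, n_hidden=8, u_max=1.0, seed=2)

state = [10.0, -3.0]
u1, u2 = actor_1(state), actor_2(state)
```

Because tanh is bounded by 1 in magnitude, every control produced this way satisfies the bound regardless of the weights, so a training step never has to clip or project the policy output.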
Pages: 203-213
Page count: 11