Learning-based control for discrete-time constrained nonzero-sum games

被引:6
|
作者
Mu, Chaoxu [1 ]
Peng, Jiangwen [1 ]
Tang, Yufei [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
[2] Florida Atlantic Univ, Dept Comp Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
基金
中国国家自然科学基金;
关键词
EXPERIENCE REPLAY; SYSTEMS;
D O I
10.1049/cit2.12015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A generalized policy-iteration-based solution to a class of discrete-time multi-player non-zero-sum games concerning the control constraints was proposed. Based on initial admissible control policies, the iterative value function of each player converges to the optimum approximately, which is structured by the iterative control policies satisfying the Nash equilibrium. Afterwards, the stability analysis is shown to illustrate that the iterative control policies can stabilize the system and minimize the performance index function of each player. Meanwhile, neural networks are implemented to approximate the iterative control policies and value functions with the impact of control constraints. Finally, two numerical simulations of the discrete-time two-player non-zero-sum games for linear and non-linear systems are shown to illustrate the effectiveness of the proposed scheme.
引用
收藏
页码:203 / 213
页数:11
相关论文
共 50 条
  • [31] Nash Equilibrium Seeking in Nonzero-Sum Games: A Prescribed-Time Fuzzy Control Approach
    Zhang, Yan
    Chadli, Mohammed
    Xiang, Zhengrong
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (12) : 6929 - 6938
  • [32] Concurrent learning-based approximate feedback-Nash equilibrium solution of N-player nonzero-sum differential games
    Kamalapurkar, Rushikesh
    Klotz, Justin R.
    Dixon, Warren E.
    IEEE/CAA Journal of Automatica Sinica, 2014, 1 (03) : 239 - 247
  • [33] Concurrent Learning-based Approximate Feedback-Nash Equilibrium Solution of N-player Nonzero-sum Differential Games
    Rushikesh Kamalapurkar
    Justin R.Klotz
    Warren E.Dixon
    IEEE/CAAJournalofAutomaticaSinica, 2014, 1 (03) : 239 - 247
  • [34] The multi-player nonzero-sum Dynkin game in discrete time
    Hamadene, Said
    Hassani, Mohammed
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2014, 79 (02) : 179 - 194
  • [35] The multi-player nonzero-sum Dynkin game in discrete time
    Said Hamadène
    Mohammed Hassani
    Mathematical Methods of Operations Research, 2014, 79 : 179 - 194
  • [36] Policy iteration based Q-learning for linear nonzero-sum quadratic differential games
    Xinxing Li
    Zhihong Peng
    Li Liang
    Wenzhong Zha
    Science China Information Sciences, 2019, 62
  • [37] Policy iteration based Q-learning for linear nonzero-sum quadratic differential games
    Xinxing LI
    Zhihong PENG
    Li LIANG
    Wenzhong ZHA
    ScienceChina(InformationSciences), 2019, 62 (05) : 195 - 213
  • [38] Additional Aspects of the Stackelberg Strategy in Nonzero-Sum Games
    Simaan, M.
    Cruz, J. B., Jr.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1973, 11 (06) : 613 - 626
  • [39] INTERGROUP ATTITUDES AND STRATEGIES IN NONZERO-SUM DILEMMA GAMES
    WILSON, W
    AMERICAN PSYCHOLOGIST, 1966, 21 (07) : 651 - &
  • [40] Policy iteration based Q-learning for linear nonzero-sum quadratic differential games
    Li, Xinxing
    Peng, Zhihong
    Liang, Li
    Zha, Wenzhong
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (05)