Learning-based control for discrete-time constrained nonzero-sum games

被引：6

作者：

Mu, Chaoxu ^{[1
]}

Peng, Jiangwen ^{[1
]}

Tang, Yufei ^{[2
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China

[2] Florida Atlantic Univ, Dept Comp Elect Engn & Comp Sci, Boca Raton, FL 33431 USA

来源：

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2021年 / 6卷 / 02期

基金：

中国国家自然科学基金;

关键词：

EXPERIENCE REPLAY; SYSTEMS;

D O I：

10.1049/cit2.12015

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A generalized policy-iteration-based solution to a class of discrete-time multi-player non-zero-sum games concerning the control constraints was proposed. Based on initial admissible control policies, the iterative value function of each player converges to the optimum approximately, which is structured by the iterative control policies satisfying the Nash equilibrium. Afterwards, the stability analysis is shown to illustrate that the iterative control policies can stabilize the system and minimize the performance index function of each player. Meanwhile, neural networks are implemented to approximate the iterative control policies and value functions with the impact of control constraints. Finally, two numerical simulations of the discrete-time two-player non-zero-sum games for linear and non-linear systems are shown to illustrate the effectiveness of the proposed scheme.

引用

页码：203 / 213

页数：11

共 50 条

[41] Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games
Zhang, Qichao
Zhao, Dongbin
Zhang, Sibo
NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 822 - 830
[42] Nonzero-sum stochastic differential games with discontinuous feedback
Mannucci, P
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2004, 43 (04) : 1222 - 1233
[43] Numerical approximations for nonzero-sum stochastic differential games
Kushner, Harold J.
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2007, 46 (06) : 1942 - 1971
[44] Nonzero-Sum Games of Optimal Stopping for Markov Processes
Attard, Natalie
APPLIED MATHEMATICS AND OPTIMIZATION, 2018, 77 (03): : 567 - 597
[45] Two-player nonzero-sum ω-regular games
Chatterjee, K
CONCUR 2005 - CONCURRENCY THEORY, PROCEEDINGS, 2005, 3653 : 413 - 427
[46] Constructive ε-Nash Equilibria for Nonzero-Sum Differential Games
Mylvaganam, Thulasi
Sassano, Mario
Astolfi, Alessandro
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (04) : 950 - 965
[47] NOTE ON NONZERO-SUM DIFFERENTIAL GAMES WITH BARGAINING SOLUTION
HAURIE, A
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1976, 18 (01) : 31 - 39
[48] Nonzero-sum stochastic differential games with reflecting diffusions
Ghosh, Mrinal K.
Kumar, K. Suresh
COMPUTATIONAL & APPLIED MATHEMATICS, 1999, 18 (03): : 355 - 368
[49] Nonzero-Sum Games of Optimal Stopping for Markov Processes
Natalie Attard
Applied Mathematics & Optimization, 2018, 77 : 567 - 597
[50] Online finite-horizon optimal learning algorithm for nonzero-sum games with partially unknown dynamics and constrained inputs
Cui, Xiaohong
Zhang, Huaguang
Luo, Yanhong
Zu, Peifu
NEUROCOMPUTING, 2016, 185 : 37 - 44

← 1 2 3 4 5 →