Learning-based control for discrete-time constrained nonzero-sum games

被引：6

作者：

Mu, Chaoxu ^{[1
]}

Peng, Jiangwen ^{[1
]}

Tang, Yufei ^{[2
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China

[2] Florida Atlantic Univ, Dept Comp Elect Engn & Comp Sci, Boca Raton, FL 33431 USA

来源：

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2021年 / 6卷 / 02期

基金：

中国国家自然科学基金;

关键词：

EXPERIENCE REPLAY; SYSTEMS;

D O I：

10.1049/cit2.12015

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A generalized policy-iteration-based solution to a class of discrete-time multi-player non-zero-sum games concerning the control constraints was proposed. Based on initial admissible control policies, the iterative value function of each player converges to the optimum approximately, which is structured by the iterative control policies satisfying the Nash equilibrium. Afterwards, the stability analysis is shown to illustrate that the iterative control policies can stabilize the system and minimize the performance index function of each player. Meanwhile, neural networks are implemented to approximate the iterative control policies and value functions with the impact of control constraints. Finally, two numerical simulations of the discrete-time two-player non-zero-sum games for linear and non-linear systems are shown to illustrate the effectiveness of the proposed scheme.

引用

页码：203 / 213

页数：11

共 50 条

[1] Nonzero-sum constrained discrete-time Markov games: the case of unbounded costs
Wenzhao Zhang
Yonghui Huang
Xianping Guo
TOP, 2014, 22 : 1074 - 1102
[2] Nonzero-sum constrained discrete-time Markov games: the case of unbounded costs
Zhang, Wenzhao
Huang, Yonghui
Guo, Xianping
TOP, 2014, 22 (03) : 1074 - 1102
[3] NONZERO-SUM EXPECTED AVERAGE DISCRETE-TIME STOCHASTIC GAMES: THE CASE OF UNCOUNTABLE SPACES
Wei, Qingda
Chen, Xian
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2019, 57 (06) : 4099 - 4124
[4] The Nonzero-sum Games of Discrete -time Stochastic Singular Systems
Zhou, Haiying
PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 1510 - 1514
[5] Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system
Wen, Yinlei
Zhang, Huaguang
Ren, He
Zhang, Kun
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2020, 357 (12): : 8059 - 8081
[6] Two-player nonzero-sum stopping games in discrete time
Shmaya, E
Solan, E
ANNALS OF PROBABILITY, 2004, 32 (3B): : 2733 - 2764
[7] Integral Reinforcement Learning-Based Optimal Control for Nonzero-Sum Games of Multi-Player Input-Constrained Nonlinear Systems
Wu, Qiuye
Zhao, Bo
Liu, Derong
2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 59 - 63
[8] Multi-event-triggered adaptive critic control with guaranteed cost for discrete-time nonlinear nonzero-sum games
Wang, Ding
Hu, Lingzhi
Qiao, Junfei
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (18) : 10292 - 10308
[9] Learning in Nonzero-Sum Stochastic Games with Potentials
Mguni, David
Wu, Yutong
Du, Yali
Yang, Yaodong
Wang, Ziyi
Li, Minne
Wen, Ying
Jennings, Joel
Wang, Jun
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[10] Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms
Zhang, Huaguang
Jiang, He
Luo, Chaomin
Xiao, Geyang
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3331 - 3340

← 1 2 3 4 5 →