Event-triggered optimal control for discrete-time multi-player non-zero-sum games using parallel control

被引:34
作者
Lu, Jingwei [1 ,2 ]
Wei, Qinglai [1 ,2 ]
Wang, Ziyang [3 ]
Zhou, Tianmin [1 ,2 ]
Wang, Fei-Yue [1 ,2 ]
机构
[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[3] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Event-triggered; Non-zero-sum games; Parallel control; Neural network; Adaptive dynamic programming; NONLINEAR-SYSTEMS; OPTIMAL TRACKING;
D O I
10.1016/j.ins.2021.10.073
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel event-triggered optimal control (ETOC) method for discrete -time (DT) multi-player non-zero-sum games (NZSGs). First, a novel event-triggered algo-rithm is developed for DT multi-player NZSGs based on the time-triggered optimal value functions. Therefore, the developed event-triggered algorithm only needs to solve the time-triggered Hamilton-Jacobi-Bellman (HJB) equations. Then, the asymptotic stability of the closed-loop system is proved. Additionally, we show that an upper bound for the sum of the actual performance indices of all the players can be determined in advance. A key step in the implementation of the developed event-triggered algorithm is to obtain the next state of the actual system, which is difficult to implement on the actual system. Thus, a parallel control method is utilized to predict the next state by constructing the par-allel system for the actual system and obtain the optimal value functions. The method that combines the developed event-triggered algorithm and parallel control is called the event-triggered optimal parallel control (ETOPC) method. The neural network (NN) technique and the iterative adaptive dynamic programming (IADP) technique are employed in parallel control. Moreover, the control stability is shown further in the consideration of the NN weight approximation errors. Finally, two simulations justify the theoretical conjectures. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:519 / 535
页数:17
相关论文
共 42 条
[1]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949
[2]   Towards safe reinforcement-learning in industrial grid-warehousing [J].
Andersen, Per-Arne ;
Goodwin, Morten ;
Granmo, Ole-Christoffer .
INFORMATION SCIENCES, 2020, 537 :467-484
[3]  
Bertsekas D.P., 1996, NEURO DYNAMIC PROGRA, V3
[4]   Adaptive Event-Triggered Control Based on Heuristic Dynamic Programming for Nonlinear Discrete-Time Systems [J].
Dong, Lu ;
Zhong, Xiangnan ;
Sun, Changyin ;
He, Haibo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (07) :1594-1605
[5]   Parallel Driving in CPSS: A Unified Approach for Transport Automation and Vehicle Intelligence [J].
Wang, Fei-Yue ;
Zheng, Nan-Ning ;
Cao, Dongpu ;
Martinez, Clara Marina ;
Li, Li ;
Liu, Teng .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2017, 4 (04) :577-587
[6]   Fuzzy adaptive event-triggered control for a class of nonlinear systems with time-varying full state constraints [J].
Jin, Xin ;
Li, Yuan-Xin .
INFORMATION SCIENCES, 2021, 563 :111-129
[7]  
Khalil H.K., 1992, NONLINEAR SYSTEMS
[8]  
Lewis F., 2012, OPTIMAL CONTROL, V3rd
[9]   Data Augmented Deep Behavioral Cloning for Urban Traffic Control Operations Under a Parallel Learning Framework [J].
Li, Xiaoshuang ;
Ye, Peijun ;
Jin, Junchen ;
Zhu, Fenghua ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) :5128-5137
[10]   Event-triggered control for robust set stabilization of logical control networks [J].
Li, Yalu ;
Li, Haitao ;
Sun, Weiwei .
AUTOMATICA, 2018, 95 :556-560