Model-free optimal tracking policies for Markov jump systems by solving non-zero-sum games

被引:3
|
作者
Zhou, Peixin [1 ]
Xue, Huiwen [1 ]
Wen, Jiwei [1 ]
Shi, Peng [2 ,3 ]
Luan, Xaoli [1 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[2] Univ Adelaide, Sch Elect & Mech Engn, Adelaide, SA 5005, Australia
[3] Obuda Univ, Res & Innovat Ctr, H-1034 Budapest, Hungary
基金
中国国家自然科学基金;
关键词
Value iteration algorithm; Influence function; Adaptive optimal tracking; Non-zero-sum game; Nash equilibrium;
D O I
10.1016/j.ins.2023.119423
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops model-free optimal tracking policies for Markov jump systems by solving nonzero-sum games (NZSGs). First, coupled action and mode-dependent value functions (CAMDVFs) are built for solving a two-player NZSG and getting Nash equilibrium solutions. Second, we propose a value iteration (VI) algorithm to parallelly update policies under each mode by collecting data on different operation modes within each iterative window. Moreover, the iterative increasing convergence of the CAMDVFs is proved by introducing auxiliary functions between two adjacent iterations. It is worth pointing out that an influence function is introduced to remove abnormal data to improve the learning capability of the VI algorithm effectively. Finally, the tracking policies' validity, self-adaptability and application potential are verified by a numerical example and a generalized economic model.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Q-learning-based non-zero sum games for Markov jump multiplayer systems under actor-critic NNs structure
    Wang, Yun
    Xia, Jiawei
    Wang, Jing
    Shen, Hao
    INFORMATION SCIENCES, 2024, 681
  • [32] Differential Dynamic Programming for Finite-Horizon Multi-Player Non-Zero-Sum Differential Games of Nonlinear Systems
    Zhang, Yuqi
    Zhang, Bin
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 511 - 517
  • [33] Differential Dynamic Programming for Finite-Horizon Multi-Player Non-Zero-Sum Differential Games of Nonlinear Systems
    Zhang, Yuqi
    Zhang, Bin
    Proceedings of 2023 IEEE 12th Data Driven Control and Learning Systems Conference, DDCLS 2023, 2023, : 511 - 517
  • [34] Optimal Control for Fuzzy Markov Jump Singularly Perturbed Systems: A Hybrid Zero-Sum Game Iteration Approach
    Wang, Jing
    Huang, Yaling
    Xie, Xiangpeng
    Yan, Huaicheng
    Shen, Hao
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (11) : 6388 - 6398
  • [35] Successive over relaxation for model-free LQR control of discrete-time Markov jump systems
    Fan, Wenwu
    Xiong, Junlin
    AUTOMATICA, 2025, 171
  • [36] A Model-Free Iteration Algorithm for Markov Jump Linear Systems Based on Gauss-Seidel Method
    Fan, Wenwu
    Xiong, Junlin
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1275 - 1280
  • [37] Model-free H∞ Stochastic Optimal Design for Unknown Linear Networked Control System Zero-sum Games via Q-Learning
    Xu, Hao
    Jagannathan, S.
    2011 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL (ISIC), 2011, : 198 - 203
  • [38] Event-triggered optimal control for discrete-time multi-player non-zero-sum games using parallel control
    Lu, Jingwei
    Wei, Qinglai
    Wang, Ziyang
    Zhou, Tianmin
    Wang, Fei-Yue
    INFORMATION SCIENCES, 2022, 584 : 519 - 535
  • [39] Neural-network-based safe learning control for non-zero-sum differential games of nonlinear systems with asymmetric input constraints
    Qin, Chunbin
    Zhu, Tianzeng
    Jiang, Kaijun
    Wu, Yinliang
    Zhang, Jishi
    APPLIED INTELLIGENCE, 2024, : 7810 - 7828
  • [40] Data-driven adaptive dynamic programming schemes for non-zero-sum games of unknown discrete-time nonlinear systems
    Jiang, He
    Zhang, Huaguang
    Zhang, Kun
    Cui, Xiaohong
    NEUROCOMPUTING, 2018, 275 : 649 - 658