共 50 条
- [44] On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 893 - 901
- [45] Online Solution of Nonlinear Two-Player Zero-Sum Games Using Synchronous Policy Iteration 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 3040 - 3047
- [46] The Lagging Anchor Algorithm: Reinforcement Learning in Two-Player Zero-Sum Games with Imperfect Information Machine Learning, 2002, 49 : 5 - 37
- [47] GPI-Based design for partially unknown nonlinear two-player zero-sum games JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (03): : 2068 - 2088
- [48] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration International Journal of Robust and Nonlinear Control, 2012, 22 (13): : 1460 - 1483
- [50] Policy gradient algorithm and its convergence analysis for two-player zero-sum Markov games Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (03): : 480 - 491