A Matrosov Theorem for Adversarial Markov Decision Processes

被引:25
|
作者
Teel, Andrew R. [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
基金
美国国家科学基金会;
关键词
ASYMPTOTIC STABILITY;
D O I
10.1109/TAC.2013.2250073
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Matrosov's relaxation of Lyapunov conditions for uniform global asymptotic stability in time-varying systems is extended to stochastic, set-valued discrete-time systems. Nested Matrosov functions are used to give conditions for stability that complement invariance principles for time-invariant systems. Unlike invariance principles, Matrosov functions also can be applied to general time-varying systems.
引用
收藏
页码:2142 / 2148
页数:8
相关论文
共 50 条
  • [41] On the detection of Markov decision processes
    Duan, Xiaoming
    Savas, Yagiz
    Yan, Rui
    Xu, Zhe
    Topcu, Ufuk
    AUTOMATICA, 2025, 175
  • [42] Robust Markov Decision Processes
    Wiesemann, Wolfram
    Kuhn, Daniel
    Rustem, Berc
    MATHEMATICS OF OPERATIONS RESEARCH, 2013, 38 (01) : 153 - 183
  • [43] Ordinal Decision Models for Markov Decision Processes
    Weng, Paul
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 828 - 833
  • [44] Markov Decision Processes with Arbitrary Reward Processes
    Yu, Jia Yuan
    Mannor, Shie
    Shimkin, Nahum
    MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 737 - 757
  • [45] Markov Decision Processes with Arbitrary Reward Processes
    Yu, Jia Yuan
    Mannor, Shie
    Shimkin, Nahum
    RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 268 - +
  • [46] AN EXTENSION OF A UNIFORM ASYMPTOTIC STABILITY THEOREM BY MATROSOV
    WADA, T
    PROCEEDINGS OF THE JAPAN ACADEMY SERIES A-MATHEMATICAL SCIENCES, 1987, 63 (09) : 350 - 353
  • [47] Markov Chains and Markov Decision Processes in Isabelle/HOL
    Hoelzl, Johannes
    JOURNAL OF AUTOMATED REASONING, 2017, 59 (03) : 345 - 387
  • [48] Markov Chains and Markov Decision Processes in Isabelle/HOL
    Johannes Hölzl
    Journal of Automated Reasoning, 2017, 59 : 345 - 387
  • [49] APPROXIMATING THE MARKOV PROPERTY IN MARKOV DECISION-PROCESSES
    WHITE, DJ
    INFORMATION AND DECISION TECHNOLOGIES, 1989, 15 (03): : 147 - 162
  • [50] ERGODIC THEOREM FOR MARKOV PROCESSES WITH FINITE LIFETIME
    PARTZSCH, L
    THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1971, 16 (04): : 697 - 700