Reinforcement Learning for Stochastic Max-Plus Linear Systems

Authors
Subramanian, Vignesh [1]
Farhadi, Farzaneh [2]
Soudjani, Sadegh [3]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Newcastle Univ, Sch Engn, Newcastle Upon Tyne, Tyne & Wear, England
[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne, Tyne & Wear, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
DISCRETE-EVENT SYSTEMS; REACHABILITY ANALYSIS;
DOI
10.1109/CDC49753.2023.10384207
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper studies the design of control policies for Discrete Event Systems under uncertainties. We capture the timing of the events using the framework of max-plus-linear systems in which the time between consecutive events depends on random delays with unknown distributions. Our policy synthesis approach is with respect to a cost function, and it can be extended directly to satisfy safety specifications on the timing of events. The main novelty of our approach is to translate the system evolution to a Markov decision process (MDP) that has an uncountable state space and develop a stochastic optimisation problem under the evolution of the MDP. To tackle the unknown distribution of uncertainties (thus unknown transition probabilities in the MDP), we employ model-free reinforcement learning to perform optimisations and find control policies for the system. Our implementation results on the 9-dimensional model of a railway network show superiority of our learning approach in comparison with the stochastic model predictive control approach.
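For readers unfamiliar with the max-plus-linear framework named in the abstract, the following is a minimal sketch of an uncontrolled stochastic recursion x(k+1) = A(k) ⊗ x(k), where (A ⊗ x)_i = max_j (A_ij + x_j). It is not the authors' model or code: the helper names, the uniform jitter on the delays, and the absence of a control input are all assumptions made for illustration. The paper treats the delay distributions as unknown.

```python
import random

NEG_INF = float("-inf")  # the max-plus "zero": no dependency between two events


def maxplus_matvec(A, x):
    """Max-plus product: (A ⊗ x)_i = max_j (A[i][j] + x[j])."""
    return [max(a_ij + x_j for a_ij, x_j in zip(row, x)) for row in A]


def sample_delay_matrix(nominal, jitter, rng):
    """Hypothetical noise model (an assumption, not from the paper):
    add uniform jitter in [0, jitter] to every finite nominal delay."""
    return [
        [a if a == NEG_INF else a + rng.uniform(0.0, jitter) for a in row]
        for row in nominal
    ]


def simulate(nominal, x0, steps, jitter=0.5, seed=0):
    """Return the list of event-time vectors x(0), ..., x(steps)."""
    rng = random.Random(seed)
    x = list(x0)
    trajectory = [x]
    for _ in range(steps):
        x = maxplus_matvec(sample_delay_matrix(nominal, jitter, rng), x)
        trajectory.append(x)
    return trajectory


if __name__ == "__main__":
    # Two-event toy system; entries are nominal delays between events.
    A = [[2.0, 5.0], [3.0, NEG_INF]]
    for k, x in enumerate(simulate(A, [0.0, 1.0], steps=3)):
        print(k, x)
```

A model-free learner of the kind the abstract mentions would only observe sampled trajectories like these, never the delay distribution itself.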
Pages: 5631 - 5638
Number of pages: 8
Related papers
  • [21] ON SPARSITY OF APPROXIMATE SOLUTIONS TO MAX-PLUS LINEAR SYSTEMS
    Li, Pingke
    KYBERNETIKA, 2024, 60 (03) : 425 - 425
  • [22] Soluble approximation of linear systems in max-plus algebra
    Cechlárová, K
    Cuninghame-Green, RA
    KYBERNETIKA, 2003, 39 (02) : 137 - 141
  • [23] SYSTEMS OF FUZZY NUMBER MAX-PLUS LINEAR EQUATIONS
    Rudhito, M.
    Wahyuni, Sri
    Suparwanto, Ari
    Susilo, Frans
    JOURNAL OF THE INDONESIAN MATHEMATICAL SOCIETY, 2011, 17 (01) : 17 - 28
  • [24] MAX-PLUS LINEAR SYSTEMS AT BUS LINE SYNCHRONIZATION
    Pesko, Stefan
    Turek, Richard
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE QUANTITATIVE METHODS IN ECONOMICS (MULTIPLE CRITERIA DECISION MAKING XVI), 2012, : 180 - 185
  • [25] Soluble approximation of linear systems in max-plus algebra
    Cechlárová, K
    Cuninghame-Green, RA
    SYSTEM STRUCTURE AND CONTROL 2001, VOLS 1 AND 2, 2001, : 809 - 811
  • [26] New transience bounds for max-plus linear systems
    Charron-Bost, Bernadette
    Fugger, Matthias
    Nowak, Thomas
    DISCRETE APPLIED MATHEMATICS, 2017, 219 : 83 - 99
  • [27] Reachability and observability of linear systems over max-plus
    Gazarik, MJ
    Kamen, EW
    KYBERNETIKA, 1999, 35 (01) : 2 - 12
  • [28] Control and State Estimation for max-plus Linear Systems
    Hardouin, Laurent
    Cottenceau, Bertrand
    Shang, Ying
    Raisch, Joerg
    FOUNDATIONS AND TRENDS IN SYSTEMS AND CONTROL, 2018, 6 (01) : 1 - 116
  • [29] On the model reference control for max-plus linear systems
    Maia, C. A.
    Hardouin, L.
    Santos-Mendes, R.
    Cottenceau, B.
    2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 7799 - 7803
  • [30] Stochastic stability in Max-Product and Max-Plus Systems with Markovian Jumps
    Kordonis, Ioannis
    Maragos, Petros
    Papavassilopoulos, George P.
    AUTOMATICA, 2018, 92 : 123 - 132