Stochastic Optimal Control as Approximate Input Inference

Authors
Watson, Joe [1]
Abdulsamad, Hany [1]
Peters, Jan [2]
Affiliations
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
[2] Max Planck Inst Intelligent Systems Tubingen, Robot Learning Grp, Tubingen, Germany
Source
Funding
European Union Horizon 2020;
Keywords
Stochastic Optimal Control; Approximate Inference; OPTIMAL FEEDBACK-CONTROL;
DOI
Not available
Chinese Library Classification number
TP39 [Computer Applications];
Discipline classification codes
081203 ; 0835 ;
Abstract
Optimal control of stochastic nonlinear dynamical systems is a major challenge in robot learning. Because the global control problem is intractable, state-of-the-art algorithms focus on approximate sequential optimization techniques that rely heavily on heuristics for regularization in order to achieve stable convergence. Building on the duality between inference and control, we develop the view of Optimal Control as Input Estimation, devising a probabilistic stochastic optimal control formulation that iteratively infers the optimal input distributions by minimizing an upper bound on the control cost. Inference is performed through Expectation Maximization and message passing on a probabilistic graphical model of the dynamical system, and time-varying linear Gaussian feedback controllers are extracted from the joint state-action distribution. This perspective incorporates uncertainty quantification, effective initialization through priors, and the principled regularization inherent to the Bayesian treatment. Moreover, it can be shown that for deterministic linearized systems, our framework derives the maximum entropy linear quadratic optimal control law. We provide a complete and detailed derivation of our probabilistic approach and highlight its advantages in comparison to other deterministic and probabilistic solvers.
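The abstract states that linear Gaussian feedback controllers are extracted from the joint state-action distribution. As a rough illustration of what such an extraction could look like for a single time step, the sketch below conditions a joint Gaussian over (x, u) on x using the standard Gaussian conditioning identity. It is not taken from the paper: the function name `controller_from_joint`, the argument layout, and the numbers in the usage note are assumptions made for this example only.

```python
import numpy as np

def controller_from_joint(mu, sigma, dx):
    """Condition a joint Gaussian over (x, u) on x.

    Hypothetical helper for illustration. `mu` and `sigma` are the mean and
    covariance of the stacked vector [x; u], and `dx` is the state dimension.
    Returns (K, k, sigma_u) such that u | x ~ N(K x + k, sigma_u).
    """
    mu_x, mu_u = mu[:dx], mu[dx:]
    s_xx = sigma[:dx, :dx]   # state covariance
    s_ux = sigma[dx:, :dx]   # action-state cross-covariance
    s_uu = sigma[dx:, dx:]   # action covariance

    # Feedback gain K = S_ux S_xx^{-1}; solve instead of forming the inverse
    # (valid because S_xx is symmetric).
    K = np.linalg.solve(s_xx, s_ux.T).T
    # Feedforward term so that the conditional mean is K x + k.
    k = mu_u - K @ mu_x
    # Conditional covariance (Schur complement of S_xx).
    sigma_u = s_uu - K @ s_ux.T
    return K, k, sigma_u
```

For example, with an assumed joint mean `[1.0, 2.0, 0.5]` and covariance `[[2, 0, 1], [0, 1, 0.5], [1, 0.5, 3]]` over a two-dimensional state and a scalar action, calling `controller_from_joint(mu, sigma, 2)` yields a gain, an offset, and a conditional action covariance. A time-varying controller would repeat this per time step on whatever joint distributions the inference procedure produces.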
Pages: 20
Related papers
50 in total
  • [21] Graphical model inference in optimal control of stochastic multi-agent systems
    van den Broek, Bart
    Wiegerinck, Wim
    Kappen, Bert
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2008, 32 : 95 - 122
  • [22] Approximate Optimal Curve Path Tracking Control for Nonlinear Systems with Asymmetric Input Constraints
    Wang, Yajing
    Wang, Xiangke
    Shen, Lincheng
    DRONES, 2022, 6 (11)
  • [23] Active Inference for Stochastic Control
    Paul, Aswin
    Sajid, Noor
    Gopalkrishnan, Manoj
    Razi, Adeel
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021, PT I, 2021, 1524 : 669 - 680
  • [24] OPTIMAL CONTROL OF A STOCHASTIC PROCESSING SYSTEM DRIVEN BY A FRACTIONAL BROWNIAN MOTION INPUT
    Ghosh, Arka P.
    Roitershtein, Alexander
    Weerasinghe, Ananda
    ADVANCES IN APPLIED PROBABILITY, 2010, 42 (01) : 183 - 209
  • [25] Optimal control for Ito-stochastic systems with multiple input and output delays
    Kong, Shulan
    Chen, Wen
    IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (10) : 1187 - 1193
  • [26] Optimal control for discrete-time singular stochastic systems with input delay
    Wang, Fan
    Liang, Jinling
    Wang, Feng
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2016, 37 (06) : 1282 - 1313
  • [27] Nonlinear stochastic optimal control with input saturation constraints based on path integrals
    Satoh, Satoshi
    Kappen, Hilbert J.
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (08) : 1169 - 1175
  • [28] Stochastic optimal open-loop feedback control: Approximate solution of the Hamiltonian system
    Marti, K.
    Stein, I.
    ADVANCES IN ENGINEERING SOFTWARE, 2015, 89 : 43 - 51
  • [29] APPROXIMATE AND NUMERICAL-METHODS FOR SYNTHESIS OF OPTIMAL-CONTROL OF STOCHASTIC-SYSTEMS
    KOLMANOVSKIY, VB
    KOLOSOV, GY
    SOVIET JOURNAL OF COMPUTER AND SYSTEMS SCIENCES, 1990, 28 (01) : 140 - 153
  • [30] APPROXIMATE AND NUMERICAL-METHODS OF THE OPTIMAL-CONTROL SYNTHESIS FOR STOCHASTIC-SYSTEMS
    KOLMANOVSKII, VB
    KOLOSOV, GY
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1991, 154 : 63 - 80