Stochastic Optimal Control as Approximate Input Inference

被引:0
|
作者
Watson, Joe [1 ]
Abdulsamad, Hany [1 ]
Peters, Jan [2 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
[2] Max Planck Inst Intelligent Systems Tubingen, Robot Learning Grp, Tubingen, Germany
来源
基金
欧盟地平线“2020”;
关键词
Stochastic Optimal Control; Approximate Inference; OPTIMAL FEEDBACK-CONTROL;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning. Given the intractability of the global control problem, state-of-the-art algorithms focus on approximate sequential optimization techniques, that heavily rely on heuristics for regularization in order to achieve stable convergence. By building upon the duality between inference and control, we develop the view of Optimal Control as Input Estimation, devising a probabilistic stochastic optimal control formulation that iteratively infers the optimal input distributions by minimizing an upper bound of the control cost. Inference is performed through Expectation Maximization and message passing on a probabilistic graphical model of the dynamical system, and time-varying linear Gaussian feedback controllers are extracted from the joint state-action distribution. This perspective incorporates uncertainty quantification, effective initialization through priors, and the principled regularization inherent to the Bayesian treatment. Moreover, it can be shown that for deterministic linearized systems, our framework derives the maximum entropy linear quadratic optimal control law. We provide a complete and detailed derivation of our probabilistic approach and highlight its advantages in comparison to other deterministic and probabilistic solvers.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] APPROXIMATE SYNTHESIS OF OPTIMAL CONTROL
    KROTOV, VF
    AUTOMATION AND REMOTE CONTROL, 1965, 25 (11) : 1369 - &
  • [32] Nonparametric statistical inference for stochastic optimal control problems and its applications for financial investment
    Liu, Yang
    Liang, Yanzi
    Lan, Xinchen
    Lu, Zheng
    FINANCE RESEARCH LETTERS, 2024, 64
  • [33] Approximate Value Iteration for a Class of Deterministic Optimal Control Problems with Infinite State and Input Alphabets
    Reissig, Gunther
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 1063 - 1068
  • [34] OPTIMAL STOCHASTIC CONTROL
    WONHAM, WM
    AUTOMATICA, 1969, 5 (01) : 113 - +
  • [35] ON STOCHASTIC OPTIMAL CONTROL
    PARKINS, G
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1969, AC14 (02) : 193 - &
  • [36] OPTIMAL STOCHASTIC CONTROL
    KUSHNER, HJ
    IRE TRANSACTIONS ON AUTOMATIC CONTROL, 1962, AC 7 (05): : 120 - &
  • [37] Optimal Tracking Control and Stabilization for Stochastic Systems with Multi-step Input Delay
    Han, Chunyan
    Liu, Yue
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2185 - 2190
  • [38] Optimal Control for Stochastic Discrete-time Systems with Multiple Input-delays
    Wang, Hongxia
    Zhang, Huanshui
    Wang, Xuan
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 1529 - 1534
  • [39] Finite-horizon Indefinite Optimal Control for Ito Stochastic System with Input Delay
    Li, Hongdan
    Zhang, Huanshui
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 5807 - 5811
  • [40] k-Optimal: a novel approximate inference algorithm for ProbLog
    Joris Renkens
    Guy Van den Broeck
    Siegfried Nijssen
    Machine Learning, 2012, 89 : 215 - 231