Stochastic Optimal Control as Approximate Input Inference

被引:0
|
作者
Watson, Joe [1 ]
Abdulsamad, Hany [1 ]
Peters, Jan [2 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
[2] Max Planck Inst Intelligent Systems Tubingen, Robot Learning Grp, Tubingen, Germany
来源
基金
欧盟地平线“2020”;
关键词
Stochastic Optimal Control; Approximate Inference; OPTIMAL FEEDBACK-CONTROL;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning. Given the intractability of the global control problem, state-of-the-art algorithms focus on approximate sequential optimization techniques, that heavily rely on heuristics for regularization in order to achieve stable convergence. By building upon the duality between inference and control, we develop the view of Optimal Control as Input Estimation, devising a probabilistic stochastic optimal control formulation that iteratively infers the optimal input distributions by minimizing an upper bound of the control cost. Inference is performed through Expectation Maximization and message passing on a probabilistic graphical model of the dynamical system, and time-varying linear Gaussian feedback controllers are extracted from the joint state-action distribution. This perspective incorporates uncertainty quantification, effective initialization through priors, and the principled regularization inherent to the Bayesian treatment. Moreover, it can be shown that for deterministic linearized systems, our framework derives the maximum entropy linear quadratic optimal control law. We provide a complete and detailed derivation of our probabilistic approach and highlight its advantages in comparison to other deterministic and probabilistic solvers.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] k-Optimal: a novel approximate inference algorithm for ProbLog
    Renkens, Joris
    Van den Broeck, Guy
    Nijssen, Siegfried
    MACHINE LEARNING, 2012, 89 (03) : 215 - 231
  • [42] Robustness of Stochastic Optimal Control to Approximate Diffusion Models Under Several Cost Evaluation Criteria
    Pradhan, Somnath
    Yueksel, Serdar
    MATHEMATICS OF OPERATIONS RESEARCH, 2024, 49 (04) : 2049 - 2077
  • [43] Approximate Optimal Control of Fractional Impulsive Partial Stochastic Differential Inclusions Driven by Rosenblatt Process
    Yan, Zuomao
    APPLIED MATHEMATICS AND OPTIMIZATION, 2024, 89 (01):
  • [44] Approximate Optimal Control of Fractional Impulsive Partial Stochastic Differential Inclusions Driven by Rosenblatt Process
    Zuomao Yan
    Applied Mathematics & Optimization, 2024, 89
  • [45] "Exact" and Approximate Methods for Bayesian Inference: Stochastic Volatility Case Study
    Shapovalova, Yuliya
    ENTROPY, 2021, 23 (04)
  • [46] Compensator-based approximate optimal control for affine nonlinear systems with input constraints and unmatched disturbances
    Lu, Ke
    Liu, Chunsheng
    Sun, Jingliang
    Li, Chunhua
    Ma, Chengcheng
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (15) : 3024 - 3034
  • [47] Approximate Optimal Robust Tracking Control Based on State Error and Derivative Without Initial Admissible Input
    Li, Dongdong
    Dong, Jiuxiang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02): : 1059 - 1069
  • [48] Optimal Variational Perturbations for the Inference of Stochastic Reaction Dynamics
    Zechner, C.
    Nandy, P.
    Unger, M.
    Koeppl, H.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5336 - 5341
  • [49] Stochastic variational inference for probabilistic optimal power flows
    Loschenbrand, Markus
    ELECTRIC POWER SYSTEMS RESEARCH, 2021, 200
  • [50] DISCRETE STOCHASTIC CONTROL WITH INPUT CONSTRAINTS
    MACGREGOR, JF
    PROCEEDINGS OF THE INSTITUTION OF ELECTRICAL ENGINEERS-LONDON, 1977, 124 (08): : 732 - 734