Bounded-Error LQR-Trees

Cited by: 0
Authors
Ames, Barrett [1 ]
Konidaris, George [2 ]
Affiliations
[1] Duke Univ, Comp Sci, Durham, NC 27706 USA
[2] Brown Univ, Comp Sci, Providence, RI 02912 USA
DOI
10.1109/iros40897.2019.8967750
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present a feedback motion planning algorithm, Bounded-Error LQR-Trees, that leverages reinforcement learning theory to find a policy with a bounded amount of error. The algorithm composes locally valid linear-quadratic regulators (LQR) into a nonlinear controller, similar to how LQR-Trees constructs its policy, but minimizes the cost of the constructed policy by minimizing the Bellman Residual, which is estimated in the overlapping regions of LQR controllers. We prove a sample-based upper bound on the true Bellman Residual, and demonstrate a five-fold reduction in cost over previous methods on a simple underactuated nonlinear system.
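As a rough illustration of the two ingredients the abstract names, an LQR controller and a Bellman residual estimated from samples, the following minimal sketch uses a scalar discrete-time linear system. It is not the authors' algorithm: the system parameters, the function names, and the choice to take a maximum over uniformly sampled states are all assumptions made for illustration, and a sampled maximum is only an estimate, not the proven bound the abstract refers to.

```python
import random

def solve_scalar_lqr(a, b, q, r, iters=500):
    """Iterate the scalar discrete-time Riccati equation to a fixed point.

    System: x' = a*x + b*u, stage cost: q*x^2 + r*u^2 (illustrative choice).
    Returns the value coefficient p (V(x) = p*x^2) and gain k (u = -k*x).
    """
    p = q
    for _ in range(iters):
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
    k = a * b * p / (r + b * b * p)
    return p, k

def bellman_residual(x, p_hat, k, a, b, q, r):
    """|V_hat(x) - (stage cost + V_hat(x'))| under the policy u = -k*x."""
    u = -k * x
    x_next = a * x + b * u
    stage_cost = q * x * x + r * u * u
    return abs(p_hat * x * x - (stage_cost + p_hat * x_next * x_next))

def max_sampled_residual(p_hat, k, a, b, q, r, n=1000, seed=0):
    """Largest residual over n states drawn uniformly from [-1, 1]."""
    rng = random.Random(seed)
    return max(
        bellman_residual(rng.uniform(-1.0, 1.0), p_hat, k, a, b, q, r)
        for _ in range(n)
    )

# Hypothetical, open-loop unstable scalar system (a > 1).
a, b, q, r = 1.1, 0.5, 1.0, 0.1
p, k = solve_scalar_lqr(a, b, q, r)

# The exact LQR value function has (numerically) zero residual;
# a deliberately wrong value estimate does not.
exact = max_sampled_residual(p, k, a, b, q, r)
perturbed = max_sampled_residual(1.5 * p, k, a, b, q, r)
print(exact, perturbed)
```

In this toy setting the residual vanishes when the value estimate matches the true cost-to-go of the controller and grows when it does not, which is the sense in which a small Bellman residual indicates a policy whose estimated cost can be trusted.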
Pages: 144-150
Number of pages: 7