Bounded-Error LQR-Trees

被引:0
|
作者
Ames, Barrett [1 ]
Konidaris, George [2 ]
机构
[1] Duke Univ, Comp Sci, Durham, NC 27706 USA
[2] Brown Univ, Comp Sci, Providence, RI 02912 USA
来源
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2019年
关键词
D O I
10.1109/iros40897.2019.8967750
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a feedback motion planning algorithm, Bounded-Error LQR-Trees, that leverages reinforcement learning theory to find a policy with a bounded amount of error. The algorithm composes locally valid linear-quadratic regulators (LQR) into a nonlinear controller, similar to how LQR-Trees constructs its policy, but minimizes the cost of the constructed policy by minimizing the Bellman Residual, which is estimated in the overlapping regions of LQR controllers. We prove a sample-based upper bound on the true Bellman Residual, and demonstrate a five-fold reduction in cost over previous methods on a simple underactuated nonlinear system.
引用
收藏
页码:144 / 150
页数:7
相关论文
共 50 条
  • [21] SOME RESULTS ON EXPERIMENT DESIGN FOR BOUNDED-ERROR MODELS
    PRONZATO, L
    WALTER, E
    MODELLING AND SIMULATION OF SYSTEMS, 1989, 3 : 183 - 185
  • [22] Distributed Interval Observers for Bounded-Error LTI Systems
    Khajenejad, Mohammad
    Brown, Scott
    Martinez, Sonia
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 2444 - 2449
  • [23] A bounded-error approach to piecewise affine system identification
    Bemporad, A
    Garulli, A
    Paoletti, S
    Vicino, A
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2005, 50 (10) : 1567 - 1580
  • [24] Bounded-Error Subset Selection for Sparse Channel Estimtation
    Aziz, Sara Osama Abd El
    Banawan, Karim
    Alghoniemy, Masoud
    30TH INTERNATIONAL CONFERENCE ON COMPUTER THEORY AND APPLICATIONS (ICCTA 2020), 2020, : 5 - 9
  • [25] Interval constraint propagation with application to bounded-error estimation
    Jaulin, L
    AUTOMATICA, 2000, 36 (10) : 1547 - 1552
  • [26] Data-driven bounded-error fault detection
    Suarez Fabrega, Antonio J.
    Bravo Caro, Jose Manuel
    Abad Herrera, Pedro J.
    Gasca, Rafael M.
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2014, 28 (12) : 1299 - 1324
  • [27] Bounded-error tracking of time-varying parameters
    Piet-Lahanier, H., 1661, IEEE, Piscataway (39):
  • [28] Optimal experiment design for bounded-error estimation of nonlinear models
    Denis-Vidal, Liliane
    Jauberthie, Carine
    Kieffer, Michel
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 4147 - 4154
  • [29] Topology Error Identification with a Bounded-error Modeling of Measurements Uncertainty in Power System
    Wang, Bin
    He, Guang-yu
    Li, Bang-feng
    Xing, Tao
    Yu, Jia-xi
    INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL AND AUTOMATION ENGINEERING (ECAE 2013), 2013, : 367 - 371
  • [30] Prefix-based Bounded-error Estimation with Intermittent Observations
    Rutledge, Kwesi J.
    Yong, Sze Zheng
    Ozay, Necmiye
    2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 4320 - 4325