Bundled Gradients Through Contact Via Randomized Smoothing

被引:20
|
作者
Suh, Hyung Ju Terry [1 ]
Pang, Tao [1 ]
Tedrake, Russ [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
关键词
Smoothing methods; Planning; Optimization; Convergence; Optimal control; Monte Carlo methods; Stochastic processes; Contact modeling; manipulation planning; optimization and optimal control; TRAJECTORY OPTIMIZATION; CONVEX; FRICTION; BODIES;
D O I
10.1109/LRA.2022.3146931
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap, and how might we use the answer to improve gradient-based methods? We believe a stochastic formulation of dynamics is one crucial ingredient. We use tools from randomized smoothing to analyze sampling-based approximations of the gradient, and formalize such approximations through the bundled gradient. We show that using the bundled gradient in lieu of the gradient mitigates fast-changing gradients of non-smooth contact dynamics modeled by the implicit time-stepping, or the penalty method. Finally, we apply the bundled gradient to optimal control using iterative MPC, introducing a novel algorithm which improves convergence over using exact gradients. Combining our algorithm with a convex implicit time-stepping formulation of contact, we show that we can tractably tackle planning-through-contact problems in manipulation.
引用
收藏
页码:4000 / 4007
页数:8
相关论文
共 50 条
  • [31] Variable Smoothing for Convex Optimization Problems Using Stochastic Gradients
    Bot, Radu Ioan
    Boehm, Axel
    JOURNAL OF SCIENTIFIC COMPUTING, 2020, 85 (02)
  • [32] Variable Smoothing for Convex Optimization Problems Using Stochastic Gradients
    Radu Ioan Boţ
    Axel Böhm
    Journal of Scientific Computing, 2020, 85
  • [33] Local Gradients Smoothing: Defense against localized adversarial attacks
    Naseer, Muzammal
    Khan, Salman H.
    Porikli, Fatih
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1300 - 1307
  • [34] Adversarial Robustness of Deep Learning-Based Malware Detectors via (De)Randomized Smoothing
    Gibert, Daniel
    Zizzo, Giulio
    Le, Quan
    Planes, Jordi
    IEEE ACCESS, 2024, 12 : 61152 - 61162
  • [35] Controlled geometry via smoothing
    Petersen, P
    Wei, GF
    Ye, RG
    COMMENTARII MATHEMATICI HELVETICI, 1999, 74 (03) : 345 - 363
  • [36] Data-Dependent Randomized Smoothing
    Alfarra, Motasem
    Bibi, Adel
    Torr, Philip H. S.
    Ghanem, Bernard
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 64 - 74
  • [37] Randomized Smoothing for (Parallel) Stochastic Optimization
    Duchi, John C.
    Bartlett, Peter L.
    Wainwright, Martin J.
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5442 - 5444
  • [38] Surface Smoothing Procedures in Computational Contact Mechanics
    D. M. Neto
    M. C. Oliveira
    L. F. Menezes
    Archives of Computational Methods in Engineering, 2017, 24 : 37 - 87
  • [39] Surface Smoothing Procedures in Computational Contact Mechanics
    Neto, D. M.
    Oliveira, M. C.
    Menezes, L. F.
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2017, 24 (01) : 37 - 87
  • [40] The Potential for Cost Savings through Bundled Episode Payments
    Cutler, David M.
    Ghosh, Kaushik
    NEW ENGLAND JOURNAL OF MEDICINE, 2012, 366 (12): : 1075 - 1077