Bundled Gradients Through Contact Via Randomized Smoothing

被引:20
|
作者
Suh, Hyung Ju Terry [1 ]
Pang, Tao [1 ]
Tedrake, Russ [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
关键词
Smoothing methods; Planning; Optimization; Convergence; Optimal control; Monte Carlo methods; Stochastic processes; Contact modeling; manipulation planning; optimization and optimal control; TRAJECTORY OPTIMIZATION; CONVEX; FRICTION; BODIES;
D O I
10.1109/LRA.2022.3146931
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap, and how might we use the answer to improve gradient-based methods? We believe a stochastic formulation of dynamics is one crucial ingredient. We use tools from randomized smoothing to analyze sampling-based approximations of the gradient, and formalize such approximations through the bundled gradient. We show that using the bundled gradient in lieu of the gradient mitigates fast-changing gradients of non-smooth contact dynamics modeled by the implicit time-stepping, or the penalty method. Finally, we apply the bundled gradient to optimal control using iterative MPC, introducing a novel algorithm which improves convergence over using exact gradients. Combining our algorithm with a convex implicit time-stepping formulation of contact, we show that we can tractably tackle planning-through-contact problems in manipulation.
引用
收藏
页码:4000 / 4007
页数:8
相关论文
共 50 条
  • [41] Managing care through limited risk, bundled contracting
    Aldridge, K
    Evans, J
    Mills, M
    Gilbert, R
    SYSTEM OF CARE FOR CHILDREN'S MENTAL HEALTH: EXPANDING THE RESEARCH BASE, ANNUAL RESEARCH CONFERENCE, PROCEEDINGS, 1997, : 79 - 84
  • [42] Modeling breach of contract risk through bundled options
    Haksoz, Cagri
    Simsek, Koray D.
    JOURNAL OF OPERATIONAL RISK, 2010, 5 (03): : 3 - 20
  • [43] Beyond Single Path Integrated Gradients for Reliable Input Attribution via Randomized Path Sampling
    Jeon, Giyoung
    Jeong, Haedong
    Choi, Jaesik
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2052 - 2061
  • [44] VOC-Certifire: Certifiably Robust One-Shot Spectroscopic Classification via Randomized Smoothing
    Sy, Mohamed
    Ibrahim, Emad Al
    Farooq, Aamir
    ACS OMEGA, 2024, 9 (37): : 39033 - 39042
  • [45] Smoothing and non-smoothing via a flow tangent to the Ricci flow
    Erbar, Matthias
    Juillet, Nicolas
    JOURNAL DE MATHEMATIQUES PURES ET APPLIQUEES, 2018, 110 : 123 - 154
  • [46] Enhancement of spatially adaptive smoothing splines via parameterization of smoothing parameters
    Jang, Dongik
    Oh, Hee-Seok
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (02) : 1029 - 1040
  • [47] SIMPLIFIED GRAVITY INTERPRETATION BY GRADIENTS - GEOLOGICAL CONTACT
    STANLEY, JM
    GEOPHYSICS, 1977, 42 (06) : 1230 - 1235
  • [48] Kinematics of an abrasive grain in the contact zone in `Gridle' smoothing
    Pr Nauk Inst Technol Maszyn Autom Politech Wroclaw, 24 (301):
  • [49] Invariance-Aware Randomized Smoothing Certificates
    Schuchardt, Jan
    Guennemann, Stephan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] Stochastic optimization with randomized smoothing for image registration
    Sun, Wei
    Poot, Dirk H. J.
    Smal, Ihor
    Yang, Xuan
    Niessen, Wiro J.
    Klein, Stefan
    MEDICAL IMAGE ANALYSIS, 2017, 35 : 146 - 158