Bundled Gradients Through Contact Via Randomized Smoothing

被引:20
|
作者
Suh, Hyung Ju Terry [1 ]
Pang, Tao [1 ]
Tedrake, Russ [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
关键词
Smoothing methods; Planning; Optimization; Convergence; Optimal control; Monte Carlo methods; Stochastic processes; Contact modeling; manipulation planning; optimization and optimal control; TRAJECTORY OPTIMIZATION; CONVEX; FRICTION; BODIES;
D O I
10.1109/LRA.2022.3146931
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap, and how might we use the answer to improve gradient-based methods? We believe a stochastic formulation of dynamics is one crucial ingredient. We use tools from randomized smoothing to analyze sampling-based approximations of the gradient, and formalize such approximations through the bundled gradient. We show that using the bundled gradient in lieu of the gradient mitigates fast-changing gradients of non-smooth contact dynamics modeled by the implicit time-stepping, or the penalty method. Finally, we apply the bundled gradient to optimal control using iterative MPC, introducing a novel algorithm which improves convergence over using exact gradients. Combining our algorithm with a convex implicit time-stepping formulation of contact, we show that we can tractably tackle planning-through-contact problems in manipulation.
引用
收藏
页码:4000 / 4007
页数:8
相关论文
共 50 条
  • [1] Certified Adversarial Robustness via Randomized Smoothing
    Cohen, Jeremy
    Rosenfeld, Elan
    Kolter, J. Zico
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [2] Scalable Certified Segmentation via Randomized Smoothing
    Fischer, Marc
    Baader, Maximilian
    Vechev, Martin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [3] Certified Robustness via Locally Biased Randomized Smoothing
    Anderson, Brendon G.
    Sojoudi, Somayeh
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168
  • [4] UniCR: Universally Approximated Certified Robustness via Randomized Smoothing
    Hong, Hanbin
    Wang, Binghui
    Hong, Yuan
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 86 - 103
  • [5] Towards Robust Certified Defense via Improved Randomized Smoothing
    Kumar, Satyadwyoom
    Narayan, Apurva
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Adversarial Defense via Attention-Based Randomized Smoothing
    Xu, Xiao
    Feng, Shiyu
    Wang, Zheng
    Xie, Lizhe
    Hu, Yining
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 455 - 466
  • [7] Smoothing policies and safe policy gradients
    Papini, Matteo
    Pirotta, Matteo
    Restelli, Marcello
    MACHINE LEARNING, 2022, 111 (11) : 4081 - 4137
  • [8] Smoothing policies and safe policy gradients
    Matteo Papini
    Matteo Pirotta
    Marcello Restelli
    Machine Learning, 2022, 111 : 4081 - 4137
  • [9] Certifiably-Robust Federated Adversarial Learning via Randomized Smoothing
    Chen, Cheng
    Kailkhura, Bhavya
    Goldhahn, Ryan
    Zhou, Yi
    2021 IEEE 18TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2021), 2021, : 173 - 179
  • [10] Randomized smoothing networks
    Herlihy, M
    Tirthapura, S
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2006, 66 (05) : 626 - 632