Bundled Gradients Through Contact Via Randomized Smoothing

被引:20
|
作者
Suh, Hyung Ju Terry [1 ]
Pang, Tao [1 ]
Tedrake, Russ [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
关键词
Smoothing methods; Planning; Optimization; Convergence; Optimal control; Monte Carlo methods; Stochastic processes; Contact modeling; manipulation planning; optimization and optimal control; TRAJECTORY OPTIMIZATION; CONVEX; FRICTION; BODIES;
D O I
10.1109/LRA.2022.3146931
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap, and how might we use the answer to improve gradient-based methods? We believe a stochastic formulation of dynamics is one crucial ingredient. We use tools from randomized smoothing to analyze sampling-based approximations of the gradient, and formalize such approximations through the bundled gradient. We show that using the bundled gradient in lieu of the gradient mitigates fast-changing gradients of non-smooth contact dynamics modeled by the implicit time-stepping, or the penalty method. Finally, we apply the bundled gradient to optimal control using iterative MPC, introducing a novel algorithm which improves convergence over using exact gradients. Combining our algorithm with a convex implicit time-stepping formulation of contact, we show that we can tractably tackle planning-through-contact problems in manipulation.
引用
收藏
页码:4000 / 4007
页数:8
相关论文
共 50 条
  • [21] Hidden Cost of Randomized Smoothing
    Mohapatra, Jeet
    Ko, Ching-Yun
    Weng, Tsui-Wei
    Chen, Pin-Yu
    Liu, Sijia
    Daniel, Luca
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [22] Double Sampling Randomized Smoothing
    Li, Linyi
    Zhang, Jiawei
    Xie, Tao
    Li, Bo
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [23] RANDOMIZED SMOOTHING FOR STOCHASTIC OPTIMIZATION
    Duchi, John C.
    Bartlett, Peter L.
    Wainwright, Martin J.
    SIAM JOURNAL ON OPTIMIZATION, 2012, 22 (02) : 674 - 701
  • [24] Certified Robustness of Community Detection against Adversarial Structural Perturbation via Randomized Smoothing
    Jia, Jinyuan
    Wang, Binghui
    Cao, Xiaoyu
    Gong, Neil Zhenqiang
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2718 - 2724
  • [25] Certified defense against patch attacks via mask-guided randomized smoothing
    Kui Zhang
    Hang Zhou
    Huanyu Bian
    Weiming Zhang
    Nenghai Yu
    Science China Information Sciences, 2022, 65
  • [26] New smoothing procedures in contact mechanics
    Chamoret, D
    Saillard, P
    Rassineux, A
    Bergheau, JM
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2004, 168 (1-2) : 107 - 116
  • [27] Certified defense against patch attacks via mask-guided randomized smoothing
    Kui ZHANG
    Hang ZHOU
    Huanyu BIAN
    Weiming ZHANG
    Nenghai YU
    ScienceChina(InformationSciences), 2022, 65 (07) : 86 - 97
  • [28] Certified defense against patch attacks via mask-guided randomized smoothing
    Zhang, Kui
    Zhou, Hang
    Bian, Huanyu
    Zhang, Weiming
    Yu, Nenghai
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (07)
  • [29] Decidability via mosaics for bundled Ockhamist logic
    Gatto, Alberto
    2015 22nd International Symposium on Temporal Representation and Reasoning (TIME), 2015, : 131 - 139
  • [30] Effecting a price squeeze through bundled pricing
    Aron, DJ
    Wildman, SS
    COMPETITION, REGULATION, AND CONVERGENCE: CURRENT TRENDS IN TELECOMMUNICATIONS POLICY RESEARCH, 1999, : 1 - 17