Bundled Gradients Through Contact Via Randomized Smoothing

被引：20

作者：

Suh, Hyung Ju Terry ^{[1
]}

Pang, Tao ^{[1
]}

Tedrake, Russ ^{[1
]}

机构：

[1] MIT, CSAIL, Cambridge, MA 02139 USA

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2022年 / 7卷 / 02期

关键词：

Smoothing methods; Planning; Optimization; Convergence; Optimal control; Monte Carlo methods; Stochastic processes; Contact modeling; manipulation planning; optimization and optimal control; TRAJECTORY OPTIMIZATION; CONVEX; FRICTION; BODIES;

D O I：

10.1109/LRA.2022.3146931

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap, and how might we use the answer to improve gradient-based methods? We believe a stochastic formulation of dynamics is one crucial ingredient. We use tools from randomized smoothing to analyze sampling-based approximations of the gradient, and formalize such approximations through the bundled gradient. We show that using the bundled gradient in lieu of the gradient mitigates fast-changing gradients of non-smooth contact dynamics modeled by the implicit time-stepping, or the penalty method. Finally, we apply the bundled gradient to optimal control using iterative MPC, introducing a novel algorithm which improves convergence over using exact gradients. Combining our algorithm with a convex implicit time-stepping formulation of contact, we show that we can tractably tackle planning-through-contact problems in manipulation.

引用

页码：4000 / 4007

页数：8

共 50 条

[21] Hidden Cost of Randomized Smoothing
Mohapatra, Jeet
Ko, Ching-Yun
Weng, Tsui-Wei
Chen, Pin-Yu
Liu, Sijia
Daniel, Luca
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[22] Double Sampling Randomized Smoothing
Li, Linyi
Zhang, Jiawei
Xie, Tao
Li, Bo
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[23] RANDOMIZED SMOOTHING FOR STOCHASTIC OPTIMIZATION
Duchi, John C.
Bartlett, Peter L.
Wainwright, Martin J.
SIAM JOURNAL ON OPTIMIZATION, 2012, 22 (02) : 674 - 701
[24] Certified Robustness of Community Detection against Adversarial Structural Perturbation via Randomized Smoothing
Jia, Jinyuan
Wang, Binghui
Cao, Xiaoyu
Gong, Neil Zhenqiang
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2718 - 2724
[25] Certified defense against patch attacks via mask-guided randomized smoothing
Kui Zhang
Hang Zhou
Huanyu Bian
Weiming Zhang
Nenghai Yu
Science China Information Sciences, 2022, 65
[26] New smoothing procedures in contact mechanics
Chamoret, D
Saillard, P
Rassineux, A
Bergheau, JM
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2004, 168 (1-2) : 107 - 116
[27] Certified defense against patch attacks via mask-guided randomized smoothing
Kui ZHANG
Hang ZHOU
Huanyu BIAN
Weiming ZHANG
Nenghai YU
ScienceChina(InformationSciences), 2022, 65 (07) : 86 - 97
[28] Certified defense against patch attacks via mask-guided randomized smoothing
Zhang, Kui
Zhou, Hang
Bian, Huanyu
Zhang, Weiming
Yu, Nenghai
SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (07)
[29] Decidability via mosaics for bundled Ockhamist logic
Gatto, Alberto
2015 22nd International Symposium on Temporal Representation and Reasoning (TIME), 2015, : 131 - 139
[30] Effecting a price squeeze through bundled pricing
Aron, DJ
Wildman, SS
COMPETITION, REGULATION, AND CONVERGENCE: CURRENT TRENDS IN TELECOMMUNICATIONS POLICY RESEARCH, 1999, : 1 - 17

← 1 2 3 4 5 →