Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

Cited by: 0
Authors
Jin, Wanxin [1]
Wang, Zhaoran [2]
Yang, Zhuoran [3]
Mou, Shaoshuai [1]
Affiliations
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Northwestern Univ, Evanston, IL 60208 USA
[3] Princeton Univ, Princeton, NJ 08544 USA
Keywords
MODEL;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper develops Pontryagin Differentiable Programming (PDP), a methodology that establishes a unified framework for solving a broad class of learning and control tasks. PDP is distinguished from existing methods by two novel techniques. First, we differentiate through Pontryagin's Maximum Principle, which yields the analytical derivative of a trajectory with respect to tunable parameters within an optimal control system and enables end-to-end learning of dynamics, policies, and/or control objective functions. Second, we propose an auxiliary control system in the backward pass of the PDP framework; the output of this auxiliary control system is the analytical derivative of the original system's trajectory with respect to the parameters, and it can be solved iteratively using standard control tools. We investigate three learning modes of PDP: inverse reinforcement learning, system identification, and control/planning. We demonstrate the capability of PDP in each learning mode on different high-dimensional systems, including a multi-link robot arm, a 6-DoF maneuvering quadrotor, and 6-DoF rocket powered landing.
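The two techniques named in the abstract can be illustrated on a toy problem. The sketch below is not the authors' implementation: it assumes a scalar linear-quadratic problem with a single tunable parameter `theta` (the control weight), and it solves the Pontryagin conditions with one dense linear solve instead of the recursive method of the paper. The function names, the constants `a`, `b`, `q`, and the horizon are all hypothetical choices made for illustration. Differentiating the Pontryagin conditions with respect to `theta` gives conditions of the same form, with a zero initial state and an extra linear control cost `u_t * du_t`; this plays the role of the auxiliary control system, so the same solver is reused for it.

```python
import numpy as np


def solve_lq_pmp(theta, x0, lin_u, a=0.9, b=0.5, q=1.0):
    """Solve the Pontryagin conditions of a scalar LQ problem (illustrative).

    minimize  sum_{t=0}^{T-1} [0.5*q*x_t^2 + 0.5*theta*u_t^2 + lin_u[t]*u_t]
              + 0.5*q*x_T^2
    subject to x_{t+1} = a*x_t + b*u_t, with x_0 = x0 given.

    Returns (x_1..x_T, u_0..u_{T-1}, lam_1..lam_T) as three arrays.
    """
    T = len(lin_u)
    K = np.zeros((3 * T, 3 * T))
    rhs = np.zeros(3 * T)
    X, U, L = 0, T, 2 * T  # column offsets for states, controls, costates
    for t in range(T):
        # Dynamics: x_{t+1} - a*x_t - b*u_t = 0 (x_0 is known, moved to rhs)
        K[t, X + t] = 1.0
        if t == 0:
            rhs[t] = a * x0
        else:
            K[t, X + t - 1] = -a
        K[t, U + t] = -b
        # Costate: lam_{t+1} - q*x_{t+1} - a*lam_{t+2} = 0, lam_T = q*x_T
        r = T + t
        K[r, L + t] = 1.0
        K[r, X + t] = -q
        if t < T - 1:
            K[r, L + t + 1] = -a
        # Stationarity: theta*u_t + b*lam_{t+1} + lin_u[t] = 0
        r = 2 * T + t
        K[r, U + t] = theta
        K[r, L + t] = b
        rhs[r] = -lin_u[t]
    z = np.linalg.solve(K, rhs)
    return z[:T], z[T:2 * T], z[2 * T:]


def trajectory_and_derivative(theta, x0=1.0, T=10):
    """Forward pass, then the auxiliary problem that yields d(x,u)/d(theta)."""
    x, u, _ = solve_lq_pmp(theta, x0, np.zeros(T))
    # Auxiliary system: zero initial state, linear control cost equal to u_t
    dx, du, _ = solve_lq_pmp(theta, 0.0, u)
    return x, u, dx, du


def finite_difference(theta, x0=1.0, T=10, eps=1e-6):
    """Central finite differences, used only as an independent check."""
    x_p, u_p, _ = solve_lq_pmp(theta + eps, x0, np.zeros(T))
    x_m, u_m, _ = solve_lq_pmp(theta - eps, x0, np.zeros(T))
    return (x_p - x_m) / (2 * eps), (u_p - u_m) / (2 * eps)


if __name__ == "__main__":
    x, u, dx, du = trajectory_and_derivative(0.7)
    fd_dx, fd_du = finite_difference(0.7)
    print("max |dx - fd_dx|:", np.max(np.abs(dx - fd_dx)))
    print("max |du - fd_du|:", np.max(np.abs(du - fd_du)))
```

In an end-to-end learning loop, `dx` and `du` would be combined by the chain rule with the gradient of a loss on the trajectory to update `theta`. The dense solve is chosen here only for brevity and would not scale to the high-dimensional systems mentioned in the abstract.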
Pages: 14