OPEN-LOOP OPTIMAL CONTROL FOR TRACKING A REFERENCE SIGNAL WITH APPROXIMATE DYNAMIC PROGRAMMING

被引:0
|
作者
Diaz, Jorge A. [1 ]
Xu, Lei [2 ]
Sardarmehni, Tohid [3 ]
机构
[1] Univ Texas Rio Grande Valley, Dept Mech Engn, Edinburg, TX 78539 USA
[2] Kent State Univ, Dept Comp Sci, Kent, OH 44242 USA
[3] Calif State Univ Northridge, Dept Mech Engn, Northridge, CA 91330 USA
基金
美国国家科学基金会;
关键词
optimal control; approximate dynamic programming; dynamic programming; neural networks;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming (DP) provides a systematic, closed-loop solution for optimal control problems. However, it suffers from the curse of dimensionality in higher orders. Approximate dynamic programming (ADP) methods can remedy this by finding near-optimal rather than exact optimal solutions. In summary, ADP uses function approximators, such as neural networks, to approximate optimal control solutions. ADP can then converge to the near-optimal solution using techniques such as reinforcement learning (RL). The two main challenges in using this approach are finding a proper training domain and selecting a suitable neural network architecture for precisely approximating the solutions with RL. Users select the training domain and the neural networks mostly by trial and error, which is tedious and time-consuming. This paper proposes trading the closed-loop solution provided by ADP methods for more effectively selecting the domain of training. To do so, we train a neural network using a small and moving domain around the reference signal. We asses the method's effectiveness by applying it to a widely used benchmark problem, the Van der Pol oscillator; and a real-world problem, controlling a quadrotor to track a reference trajectory. Simulation results demonstrate comparable performance to traditional methods while reducing computational requirements.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Stochastic optimal open-loop feedback control Approximate solution of the Hamiltonian system
    Marti, K.
    Stein, I.
    ADVANCES IN ENGINEERING SOFTWARE, 2015, 89 : 43 - 51
  • [2] Unified approach for open-loop optimal control
    Imura, Y.
    Naidu, D. S.
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2007, 28 (02): : 59 - 75
  • [3] Open-Loop Optimal Temperature Control in Greenhouses
    Van Henten, E. J.
    Bontsema, J.
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON HIGH TECHNOLOGY FOR GREENHOUSE SYSTEM MANAGEMENT, VOLS 1 AND 2, 2008, (801): : 629 - 635
  • [4] Optimal Tracking Control for Ship Course Using Approximate Dynamic Programming Method
    Xie Qingqing
    Luo Bin
    Tan Fuxiao
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2911 - 2916
  • [5] Open-loop Model-free Dynamic Control of a Soft Manipulator for Tracking Tasks
    Centurelli, Andrea
    Rizzo, Alessandro
    Tolu, Silvia
    Falotico, Egidio
    2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 128 - 133
  • [6] Stochastic optimal structural control: Stochastic optimal open-loop feedback control
    Marti, K.
    ADVANCES IN ENGINEERING SOFTWARE, 2012, 44 (01) : 26 - 34
  • [7] Open-loop position tracking control of a piezoceramic flexible beam using a dynamic hysteresis compensator
    Nguyen, Phuong-Bac
    Choi, Seung-Bok
    SMART MATERIALS AND STRUCTURES, 2010, 19 (12)
  • [8] Approximating open-loop and closed-loop optimal control by model predictive control
    Dontchev, Asen L.
    Kolmanovsky, Ilya, V
    Krastanov, Mikhail, I
    Veliov, Vladimir M.
    2020 EUROPEAN CONTROL CONFERENCE (ECC 2020), 2020, : 190 - 195
  • [9] OPTIMAL OPEN-LOOP CONTROL OF AN ABSORPTION PROCESS WITH STATE AND CONTROL CONSTRAINTS
    KONTARATOS, D
    PINGLOT, D
    RAIRO-AUTOMATIQUE-SYSTEMS ANALYSIS AND CONTROL, 1979, 13 (02): : 203 - 217
  • [10] Optimal boundary control of a tracking problem for a parabolic distributed system with open-loop control using evolutionary algorithms
    Stonier, RJ
    Drumm, MJ
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XII, PROCEEDINGS: INDUSTRIAL SYSTEMS AND ENGINEERING II, 2002, : 175 - 180