Cooperative Model-Based Reinforcement Learning for Approximate Optimal Tracking

Cited by: 0
Authors
Greene, Max L. [1]
Bell, Zachary I. [2]
Nivison, Scott A. [2]
How, Jonathan P. [3]
Dixon, Warren E. [1]
Affiliations
[1] Univ Florida, Dept Mech & Aerosp Engn, Gainesville, FL 32611 USA
[2] Air Force Res Lab, Munit Directorate, Eglin AFB, FL USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
Source
2021 AMERICAN CONTROL CONFERENCE (ACC) | 2021
Keywords
SYSTEMS;
DOI
Not available
CLC Number
TP [Automation Technology, Computer Technology];
Subject Classification Code
0812;
Abstract
This paper provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for a set of agents with homogeneous dynamics and a common tracking objective. Model-based reinforcement learning is implemented by simultaneously evaluating the Bellman error (BE) at each agent's state and, as needed, at nearby off-trajectory points throughout the state space. Each agent computes its on- and off-trajectory BE information and shares it with a centralized estimator, which updates the approximate solution to the infinite-horizon optimal tracking problem and returns the estimate to the agents. By leveraging edge computing in this way, the computational burden of BE extrapolation is split between the agents and the centralized resource. Uniformly ultimately bounded tracking of each agent's state to the desired state and convergence of the control policy to a neighborhood of the optimal policy are proven via a Lyapunov-like stability analysis.
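To make the architecture described in the abstract concrete, the following is a minimal Python sketch of the cooperative Bellman-error (BE) extrapolation loop: each agent evaluates the BE at its own state and at a few nearby off-trajectory points, and a centralized estimator fuses the pooled samples into an update of the value-function weights. The basis functions, dynamics, cost, and normalized-gradient update law below are illustrative assumptions, not the paper's exact formulation.

    import numpy as np

    # Hypothetical sketch only: the basis phi, the dynamics, the cost, and
    # the update law are placeholders, not the authors' update laws.

    def features(x):
        """Polynomial basis phi(x) for the approximation V(x) ~= W @ phi(x)."""
        return np.array([x[0]**2, x[0]*x[1], x[1]**2])

    def features_grad(x):
        """Jacobian d(phi)/dx; rows correspond to the entries of features(x)."""
        return np.array([[2*x[0], 0.0],
                         [x[1],   x[0]],
                         [0.0,    2*x[1]]])

    def dynamics(x, u):
        """Placeholder control-affine dynamics x_dot = f(x) + g(x)*u."""
        return np.array([x[1], -x[0] + u])

    def local_cost(x, u):
        """Assumed quadratic running cost."""
        return x @ x + 0.1 * u**2

    def bellman_error(W, x, u):
        """BE residual and regressor of the approximate HJB at a point x."""
        omega = features_grad(x) @ dynamics(x, u)   # regressor
        delta = local_cost(x, u) + W @ omega        # BE residual
        return delta, omega

    class CentralEstimator:
        """Centralized resource fusing on/off-trajectory BE samples."""
        def __init__(self, n_weights, lr=0.05):
            self.W = np.zeros(n_weights)
            self.lr = lr

        def update(self, samples):
            # One normalized-gradient step per pooled BE sample.
            for delta, omega in samples:
                self.W -= self.lr * delta * omega / (1.0 + omega @ omega)
            return self.W

    # Each agent evaluates the BE at its own state and at nearby
    # off-trajectory points, then ships the samples to the estimator.
    est = CentralEstimator(n_weights=3)
    rng = np.random.default_rng(0)
    for _ in range(100):
        samples = []
        for x_agent in (np.array([1.0, 0.0]), np.array([0.0, 1.0])):
            u = 0.0  # a real agent would apply its current approximate policy
            samples.append(bellman_error(est.W, x_agent, u))
            for x_off in x_agent + 0.2 * rng.standard_normal((3, 2)):
                samples.append(bellman_error(est.W, x_off, u))
        est.update(samples)
    print("estimated value-function weights:", est.W)

Pooling the normalized-gradient steps at one estimator mirrors the edge-computing split in the paper: agents supply cheap local BE evaluations while the heavier weight update runs on the shared resource.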
Pages: 1973-1978
Page count: 6
Related Papers
50 records total
  • [41] Abstraction Selection in Model-Based Reinforcement Learning
    Jiang, Nan
    Kulesza, Alex
    Singh, Satinder
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 179 - 188
  • [42] Asynchronous Methods for Model-Based Reinforcement Learning
    Zhang, Yunzhi
    Clavera, Ignasi
    Tsai, Boren
    Abbeel, Pieter
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [43] Online Constrained Model-based Reinforcement Learning
    van Niekerk, Benjamin
    Damianou, Andreas
    Rosman, Benjamin
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017
  • [44] Calibrated Model-Based Deep Reinforcement Learning
    Malik, Ali
    Kuleshov, Volodymyr
    Song, Jiaming
    Nemer, Danny
    Seymour, Harlan
    Ermon, Stefano
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [45] Skill-based Model-based Reinforcement Learning
    Shi, Lucy Xiaoyang
    Lim, Joseph J.
    Lee, Youngwoon
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 2262 - 2272
  • [46] Model-based reinforcement learning for nonlinear optimal control with practical asymptotic stability guarantees
    Kim, Yeonsoo
    Lee, Jong Min
    AICHE JOURNAL, 2020, 66 (10)
  • [47] Cooperative co-learning: A model-based approach for solving multi agent reinforcement problems
    Scherrer, B
    Charpillet, F
    14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 463 - 468
  • [48] Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints
    Kim, Yeonsoo
    Kim, Jong Woo
    AICHE JOURNAL, 2022, 68 (05)
  • [50] Model gradient: unified model and policy learning in model-based reinforcement learning
    Jia, Chengxing
    Zhang, Fuxiang
    Xu, Tian
    Pang, Jing-Cheng
    Zhang, Zongzhang
    Yu, Yang
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (04)