Cooperative Model-Based Reinforcement Learning for Approximate Optimal Tracking

Cited: 0
Authors
Greene, Max L. [1 ]
Bell, Zachary I. [2]
Nivison, Scott A. [2 ]
How, Jonathan P. [3 ]
Dixon, Warren E. [1 ]
Affiliations
[1] Univ Florida, Dept Mech & Aerosp Engn, Gainesville, FL 32611 USA
[2] Air Force Res Lab, Munit Directorate, Eglin AFB, FL USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
Source
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021
Keywords
SYSTEMS;
DOI
Not available
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Subject Classification Code
0812
Abstract
This paper provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for a set of agents with homogeneous dynamics and common tracking objectives. Model-based reinforcement learning is implemented by simultaneously evaluating the Bellman error (BE) at the state of each agent and, as needed, at nearby off-trajectory points throughout the state space. Each agent calculates and shares its on- and off-trajectory BE information with a centralized estimator, which computes updates to the approximate solution of the infinite-horizon optimal tracking problem and shares the updated estimate with the agents. Through this edge-computing arrangement, the computational burden of BE extrapolation is divided between the agents and the centralized updating resource. Uniformly ultimately bounded tracking of each agent's state to the desired state and convergence of the control policy to a neighborhood of the optimal policy are proven via a Lyapunov-like stability analysis.
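
To make the division of labor concrete, the following Python sketch illustrates the scheme under simplifying assumptions; it is not the authors' implementation. The linear-in-parameters critic with a single shared weight vector W (the paper maintains separate actor and critic estimates), the regulation-style quadratic cost, the feature Jacobian phi_grad, the dynamics model f and g, the cost weights Q and R, and all function names are assumptions introduced for illustration. Each agent evaluates the BE at its own state and at sampled off-trajectory points, and a centralized estimator pools the reported data into normalized gradient updates on the shared value-function weights.

    import numpy as np

    def be_and_regressor(W, zeta, phi_grad, f, g, Q, R):
        # Critic: V(zeta) ~= W @ phi(zeta); the policy below is the greedy
        # policy induced by W (a simplification of the paper's separate
        # actor/critic weight estimates).
        dphi = phi_grad(zeta)                        # feature Jacobian, (p, n)
        u = -0.5 * np.linalg.solve(R, g(zeta).T @ (dphi.T @ W))
        omega = dphi @ (f(zeta) + g(zeta) @ u)       # regressor: d(delta)/dW
        delta = W @ omega + zeta @ Q @ zeta + u @ R @ u  # Bellman error
        return delta, omega

    def agent_report(W, zeta, sample_neighbors, model):
        # On-agent work: BE at the agent's own state plus nearby
        # off-trajectory (extrapolated) points.
        points = [zeta, *sample_neighbors(zeta)]
        return [be_and_regressor(W, p, **model) for p in points]

    def central_update(W, reports, gain=0.1):
        # Centralized estimator: normalized gradient steps that drive the
        # pooled Bellman errors toward zero.
        for delta, omega in reports:
            W = W - gain * delta * omega / (1.0 + omega @ omega)
        return W

Broadcasting the updated weights back to the agents after each call to central_update would close the loop, mirroring the edge-computing split described in the abstract.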
Pages: 1973-1978
Page count: 6