Analyzing Approximate Value Iteration Algorithms

Cited by: 0

Authors:

Ramaswamy, Arunselvan [1]

Bhatnagar, Shalabh [2]

Affiliations:

[1] Department of Computer Science, Paderborn University, Paderborn, 33098, Germany

[2] Department of Computer Science and Automation, the Robert Bosch Center for Cyber-Physical Systems, Indian Institute of Science, Bengaluru, 560012, India

Source:

Mathematics of Operations Research | 2022 / Vol. 47 / No. 3

Keywords:

Approximate value iteration - Dynamical system viewpoint - Fixed point theory - Fixed-point theory for set-valued function - Lyapunov function–based stability - Lyapunov's functions - Set-valued - Set-valued functions - Set-valued stochastic approximation algorithm - Stochastic approximation algorithms - System viewpoints - Value iteration

DOI:

Not available

Pages: 2138-2159

50 items in total

[1] Analyzing Approximate Value Iteration Algorithms
Ramaswamy, Arunselvan
Bhatnagar, Shalabh
MATHEMATICS OF OPERATIONS RESEARCH, 2021: 2138 - 2159
[2] A PERTURBATION APPROACH TO A CLASS OF DISCOUNTED APPROXIMATE VALUE ITERATION ALGORITHMS WITH BOREL SPACES
Vega-Amaya, Oscar
Lopez-Borbon, Joaquin
JOURNAL OF DYNAMICS AND GAMES, 2016, 3 (03): 261 - 278
[3] Projections for Approximate Policy Iteration Algorithms
Akrour, Riad
Pajarinen, Joni
Peters, Jan
Neumann, Gerhard
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[4] Approximate value iteration with randomized policies
de Farias, DP
Van Roy, B
PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000: 3421 - 3426
[5] Topological Value Iteration Algorithms
Dai, Peng
Mausam
Weld, Daniel S.
Goldsmith, Judy
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2011, 42 : 181 - 209
[6] Approximate Value Iteration with Temporally Extended Actions
Mann, Timothy A.
Mannor, Shie
Precup, Doina
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2015, 53 : 375 - 438
[7] Limiting Extrapolation in Linear Approximate Value Iteration
Zanette, Andrea
Lazaric, Alessandro
Kochenderfer, Mykel J.
Brunskill, Emma
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[8] Approximate Value Iteration Based on Numerical Quadrature
Vinogradska, Julia
Bischoff, Bastian
Peters, Jan
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (02): 1330 - 1337
[9] Empirical Value Iteration for Approximate Dynamic Programming
Haskell, William B.
Jain, Rahul
Kalathil, Dileep
2014 AMERICAN CONTROL CONFERENCE (ACC), 2014: 495 - 500
[10] Value-gradient iteration with quadratic approximate value functions
Yang, Alan
Boyd, Stephen
ANNUAL REVIEWS IN CONTROL, 2023, 56
