Analyzing Approximate Value Iteration Algorithms

被引:0
|
作者
Ramaswamy, Arunselvan [1 ]
Bhatnagar, Shalabh [2 ]
机构
[1] Department of Computer Science, Paderborn University, Paderborn,33098, Germany
[2] Department of Computer Science and Automation, the Robert Bosch Center for Cyber-Physical Systems, Indian Institute of Science, Bengaluru,560012, India
关键词
Approximate value iteration - Dynamical system viewpoint - Fixed point theory - Fixed-point theory for set-valued function - Lyapunov function–based stability - Lyapunov's functions - Set-valued - Set-valued functions - Set-valued stochastic approximation algorithm - Stochastic approximation algorithms - System viewpoints - Value iteration;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:2138 / 2159
相关论文
共 50 条
  • [1] Analyzing Approximate Value Iteration Algorithms
    Ramaswamy, Arunselvan
    Bhatnagar, Shalabh
    MATHEMATICS OF OPERATIONS RESEARCH, 2021, : 2138 - 2159
  • [2] A PERTURBATION APPROACH TO A CLASS OF DISCOUNTED APPROXIMATE VALUE ITERATION ALGORITHMS WITH BOREL SPACES
    Vega-Amaya, Oscar
    Lopez-Borbon, Joaqun
    JOURNAL OF DYNAMICS AND GAMES, 2016, 3 (03): : 261 - 278
  • [3] Projections for Approximate Policy Iteration Algorithms
    Akrour, Riad
    Pajarinen, Joni
    Peters, Jan
    Neumann, Gerhard
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [4] Approximate value iteration with randomized policies
    de Farias, DP
    Van Roy, B
    PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 3421 - 3426
  • [5] Topological Value Iteration Algorithms
    Dai, Peng
    Mausam
    Weld, Daniel S.
    Goldsmith, Judy
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2011, 42 : 181 - 209
  • [6] Approximate Value Iteration with Temporally Extended Actions
    Mann, Timothy A.
    Mannor, Shie
    Precup, Doina
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2015, 53 : 375 - 438
  • [7] Limiting Extrapolation in Linear Approximate Value Iteration
    Zanette, Andrea
    Lazaric, Alessandro
    Kochenderfer, Mykel J.
    Brunskill, Emma
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Approximate Value Iteration Based on Numerical Quadrature
    Vinogradska, Julia
    Bischoff, Bastian
    Peters, Jan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (02): : 1330 - 1337
  • [9] Empirical Value Iteration for Approximate Dynamic Programming
    Haskell, William B.
    Jain, Rahul
    Kalathil, Dileep
    2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 495 - 500
  • [10] Value-gradient iteration with quadratic approximate value functions
    Yang, Alan
    Boyd, Stephen
    ANNUAL REVIEWS IN CONTROL, 2023, 56