Finite-time bounds for fitted value iteration

被引:0
|
作者
Munos, Rémi [1 ]
Szepesvári, Csaba [2 ]
机构
[1] SequeL Project, INRIA Lille - Nord Europe, 40 avenue Halley, 59650 Villeneuve d'Ascq, France
[2] Department of Computing Science, University of Alberta, Edmonton T6G 2E8, Canada
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:815 / 857
相关论文
共 50 条
  • [1] Finite-time bounds for fitted value iteration
    Munos, Remi
    Szepesvari, Csaba
    JOURNAL OF MACHINE LEARNING RESEARCH, 2008, 9 : 815 - 857
  • [2] Continuous-Time Fitted Value Iteration for Robust Policies
    Lutter, Michael
    Belousov, Boris
    Mannor, Shie
    Fox, Dieter
    Garg, Animesh
    Peters, Jan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 5534 - 5548
  • [3] Finite-time error bounds for Greedy-GQ
    Wang, Yue
    Zhou, Yi
    Zou, Shaofeng
    MACHINE LEARNING, 2024, 113 (09) : 5981 - 6018
  • [4] Finite-Time Logarithmic Bayes Regret Upper Bounds
    Atsidakou, Alexia
    Kveton, Branislav
    Katariya, Sumeet
    Caramanis, Constantine
    Sanghavi, Sujay
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Bounds on fluctuations for finite-time quantum Otto cycle
    Saryal, Sushant
    Agarwalla, Bijay Kumar
    PHYSICAL REVIEW E, 2021, 103 (06)
  • [6] New performance bounds for a finite-time Carnot refrigerator
    Velasco, S
    Roco, JMM
    Medina, A
    Hernandez, AC
    PHYSICAL REVIEW LETTERS, 1997, 78 (17) : 3241 - 3244
  • [7] Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator
    Krauth, Karl
    Tu, Stephen
    Recht, Benjamin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Comment on "New performance bounds for a finite-time carnot refrigerator"
    Yan, ZJ
    Chen, JC
    PHYSICAL REVIEW LETTERS, 1998, 81 (24) : 5469 - 5469
  • [9] Finite-time error bounds for distributed linear stochastic approximation
    Lin, Yixuan
    Gupta, Vijay
    Liu, Ji
    AUTOMATICA, 2024, 159
  • [10] Finite-time bounds on the probabilistic violation of the second law of thermodynamics
    Miller, Harry J. D.
    Perarnau-Llobet, Marti
    SCIPOST PHYSICS, 2023, 14 (04):