A Unified Framework for Temporal Difference Methods

被引:0
|
作者
Bertsekas, Dimitri P. [1 ]
机构
[1] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA
关键词
FUNCTION APPROXIMATION; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a unified framework for a broad class of methods to solve projected equations that approximate the solution of a high-dimensional fixed point problem within a subspace S spanned by a small number of basis functions or features. These methods originated in approximate dynamic programming (DP), where they are collectively known as temporal difference (TD) methods. Our framework is based on a connection with projection methods for monotone variational inequalities, which involve alternative representations of the subspace S (feature scaling). Our methods admit simulation-based implementations, and even when specialized to DP problems, include extensions/new versions of the standard TD algorithms, which offer some special implementation advantages and reduced overhead.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 50 条
  • [1] Identification methods in a unified framework
    Vajk, I
    AUTOMATICA, 2005, 41 (08) : 1385 - 1393
  • [2] Penalty and barrier methods: A unified framework
    Auslender, A
    SIAM JOURNAL ON OPTIMIZATION, 1999, 10 (01) : 211 - 230
  • [3] Temporal Map Labeling: A New Unified Framework with Experiments
    Barth, Lukas
    Niedermann, Benjamin
    Nollenburg, Martin
    Strash, Darren
    24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
  • [4] Temporal Link Prediction: A Unified Framework, Taxonomy, and Review
    Qin, Meng
    Yeung, Dit-Yan
    ACM COMPUTING SURVEYS, 2024, 56 (04)
  • [5] SMOOTHING AND FIRST ORDER METHODS: A UNIFIED FRAMEWORK
    Beck, Amir
    Teboulle, Marc
    SIAM JOURNAL ON OPTIMIZATION, 2012, 22 (02) : 557 - 580
  • [6] Unified framework for two robust beamforming methods
    Liu, H
    Liao, G
    Xie, Y
    Zhang, J
    ELECTRONICS LETTERS, 2006, 42 (07) : 425 - 426
  • [7] A Unified Framework for Capturing Concept Development Methods
    Borer, Nicholas K.
    Schwartz, Jana L.
    Odegard, Ryan G.
    2009 IEEE AEROSPACE CONFERENCE, VOLS 1-7, 2009, : 4234 - +
  • [8] A Unified Taylor Framework for Revisiting Attribution Methods
    Deng, Huiqi
    Zou, Na
    Du, Mengnan
    Chen, Weifu
    Feng, Guocan
    Hu, Xia
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11462 - 11469
  • [9] A unified framework to generate optimized compact finite difference schemes
    Deshpande, Vedang M.
    Bhattacharya, Raktim
    Donzis, Diego A.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 432
  • [10] Temporal Difference Methods for General Projected Equations
    Bertsekas, Dimitri P.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (09) : 2128 - 2139