Dynamic Programming or Direct Comparison?

被引:0
|
作者
Cao, Xi-Ren [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
关键词
MARKOV DECISION-PROCESSES; TRANSACTION COSTS; OPTIMIZATION; OPTIMALITY; PORTFOLIO;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The standard approach to stochastic control is dynamic programming. In our recent research, we proposed an alternative approach based on direct comparison of the performance of any two policies. This approach has a number of advantages: the results may be derived in a simple and intuitive way; the approach applies to different optimization problems, including finite and infinite horizon, discounting and average performance, discrete time discrete states and continuous time and continuous stats, etc., in the same way; and it may be generalized to some non-standard problems where dynamic programming fails. This approach also links stochastic control to perturbation analysis, reinforcement learning and other research subjects in optimization, which may stimulate new research directions.
引用
收藏
页码:59 / 76
页数:18
相关论文
共 50 条
  • [1] Direct neural dynamic programming
    Yang, L
    Enns, R
    Wang, YT
    Si, J
    STABILITY AND CONTROL OF DYNAMICAL SYSTEMS WITH APPLICATIONS: A TRIBUTE TO ANTHONY N. MICHEL, 2003, : 193 - 214
  • [2] OPTIMIZATION OF STAGE SYSTEMS . COMPARISON OF DYNAMIC PROGRAMMING PONTRYAGIN METHOD AND A DIRECT METHOD
    FERRARIS, GB
    QUADERNI DELL INGEGNERE CHIMICO ITALIANO, 1969, 5 (03): : 27 - &
  • [3] A boundedness result for the direct heuristic dynamic programming
    Liu, Feng
    Sun, Jian
    Si, Jennie
    Guo, Wentao
    Mei, Shengwei
    NEURAL NETWORKS, 2012, 32 : 229 - 235
  • [4] Direct Heuristic Dynamic Programming with Augmented States
    Sun, Jian
    Liu, Feng
    Si, Jennie
    Mei, Shengwei
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 3112 - 3119
  • [5] Text comparison based on dynamic programming
    Pertsemlidis, A
    Garner, HR
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2004, 23 (06): : 66 - 71
  • [6] SEQUENCE COMPARISON BY DYNAMIC-PROGRAMMING
    DELCOIGNE, A
    HANSEN, P
    BIOMETRIKA, 1975, 62 (03) : 661 - 664
  • [7] Fuzzy dynamic programming for robust direct load control
    Natl Cheng Kung Univ, Tainan, Taiwan
    Proc Int Conf Energy Manage Power Delivery EMPD, (564-569):
  • [8] Fuzzy dynamic programming for robust direct load control
    Huang, KY
    Yang, HT
    Liao, CC
    Huang, CL
    PROCEEDINGS OF EMPD '98 - 1998 INTERNATIONAL CONFERENCE ON ENERGY MANAGEMENT AND POWER DELIVERY, VOLS 1 AND 2 AND SUPPLEMENT, 1998, : 564 - 569
  • [9] Adaptive Dynamic Programming for Direct Current Servo Motor
    Zhu, Liao
    Song, Ruizhuo
    Xie, Yulong
    Li, Junsong
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 731 - 740
  • [10] An Integrated Design for Intensified Direct Heuristic Dynamic Programming
    Luo, Xiong
    Si, Jennie
    Zhou, Yuchao
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2013, : 183 - 190