Optimal dynamic fixed-mix portfolios based on reinforcement learning with second order stochastic dominance

被引:2
|
作者
Consigli, Giorgio [1 ]
Gomez, Alvaro A. [1 ]
Zubelli, Jorge P. [1 ,2 ]
机构
[1] Khalifa Univ Sci & Technol, Dept Math, Abu Dhabi, U Arab Emirates
[2] ADIA LAB, Level 26,Al Khatem Tower, Abu Dhabi, U Arab Emirates
关键词
Fixed-mix portfolios; Stochastic dominance; Reinforcement learning; Actor-critic approach; Deep learning; Stochastic gradient; TRADING SYSTEM; RISK MEASURES; OPTIMIZATION;
D O I
10.1016/j.engappai.2024.108599
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a reinforcement learning (RL) approach to address a multiperiod optimization problem in which a portfolio manager seeks an optimal constant proportion portfolio strategy by minimizing a tail risk measure consistent with second order stochastic dominance (SSD) principles. As a risk measure, we consider in particular the Interval Conditional Value -at -Risk (ICVaR) shown to be mathematically related to SSD principles. By including the ICVaR in the reward function of an RL method we show that an optimal fixed -mix policy can be derived as solution of short- to medium -term allocation problems through an accurate specification of the learning parameters under general statistical assumptions. The financial optimization problem, thus, carries several novel features and the article details the required steps to accommodate those features within a reinforcement learning architecture. The methodology is tested in- and out -of -sample on market data showing good performance relative to the SP500, adopted as benchmark policy.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Optimal control of a class of nonlinear dynamic systems based on reinforcement learning
    Chen, Xue-Song
    Liu, Fu-Chun
    Kongzhi yu Juece/Control and Decision, 2013, 28 (12): : 1889 - 1893
  • [22] A second-order dynamic and static ship path planning model based on reinforcement learning and heuristic search algorithms
    Junfeng Yuan
    Jian Wan
    Xin Zhang
    Yang Xu
    Yan Zeng
    Yongjian Ren
    EURASIP Journal on Wireless Communications and Networking, 2022
  • [23] A second-order dynamic and static ship path planning model based on reinforcement learning and heuristic search algorithms
    Yuan, Junfeng
    Wan, Jian
    Zhang, Xin
    Xu, Yang
    Zeng, Yan
    Ren, Yongjian
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2022, 2022 (01)
  • [24] Second-Order Stochastic Dominance Constraints for Risk Management of a Wind Power Producer's Optimal Bidding Strategy
    AlAshery, Mohamed Kareem
    Xiao, Dongliang
    Qiao, Wei
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2020, 11 (03) : 1404 - 1413
  • [25] Mean-variance optimal trading problem subject to stochastic dominance constraints with second order autoregressive price dynamics
    Singh, Arti
    Selvamuthu, Dharmaraja
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2017, 86 (01) : 29 - 69
  • [26] A SMOOTHING SAA ALGORITHM FOR A PORTFOLIO CHOICE MODEL BASED ON SECOND-ORDER STOCHASTIC DOMINANCE MEASURES
    Yang, Liu
    Tong, Xiaojiao
    Xiong, Yao
    Shen, Feifei
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2020, 16 (03) : 1171 - 1185
  • [27] A reinforcement learning-based scheme for adaptive optimal control of linear stochastic systems
    Wong, Wee Chin
    Lee, Jay H.
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 57 - 62
  • [28] Stochastic optimal generation command dispatch based on improved hierarchical reinforcement learning approach
    Yu, T.
    Wang, Y. M.
    Ye, W. J.
    Zhou, B.
    Chan, K. W.
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2011, 5 (08) : 789 - 797
  • [29] Annealing based dynamic learning in second-order neural networks
    Milenkovic, S
    Obradovic, Z
    Litovski, V
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 458 - 463
  • [30] TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES
    Tao, Yebin
    Wang, Lu
    Almirall, Daniel
    ANNALS OF APPLIED STATISTICS, 2018, 12 (03): : 1914 - 1938