Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

被引:1
|
作者
Baard, Nicholas [1 ]
van Zyl, Terence L. [2 ]
机构
[1] Univ Witwatersrand, Comp Sci & Appl Math, Johannesburg, South Africa
[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, South Africa
关键词
Reinforcement Learning; Portfolio Selection; TD3; DDPG;
D O I
10.1109/CIFEr52523.2022.9776067
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Twin delayed deep deterministic policy gradient-based intelligent computation offloading for IoT
    Siguang Chen
    Bei Tang
    Kun Wang
    Digital Communications and Networks, 2023, 9 (04) : 836 - 845
  • [22] Twin delayed deep deterministic policy gradient-based intelligent computation offloading for IoT
    Chen, Siguang
    Tang, Bei
    Wang, Kun
    DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (04) : 836 - 845
  • [23] Study on indoor temperature optimal control of air-conditioning based on Twin Delayed Deep Deterministic policy gradient algorithm
    Li, Wei
    Wu, Hongji
    Zhao, Yifan
    Jiang, Changwei
    Zhang, Jili
    ENERGY AND BUILDINGS, 2024, 317
  • [24] Twin actor twin delayed deep deterministic policy gradient (TATD3) learning for batch process control
    Joshi, Tanuja
    Makker, Shikhar
    Kodamana, Hariprasad
    Kandath, Harikumar
    COMPUTERS & CHEMICAL ENGINEERING, 2021, 155
  • [25] Episodic Memory-Double Actor-Critic Twin Delayed Deep Deterministic Policy Gradient
    Shu, Man
    Lu, Shuai
    Gong, Xiaoyu
    An, Daolong
    Li, Songlin
    NEURAL NETWORKS, 2025, 187
  • [26] Path Planning Method for Manipulators Based on Improved Twin Delayed Deep Deterministic Policy Gradient and RRT*
    Cai, Ronggui
    Li, Xiao
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [27] Rank Selection Method of CP Decomposition Based on Deep Deterministic Policy Gradient Algorithm
    Zhang, Shaoshuang
    Li, Zhao
    Liu, Wenlong
    Zhao, Jiaqi
    Qin, Ting
    IEEE ACCESS, 2024, 12 : 97374 - 97385
  • [28] Deep deterministic policy gradient algorithm: A systematic review
    Sumiea, Ebrahim Hamid
    Abdulkadir, Said Jadid
    Alhussian, Hitham Seddig
    Al-Selwi, Safwan Mahmood
    Alqushaibi, Alawi
    Ragab, Mohammed Gamal
    Fati, Suliman Mohamed
    HELIYON, 2024, 10 (09)
  • [29] Deep deterministic policy gradient algorithm for UAV control
    Huang X.
    Liu J.
    Jia C.
    Wang Z.
    Zhang J.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (11):
  • [30] The Control Method of Twin Delayed Deep Deterministic Policy Gradient with Rebirth Mechanism to Multi-DOF Manipulator
    Hou, Yangyang
    Hong, Huajie
    Sun, Zhaomei
    Xu, Dasheng
    Zeng, Zhe
    ELECTRONICS, 2021, 10 (07)