Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

被引:1
|
作者
Baard, Nicholas [1 ]
van Zyl, Terence L. [2 ]
机构
[1] Univ Witwatersrand, Comp Sci & Appl Math, Johannesburg, South Africa
[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, South Africa
关键词
Reinforcement Learning; Portfolio Selection; TD3; DDPG;
D O I
10.1109/CIFEr52523.2022.9776067
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Twin-delayed deep deterministic policy gradient algorithm for the energy management of microgrids
    Dominguez-Barbero, David
    Garcia-Gonzalez, Javier
    Sanz-Bobi, Miguel A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [2] Twin-Delayed Deep Deterministic Policy Gradient Algorithm to Control a Boost Converter in a DC Microgrid
    Muktiadji, Rifqi Firmansyah
    Ramli, Makbul A. M.
    Milyani, Ahmad H.
    ELECTRONICS, 2024, 13 (02)
  • [3] Fractional-Order Control Method Based on Twin-Delayed Deep Deterministic Policy Gradient Algorithm
    Jiao, Guangxin
    An, Zhengcai
    Shao, Shuyi
    Sun, Dong
    FRACTAL AND FRACTIONAL, 2024, 8 (02)
  • [4] Active Disturbance Rejection Control Based on Twin-Delayed Deep Deterministic Policy Gradient for an Exoskeleton
    Li, Zhong
    Guan, Xiaorong
    Liu, Chunyang
    Li, Dingzhe
    He, Long
    Cao, Yanfeng
    Long, Yi
    JOURNAL OF BIONIC ENGINEERING, 2025,
  • [5] Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control
    Cui, Qiushi
    Kim, Gyoungjae
    Weng, Yang
    ENERGIES, 2021, 14 (20)
  • [6] Optimal speed limit control for network mobility and safety: a twin-delayed deep deterministic policy gradient approach
    Afifah, Fatima
    Guo, Zhaomiao
    TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2025, 13 (01)
  • [7] Cooperative motion planning and control of a group of autonomous underwater vehicles using twin-delayed deep deterministic policy gradient
    Hadi, Behnaz
    Khosravi, Alireza
    Sarhadi, Pouria
    APPLIED OCEAN RESEARCH, 2024, 147
  • [8] Optimizing Control of Wastewater Treatment Plant With Reinforcement Learning: Technical Evaluation of Twin-Delayed Deep Deterministic Policy Gradient Agent
    Klawikowska, Zuzanna
    Grochowski, Michal
    IEEE ACCESS, 2024, 12 : 130318 - 130333
  • [9] Beyond diagonal-IRS assisted UAVs in terahertz network utilizing twin-delayed deep deterministic policy gradient approach
    Shereen S. Omar
    Ahmed M. Abd EL-Haleem
    Ibrahim I. Ibrahim
    Amany M. Saleh
    Telecommunication Systems, 2025, 88 (2)
  • [10] Twin Delayed Multi-Agent Deep Deterministic Policy Gradient
    Zhan, Mengying
    Chen, Jinchao
    Du, Chenglie
    Duan, Yuxin
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 48 - 52