Twin-Delayed Deep Deterministic Policy Gradient Algorithm for Portfolio Selection

被引：1

作者：

Baard, Nicholas ^{[1
]}

van Zyl, Terence L. ^{[2
]}

机构：

[1] Univ Witwatersrand, Comp Sci & Appl Math, Johannesburg, South Africa

[2] Univ Johannesburg, Inst Intelligent Syst, Johannesburg, South Africa

来源：

2022 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING AND ECONOMICS (CIFER) | 2022年

关键词：

Reinforcement Learning; Portfolio Selection; TD3; DDPG;

D O I：

10.1109/CIFEr52523.2022.9776067

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

State-of-the-art RL algorithms have shown suboptimal performance in some market conditions with regard to the portfolio selection problem. The reason for suboptimal performance could be due to overestimation bias in actor-critic methods through the use of neural networks as the function approximator. The resulting bias leads to a suboptimal policy being learned by the agent, hindering performance. This research focuses on using the Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithm for portfolio selection to achieve greater results than previously achieved. In addition, an analysis of the overall effectiveness of the algorithm in various market conditions is needed to determine the TD3's robustness. This research establishes a RL environment for portfolio selection and trains the TD3 alongside three state-of-the-art algorithms in five different market conditions. The algorithms are tested by allowing the agent to manage a portfolio in each market for a specified period. The results are used for the analysis of the algorithms. The research shows improved results achieved by the TD3 algorithm for portfolio selection compared to other state-of-the-art algorithms. Furthermore, the performance of the TD3 across the five selected markets proves the robustness of the algorithm in its use for the portfolio selection problem.

引用

页数：8

共 50 条

[21] Twin delayed deep deterministic policy gradient-based intelligent computation offloading for IoT
Siguang Chen
Bei Tang
Kun Wang
Digital Communications and Networks, 2023, 9 (04) : 836 - 845
[22] Twin delayed deep deterministic policy gradient-based intelligent computation offloading for IoT
Chen, Siguang
Tang, Bei
Wang, Kun
DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (04) : 836 - 845
[23] Study on indoor temperature optimal control of air-conditioning based on Twin Delayed Deep Deterministic policy gradient algorithm
Li, Wei
Wu, Hongji
Zhao, Yifan
Jiang, Changwei
Zhang, Jili
ENERGY AND BUILDINGS, 2024, 317
[24] Twin actor twin delayed deep deterministic policy gradient (TATD3) learning for batch process control
Joshi, Tanuja
Makker, Shikhar
Kodamana, Hariprasad
Kandath, Harikumar
COMPUTERS & CHEMICAL ENGINEERING, 2021, 155
[25] Episodic Memory-Double Actor-Critic Twin Delayed Deep Deterministic Policy Gradient
Shu, Man
Lu, Shuai
Gong, Xiaoyu
An, Daolong
Li, Songlin
NEURAL NETWORKS, 2025, 187
[26] Path Planning Method for Manipulators Based on Improved Twin Delayed Deep Deterministic Policy Gradient and RRT*
Cai, Ronggui
Li, Xiao
APPLIED SCIENCES-BASEL, 2024, 14 (07):
[27] Rank Selection Method of CP Decomposition Based on Deep Deterministic Policy Gradient Algorithm
Zhang, Shaoshuang
Li, Zhao
Liu, Wenlong
Zhao, Jiaqi
Qin, Ting
IEEE ACCESS, 2024, 12 : 97374 - 97385
[28] Deep deterministic policy gradient algorithm: A systematic review
Sumiea, Ebrahim Hamid
Abdulkadir, Said Jadid
Alhussian, Hitham Seddig
Al-Selwi, Safwan Mahmood
Alqushaibi, Alawi
Ragab, Mohammed Gamal
Fati, Suliman Mohamed
HELIYON, 2024, 10 (09)
[29] Deep deterministic policy gradient algorithm for UAV control
Huang X.
Liu J.
Jia C.
Wang Z.
Zhang J.
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (11):
[30] The Control Method of Twin Delayed Deep Deterministic Policy Gradient with Rebirth Mechanism to Multi-DOF Manipulator
Hou, Yangyang
Hong, Huajie
Sun, Zhaomei
Xu, Dasheng
Zeng, Zhe
ELECTRONICS, 2021, 10 (07)

← 1 2 3 4 5 →