Transaction-aware inverse reinforcement learning for trading in stock markets

被引：1

作者：

Sun, Qizhou ^{[1
]}

Gong, Xueyuan ^{[2
]}

Si, Yain-Whar ^{[1
]}

机构：

[1] Univ Macau, Dept Comp & Informat Sci, Ave Univ, Macau, Peoples R China

[2] Jinan Univ, Sch Intelligent Syst Sci & Engn, Skinny Dog Rd, Guangzhou, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 23期

关键词：

Finance; Transaction-aware; Inverse reinforcement learning; Algorithmic trading;

D O I：

10.1007/s10489-023-04959-w

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Training automated trading agents is a long-standing topic that has been widely discussed in artificial intelligence for the quantitative finance. Reinforcement learning (RL) is designed to solve the sequential decision-making tasks, like the stock trading. The output of the RL is the policy which can be presented as the probability values of the possible actions based on a given state. The policy is optimized by the reward function. However, even if the profit is considered as the natural reward function, a trading agent equipped with an RL model has several serious problems. Specifically, profit is only obtained after executing sell action, different profits exist at the same time step due to the varying-length transactions and the hold action deals with two opposite states, empty or nonempty position. To alleviate these shortcomings, in this paper, we introduce a new trading action called wait for the empty position status and design the appropriate rewards to all actions. Based on the new action space and reward functions, a novel approach named Transaction-aware Inverse Reinforcement Learning (TAIRL) is proposed. TAIRL rewards all trading actions for avoiding the reward bias and dilemma. TAIRL is evaluated by backtesting on 12 stocks of US, UK and China stock markets, and compared against other state-of-art RL methods and moving average trading methods. The experimental results show that the agent of TAIRL achieves the state-of-art performance in profitability and anti-risk ability.

引用

页码：28186 / 28206

页数：21

共 50 条

[1] Transaction-aware inverse reinforcement learning for trading in stock markets
Qizhou Sun
Xueyuan Gong
Yain-Whar Si
Applied Intelligence, 2023, 53 : 28186 - 28206
[2] Impact of combinatorial optimization on reinforcement learning for stock trading in financial markets
Santos, Guilherme Dourado
Lima, Karla R. P. S.
PROCEEDINGS OF THE 20TH BRAZILIAN SYMPOSIUM ON INFORMATIONS SYSTEMS, SBSI 2024, 2024,
[3] Reinforcement Learning in Stock Trading
Quang-Vinh Dang
ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING (ICCSAMA 2019), 2020, 1121 : 311 - 322
[4] taTHP: A Transaction-Aware protocol for Web Services transaction coordination
Xu, Wei
Yang, Zongkai
Liu, Wei
Xu, Ling
TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 965 - +
[5] Reinforcement Learning for Stock Option Trading
Garza, James
2023 31ST IRISH CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COGNITIVE SCIENCE, AICS, 2023,
[6] A transaction-aware coordination protocol for Web services composition
Xu, Wei
Cheng, Wenqing
Liu, Wei
WEB INFORMATION SYSTEMS - WISE 2006, PROCEEDINGS, 2006, 4255 : 126 - 131
[7] Transaction-aware SSD Cache Allocation for the Virtualization Environment
Tang, Zhen
Wu, Heng
Sun, Lei
Ren, Zhongshan
Wang, Wei
Zhou, Wei
Yang, Liang
12TH IEEE SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING (SOSE 2018) / 9TH INTERNATIONAL WORKSHOP ON JOINT CLOUD COMPUTING (JCC 2018), 2018, : 174 - 179
[8] Goldilocks: A Race and Transaction-Aware Java']Java Runtime
Elmas, Tayfun
Qadeer, Shaz
Tasiran, Serdar
PLDI'07: PROCEEDINGS OF THE 2007 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION, 2007, : 245 - 255
[9] Goldilocks: A race and transaction-aware Java']Java runtime
Elmas, Tayfun
Qadeer, Shaz
Tasiran, Serdar
ACM SIGPLAN NOTICES, 2007, 42 (06) : 245 - 255
[10] Transaction-aware network-on-chip resource reservation
IME, Tsinghua University, China
不详
不详
不详
IEEE Comput. Archit. Lett., 2008, 2 (53-56):

← 1 2 3 4 5 →