Transaction-aware inverse reinforcement learning for trading in stock markets

被引:1
|
作者
Sun, Qizhou [1 ]
Gong, Xueyuan [2 ]
Si, Yain-Whar [1 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Ave Univ, Macau, Peoples R China
[2] Jinan Univ, Sch Intelligent Syst Sci & Engn, Skinny Dog Rd, Guangzhou, Peoples R China
关键词
Finance; Transaction-aware; Inverse reinforcement learning; Algorithmic trading;
D O I
10.1007/s10489-023-04959-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training automated trading agents is a long-standing topic that has been widely discussed in artificial intelligence for the quantitative finance. Reinforcement learning (RL) is designed to solve the sequential decision-making tasks, like the stock trading. The output of the RL is the policy which can be presented as the probability values of the possible actions based on a given state. The policy is optimized by the reward function. However, even if the profit is considered as the natural reward function, a trading agent equipped with an RL model has several serious problems. Specifically, profit is only obtained after executing sell action, different profits exist at the same time step due to the varying-length transactions and the hold action deals with two opposite states, empty or nonempty position. To alleviate these shortcomings, in this paper, we introduce a new trading action called wait for the empty position status and design the appropriate rewards to all actions. Based on the new action space and reward functions, a novel approach named Transaction-aware Inverse Reinforcement Learning (TAIRL) is proposed. TAIRL rewards all trading actions for avoiding the reward bias and dilemma. TAIRL is evaluated by backtesting on 12 stocks of US, UK and China stock markets, and compared against other state-of-art RL methods and moving average trading methods. The experimental results show that the agent of TAIRL achieves the state-of-art performance in profitability and anti-risk ability.
引用
收藏
页码:28186 / 28206
页数:21
相关论文
共 50 条
  • [1] Transaction-aware inverse reinforcement learning for trading in stock markets
    Qizhou Sun
    Xueyuan Gong
    Yain-Whar Si
    Applied Intelligence, 2023, 53 : 28186 - 28206
  • [2] Impact of combinatorial optimization on reinforcement learning for stock trading in financial markets
    Santos, Guilherme Dourado
    Lima, Karla R. P. S.
    PROCEEDINGS OF THE 20TH BRAZILIAN SYMPOSIUM ON INFORMATIONS SYSTEMS, SBSI 2024, 2024,
  • [3] Reinforcement Learning in Stock Trading
    Quang-Vinh Dang
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING (ICCSAMA 2019), 2020, 1121 : 311 - 322
  • [4] taTHP: A Transaction-Aware protocol for Web Services transaction coordination
    Xu, Wei
    Yang, Zongkai
    Liu, Wei
    Xu, Ling
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 965 - +
  • [5] Reinforcement Learning for Stock Option Trading
    Garza, James
    2023 31ST IRISH CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COGNITIVE SCIENCE, AICS, 2023,
  • [6] A transaction-aware coordination protocol for Web services composition
    Xu, Wei
    Cheng, Wenqing
    Liu, Wei
    WEB INFORMATION SYSTEMS - WISE 2006, PROCEEDINGS, 2006, 4255 : 126 - 131
  • [7] Transaction-aware SSD Cache Allocation for the Virtualization Environment
    Tang, Zhen
    Wu, Heng
    Sun, Lei
    Ren, Zhongshan
    Wang, Wei
    Zhou, Wei
    Yang, Liang
    12TH IEEE SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING (SOSE 2018) / 9TH INTERNATIONAL WORKSHOP ON JOINT CLOUD COMPUTING (JCC 2018), 2018, : 174 - 179
  • [8] Goldilocks: A Race and Transaction-Aware Java']Java Runtime
    Elmas, Tayfun
    Qadeer, Shaz
    Tasiran, Serdar
    PLDI'07: PROCEEDINGS OF THE 2007 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION, 2007, : 245 - 255
  • [9] Goldilocks: A race and transaction-aware Java']Java runtime
    Elmas, Tayfun
    Qadeer, Shaz
    Tasiran, Serdar
    ACM SIGPLAN NOTICES, 2007, 42 (06) : 245 - 255
  • [10] Transaction-aware network-on-chip resource reservation
    IME, Tsinghua University, China
    不详
    不详
    不详
    IEEE Comput. Archit. Lett., 2008, 2 (53-56):