Improving stock trend prediction with pretrain multi-granularity denoising contrastive learning

被引:0
|
作者
Wang, Mingjie [2 ]
Wang, Siyuan [3 ]
Guo, Jianxiong [1 ,2 ]
Jia, Weijia [1 ,2 ]
机构
[1] Beijing Normal Univ, Adv Inst Nat Sci, Zhuhai 519087, Peoples R China
[2] BNU, HKBU United Int Coll, Dept Comp Sci, Guangdong Key Lab AI & Multimodal Data Proc, Zhuhai 519087, Peoples R China
[3] BNU, Fac Sci & Technol, Dept Math Sci, HKBU United Int Coll, Zhuhai 519087, Peoples R China
基金
中国国家自然科学基金;
关键词
Contrastive learning; Multi-granularity data; Memory; Denoising; Stock trend prediction; Pre-training; TIME-SERIES; NOISY DATA; CLASSIFICATION; INDEX; MODEL;
D O I
10.1007/s10115-023-02006-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stock trend prediction (STP) aims to predict price fluctuation, which is critical in financial trading. The existing STP approaches only use market data with the same granularity (e.g., as daily market data). However, in the actual financial investment, there are a large number of more detailed investment signals contained in finer-grained data (e.g., high-frequency data). This motivates us to research how to leverage multi-granularity market data to capture more useful information and improve the accuracy in the task of STP. However, the effective utilization of multi-granularity data presents a major challenge. Firstly, the iteration of multi-granularity data with time will lead to more complex noise, which is difficult to extract signals. Secondly, the difference in granularity may lead to opposite target trends in the same time interval. Thirdly, the target trends of stocks with similar features can be quite different, and different sizes of granularity will aggravate this gap. In order to address these challenges, we present a self-supervised framework of multi-granularity denoising contrastive learning (MDC). Specifically, we construct a dynamic dictionary of memory, which can obtain clear and unified representations by filtering noise and aligning multi-granularity data. Moreover, we design two contrast learning modules during the fine-tuning stage to solve the differences in trends by constructing additional self-supervised signals. Besides, in the pre-training stage, we design the granularity domain adaptation module (GDA) to address the issues of temporal inconsistency and data imbalance associated with different granularity in financial data, alongside the memory self-distillation module (MSD) to tackle the challenge posed by a low signal-to-noise ratio. The GDA alleviates these complications by replacing a portion of the coarse-grained data with the preceding time step's fine-grained data, while the MSD seeks to filter out intrinsic noise by aligning the fine-grained representations with the coarse-grained representations' distribution using a self-distillation mechanism. Extensive experiments on the CSI 300 and CSI 100 datasets show that our framework stands out from the existing top-level systems and has excellent profitability in real investing scenarios.
引用
收藏
页码:2439 / 2466
页数:28
相关论文
共 50 条
  • [21] Progressive Multi-granularity Analysis for Video Prediction
    Jingwei Xu
    Bingbing Ni
    Xiaokang Yang
    International Journal of Computer Vision, 2021, 129 : 601 - 618
  • [22] FMC: Multimodal fake news detection based on multi-granularity feature fusion and contrastive learning
    Yan, Facheng
    Zhang, Mingshu
    Wei, Bin
    Ren, Kelan
    Jiang, Wen
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 109 : 376 - 393
  • [23] Multi-granularity Prediction for Scene Text Recognition
    Wang, Peng
    Da, Cheng
    Yao, Cong
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 339 - 355
  • [24] Multi-granularity Inter-class Correlation Based Contrastive Learning for Open Set Recognition
    Zhu P.-F.
    Zhang W.-Y.
    Wang Y.
    Hu Q.-H.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (04): : 1156 - 1169
  • [25] A multi-granularity combined prediction model based on fuzzy trend forecasting and particle swarm techniques
    Deng, Weihui
    Wang, Guoyin
    Zhang, Xuerui
    Xu, Ji
    Li, Guangdi
    NEUROCOMPUTING, 2016, 173 : 1671 - 1682
  • [26] MultiGran-SMILES: multi-granularity SMILES learning for molecular property prediction
    Jiang, Jing
    Zhang, Ruisheng
    Zhao, Zhili
    Ma, Jun
    Liu, Yunwu
    Yuan, Yongna
    Niu, Bojuan
    BIOINFORMATICS, 2022, 38 (19) : 4573 - 4580
  • [27] Text-enhanced Multi-Granularity Temporal Graph Learning for Event Prediction
    Han, Xiaoxue
    Ning, Yue
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 171 - 180
  • [28] Multi-granularity physicochemical-inspired molecular representation learning for property prediction
    Guan, Kai
    Wang, Hong
    Zhuang, Luhe
    Sun, Yanshen
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [29] Multi-granularity Complex Network Representation Learning
    Li, Peisen
    Wang, Guoyin
    Hu, Jun
    Li, Yun
    ROUGH SETS, IJCRS 2020, 2020, 12179 : 236 - 250
  • [30] Multi-Granularity Denoising and Bidirectional Alignment for Weakly Supervised Semantic Segmentation
    Chen, Tao
    Yao, Yazhou
    Tang, Jinhui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2960 - 2971