Improving stock trend prediction with pretrain multi-granularity denoising contrastive learning

被引:0
|
作者
Wang, Mingjie [2 ]
Wang, Siyuan [3 ]
Guo, Jianxiong [1 ,2 ]
Jia, Weijia [1 ,2 ]
机构
[1] Beijing Normal Univ, Adv Inst Nat Sci, Zhuhai 519087, Peoples R China
[2] BNU, HKBU United Int Coll, Dept Comp Sci, Guangdong Key Lab AI & Multimodal Data Proc, Zhuhai 519087, Peoples R China
[3] BNU, Fac Sci & Technol, Dept Math Sci, HKBU United Int Coll, Zhuhai 519087, Peoples R China
基金
中国国家自然科学基金;
关键词
Contrastive learning; Multi-granularity data; Memory; Denoising; Stock trend prediction; Pre-training; TIME-SERIES; NOISY DATA; CLASSIFICATION; INDEX; MODEL;
D O I
10.1007/s10115-023-02006-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stock trend prediction (STP) aims to predict price fluctuation, which is critical in financial trading. The existing STP approaches only use market data with the same granularity (e.g., as daily market data). However, in the actual financial investment, there are a large number of more detailed investment signals contained in finer-grained data (e.g., high-frequency data). This motivates us to research how to leverage multi-granularity market data to capture more useful information and improve the accuracy in the task of STP. However, the effective utilization of multi-granularity data presents a major challenge. Firstly, the iteration of multi-granularity data with time will lead to more complex noise, which is difficult to extract signals. Secondly, the difference in granularity may lead to opposite target trends in the same time interval. Thirdly, the target trends of stocks with similar features can be quite different, and different sizes of granularity will aggravate this gap. In order to address these challenges, we present a self-supervised framework of multi-granularity denoising contrastive learning (MDC). Specifically, we construct a dynamic dictionary of memory, which can obtain clear and unified representations by filtering noise and aligning multi-granularity data. Moreover, we design two contrast learning modules during the fine-tuning stage to solve the differences in trends by constructing additional self-supervised signals. Besides, in the pre-training stage, we design the granularity domain adaptation module (GDA) to address the issues of temporal inconsistency and data imbalance associated with different granularity in financial data, alongside the memory self-distillation module (MSD) to tackle the challenge posed by a low signal-to-noise ratio. The GDA alleviates these complications by replacing a portion of the coarse-grained data with the preceding time step's fine-grained data, while the MSD seeks to filter out intrinsic noise by aligning the fine-grained representations with the coarse-grained representations' distribution using a self-distillation mechanism. Extensive experiments on the CSI 300 and CSI 100 datasets show that our framework stands out from the existing top-level systems and has excellent profitability in real investing scenarios.
引用
收藏
页码:2439 / 2466
页数:28
相关论文
共 50 条
  • [31] Semantic Segmentation in Thermal Videos: A New Benchmark and Multi-Granularity Contrastive Learning-Based Framework
    Zheng, Yu
    Zhou, Fugen
    Liang, Shangying
    Song, Wentao
    Bai, Xiangzhi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 14783 - 14799
  • [32] Text-Guided Face Recognition using Multi-Granularity Cross-Modal Contrastive Learning
    Hasan, Md Mahedi
    Sami, Shoaib Meraj
    Nasrabadi, Nasser
    2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, : 5772 - 5781
  • [33] Improving PTM Site Prediction by Coupling of Multi-Granularity Structure and Multi-Scale Sequence Representation
    Li, Zhengyi
    Li, Menglu
    Zhu, Lida
    Zhang, Wen
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 188 - 196
  • [34] Multi-granularity Evolution Network for Dynamic Link Prediction
    Yang, Yi
    Gu, Xiaoyan
    Fan, Haihui
    Li, Bo
    Wang, Weiping
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 393 - 405
  • [35] Multi-granularity scenarios understanding network for trajectory prediction
    Yang, Biao
    Yang, Jicheng
    Ni, Rongrong
    Yang, Changchun
    Liu, Xiaofeng
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) : 851 - 864
  • [36] An explainable molecular property prediction via multi-granularity
    Sun, Haichao
    Wang, Guoyin
    Liu, Qun
    Yang, Jie
    Zheng, Mingyue
    INFORMATION SCIENCES, 2023, 642
  • [37] Multi-granularity scenarios understanding network for trajectory prediction
    Biao Yang
    Jicheng Yang
    Rongrong Ni
    Changchun Yang
    Xiaofeng Liu
    Complex & Intelligent Systems, 2023, 9 : 851 - 864
  • [38] Multi-Granularity Federated Learning by Graph-Partitioning
    Dai, Ziming
    Zhao, Yunfeng
    Qiu, Chao
    Wang, Xiaofei
    Yao, Haipeng
    Niyato, Dusit
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2025, 13 (01) : 18 - 33
  • [39] Information-guided signal multi-granularity contrastive feature learning for fault diagnosis with few labeled data
    Lin, Yanzhuo
    Wang, Yu
    Zhang, Mingquan
    Wang, Zenghui
    Zhang, Haijun
    Zhao, Ming
    ADVANCED ENGINEERING INFORMATICS, 2024, 61
  • [40] Multi-granularity hierarchical contrastive learning between foreground and background for semi-supervised video action detection
    Zhang, Qiming
    Hu, Zhengping
    Wang, Yulu
    Zhang, Hehao
    Di, Jirui
    KNOWLEDGE-BASED SYSTEMS, 2025, 309