Improving stock trend prediction with pretrain multi-granularity denoising contrastive learning

Cited by: 0

Authors:

Wang, Mingjie [2]

Wang, Siyuan [3]

Guo, Jianxiong [1,2]

Jia, Weijia [1,2]

Affiliations:

[1] Beijing Normal Univ, Adv Inst Nat Sci, Zhuhai 519087, Peoples R China

[2] BNU, HKBU United Int Coll, Dept Comp Sci, Guangdong Key Lab AI & Multimodal Data Proc, Zhuhai 519087, Peoples R China

[3] BNU, Fac Sci & Technol, Dept Math Sci, HKBU United Int Coll, Zhuhai 519087, Peoples R China

Source:

KNOWLEDGE AND INFORMATION SYSTEMS | 2024 / Vol. 66 / Issue 04

Funding:

National Natural Science Foundation of China;

Keywords:

Contrastive learning; Multi-granularity data; Memory; Denoising; Stock trend prediction; Pre-training; TIME-SERIES; NOISY DATA; CLASSIFICATION; INDEX; MODEL;

DOI:

10.1007/s10115-023-02006-1

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Stock trend prediction (STP) aims to predict price fluctuations, which is critical in financial trading. Existing STP approaches use market data of a single granularity only (e.g., daily market data). In actual financial investment, however, finer-grained data (e.g., high-frequency data) contain a large number of more detailed investment signals. This motivates us to study how to leverage multi-granularity market data to capture more useful information and improve accuracy in the STP task. The effective utilization of multi-granularity data nevertheless presents major challenges. First, the iteration of multi-granularity data over time leads to more complex noise, from which signals are difficult to extract. Second, the difference in granularity may lead to opposite target trends within the same time interval. Third, the target trends of stocks with similar features can be quite different, and differences in granularity aggravate this gap. To address these challenges, we present a self-supervised framework of multi-granularity denoising contrastive learning (MDC). Specifically, we construct a dynamic memory dictionary, which obtains clear and unified representations by filtering noise and aligning multi-granularity data. Moreover, we design two contrastive learning modules for the fine-tuning stage that resolve the differences in trends by constructing additional self-supervised signals. In the pre-training stage, we design the granularity domain adaptation module (GDA) to address the temporal inconsistency and data imbalance associated with different granularities in financial data, alongside the memory self-distillation module (MSD) to tackle the low signal-to-noise ratio. The GDA alleviates these issues by replacing a portion of the coarse-grained data with the preceding time step's fine-grained data, while the MSD seeks to filter out intrinsic noise by aligning the fine-grained representations with the distribution of the coarse-grained representations through a self-distillation mechanism. Extensive experiments on the CSI 300 and CSI 100 datasets show that our framework outperforms existing state-of-the-art systems and achieves excellent profitability in real investing scenarios.
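The abstract gives no formulas for the GDA or MSD modules, so the following Python sketch is only an illustration of the two ideas as they are described in words, not the authors' implementation. Everything in it is an assumption: the function names `gda_replace` and `self_distillation_loss`, the `ratio` and `temperature` parameters, the random choice of which steps to replace, and the use of a KL divergence between softmax distributions as the alignment objective.

```python
import math
import random


def log_softmax(values, temperature=1.0):
    """Numerically stable log-softmax over a list of floats."""
    scaled = [v / temperature for v in values]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_total for s in scaled]


def self_distillation_loss(fine_repr, coarse_repr, temperature=1.0):
    """Assumed MSD-style objective: KL(coarse || fine).

    The coarse-grained representation acts as the target distribution and
    the fine-grained representation is pulled toward it.
    """
    log_teacher = log_softmax(coarse_repr, temperature)
    log_student = log_softmax(fine_repr, temperature)
    return sum(
        math.exp(lt) * (lt - ls) for lt, ls in zip(log_teacher, log_student)
    )


def gda_replace(coarse_seq, fine_summary_seq, ratio, rng):
    """Assumed GDA-style augmentation.

    Replaces a fraction `ratio` of coarse-grained steps with the fine-grained
    summary of the preceding step. Step 0 has no predecessor and is kept.
    `fine_summary_seq[t]` is a hypothetical per-step summary of the
    fine-grained data, aligned with `coarse_seq[t]`.
    """
    candidates = list(range(1, len(coarse_seq)))
    n_replace = int(round(ratio * len(candidates)))
    chosen = set(rng.sample(candidates, n_replace))
    return [
        fine_summary_seq[t - 1] if t in chosen else coarse_seq[t]
        for t in range(len(coarse_seq))
    ]
```

Under these assumptions, `self_distillation_loss(fine, coarse)` is zero when the two representations induce the same distribution and positive otherwise, and `gda_replace(coarse, fine, 0.5, random.Random(0))` swaps roughly half of the later coarse-grained steps for fine-grained values from the step before.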


Pages: 2439-2466

Page count: 28
