A model fusion method based on multi-source heterogeneous data for stock trading signal prediction

被引:2
|
作者
Chen, Xi [1 ,2 ]
Hirota, Kaoru [1 ]
Dai, Yaping [1 ]
Jia, Zhiyang [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Fujian Normal Univ, Coll Phys & Energy, Fuzhou 350117, Peoples R China
关键词
Stock trading signal prediction; Model fusion; Multi-source heterogeneous data; Sentiment analysis; PIECEWISE-LINEAR REPRESENTATION; SUPPORT VECTOR MACHINE; DIRECTION;
D O I
10.1007/s00500-022-07714-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the prediction of turning points (TPs) of time series, the improved model of integrating piecewise linear representation and weighted support vector machine (IPLR-WSVM) has achieved good performance. However, due to the single data source and the limitation of algorithm, IPLR-WSVM has encountered challenges in profitability. In this paper, a model fusion method based on multi-source heterogeneous data and different learning algorithms is proposed for the prediction of TPs (MF-MSHD). Multi-source heterogeneous data include weighted unstructured and structured information with different granularities. RF, WSVM, BPNN, GBDT, and LSTM are selected to be the learning algorithms. The differences among meta-models are constructed by different inputs and algorithms as much as possible, and a model fusion rule is designed to determine the final TPs. Moreover, the TPs are generated based on the characteristics of individual stock. For sentiment analysis, a more accurate sentiment dictionary of stock market comments is established. Specifically, the fine-grained data is introduced to jointly determine the accurate trading moment. The prediction level of the proposal improves the accuracy and profitability, and also outperforms the composite indexes. Experimental results show that the profit rate of randomly selected stocks in MF-MSHD reaches 0.5172, while the highest value is 0.2841 in single meta-model and 0.0992 in buy and hold strategy, respectively. The other indicators including the accuracy are also modified. Compared with the increases of 0.1648, 0.4051, and 0.3397 in Shanghai Composite Index, Shenzhen Composite Index, and CSI 300 Index, MF-MSHD shows higher profitability in stock trading signal prediction.
引用
收藏
页码:6587 / 6611
页数:25
相关论文
共 50 条
  • [21] Prediction method of rockburst in deep buried tunnel based on multi-source data fusion
    Zhang P.
    Ren S.
    Wu F.
    Liu Y.
    Chen X.
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2024, 54 (03): : 707 - 716
  • [22] A practical prediction method for grinding accuracy based on multi-source data fusion in manufacturing
    Wu, Haipeng
    Li, Zhihang
    Tang, Qian
    Zhang, Penghui
    Xia, Dong
    Zhao, Lianchang
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2023, 127 (3-4): : 1407 - 1417
  • [23] Multi-source Heterogeneous Data Fusion Algorithm Based on Federated Learning
    Zhou, Jincheng
    Lei, Yang
    SOFT COMPUTING IN DATA SCIENCE, SCDS 2023, 2023, 1771 : 46 - 60
  • [24] A graph-based approach to multi-source heterogeneous information fusion in stock market
    Wang, Jun
    Li, Xiaohan
    Jia, Huading
    Peng, Tao
    PLOS ONE, 2022, 17 (08):
  • [25] Evaluation model of aluminum electrolysis cell condition based on multi-source heterogeneous data fusion
    Yubo Sun
    Weihua Gui
    Xiaofang Chen
    Yongfang Xie
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1375 - 1396
  • [26] A multi-source heterogeneous data fusion method for intelligent systems in the Internet of Things
    Sun, Rongrong
    Ren, Yuemei
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 23
  • [27] Evaluation model of aluminum electrolysis cell condition based on multi-source heterogeneous data fusion
    Sun, Yubo
    Gui, Weihua
    Chen, Xiaofang
    Xie, Yongfang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (04) : 1375 - 1396
  • [28] Multi-source Heterogeneous Data Fusion Method for Pipe Gallery Condition Monitoring
    Wang, Gang
    Liu, Jingwen
    Li, Guopeng
    Li, Zhilei
    Gong, Zhidan
    Huang, Wenlin
    Wang, Helan
    Cai, Guoyuan
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 1359 - 1363
  • [29] Multi-source Data Fusion Method Based on Difference Information
    Wang, Shu
    Ren, Yu
    Guan, Zhan-Xu
    Wang, Jing
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2021, 42 (09): : 1246 - 1253
  • [30] Habitat Prediction of Northwest Pacific Saury Based on Multi-Source Heterogeneous Remote Sensing Data Fusion
    Han, Yanling
    Guo, Junyan
    Ma, Zhenling
    Wang, Jing
    Zhou, Ruyan
    Zhang, Yun
    Hong, Zhonghua
    Pan, Haiyan
    REMOTE SENSING, 2022, 14 (19)