A model fusion method based on multi-source heterogeneous data for stock trading signal prediction

被引:2
|
作者
Chen, Xi [1 ,2 ]
Hirota, Kaoru [1 ]
Dai, Yaping [1 ]
Jia, Zhiyang [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Fujian Normal Univ, Coll Phys & Energy, Fuzhou 350117, Peoples R China
关键词
Stock trading signal prediction; Model fusion; Multi-source heterogeneous data; Sentiment analysis; PIECEWISE-LINEAR REPRESENTATION; SUPPORT VECTOR MACHINE; DIRECTION;
D O I
10.1007/s00500-022-07714-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the prediction of turning points (TPs) of time series, the improved model of integrating piecewise linear representation and weighted support vector machine (IPLR-WSVM) has achieved good performance. However, due to the single data source and the limitation of algorithm, IPLR-WSVM has encountered challenges in profitability. In this paper, a model fusion method based on multi-source heterogeneous data and different learning algorithms is proposed for the prediction of TPs (MF-MSHD). Multi-source heterogeneous data include weighted unstructured and structured information with different granularities. RF, WSVM, BPNN, GBDT, and LSTM are selected to be the learning algorithms. The differences among meta-models are constructed by different inputs and algorithms as much as possible, and a model fusion rule is designed to determine the final TPs. Moreover, the TPs are generated based on the characteristics of individual stock. For sentiment analysis, a more accurate sentiment dictionary of stock market comments is established. Specifically, the fine-grained data is introduced to jointly determine the accurate trading moment. The prediction level of the proposal improves the accuracy and profitability, and also outperforms the composite indexes. Experimental results show that the profit rate of randomly selected stocks in MF-MSHD reaches 0.5172, while the highest value is 0.2841 in single meta-model and 0.0992 in buy and hold strategy, respectively. The other indicators including the accuracy are also modified. Compared with the increases of 0.1648, 0.4051, and 0.3397 in Shanghai Composite Index, Shenzhen Composite Index, and CSI 300 Index, MF-MSHD shows higher profitability in stock trading signal prediction.
引用
收藏
页码:6587 / 6611
页数:25
相关论文
共 50 条
  • [31] Multi-source heterogeneous data fusion prediction technique for the utility tunnel fire detection
    Sun, Bin
    Li, Yan
    Zhang, Yangyang
    Guo, Tong
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 248
  • [32] Key Data Source Identification Method Based on Multi-Source Traffic Data Fusion
    Li, Shuo
    Zhang, Mengmeng
    Chen, Yongheng
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 364 - 375
  • [33] Multi-Source Data Fusion and Target Tracking of Heterogeneous Network Based on Data Mining
    Quan, Xunzhong
    Chen, Jie
    TRAITEMENT DU SIGNAL, 2021, 38 (03) : 663 - 671
  • [34] A digital twin modeling method based on multi-source crack growth prediction data fusion
    Fang, Xin
    Liu, Guijie
    Wang, Honghui
    Tian, Xiaojie
    ENGINEERING FAILURE ANALYSIS, 2023, 154
  • [35] Prediction method of coal mine gas occurrence law based on multi-source data fusion
    Jiao, Huice
    Song, Weihua
    Cao, Peng
    Jiao, Dengming
    HELIYON, 2023, 9 (06)
  • [36] A multi-source heterogeneous data analytic method for future price fluctuation prediction
    Chai, Lei
    Xu, Hongfeng
    Luo, Zhiming
    Li, Shaozi
    NEUROCOMPUTING, 2020, 418 : 11 - 20
  • [37] Technology State Control Based on Multi-source Heterogeneous Data Fusion in Manufacturing
    Yu, Jie
    Gu, Shenggao
    Zhang, Wei
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 638 - 644
  • [38] Technology State Control Based on Multi-source Heterogeneous Data Fusion in Manufacturing
    Jie Yu
    Shenggao Gu
    Wei Zhang
    International Journal of Computational Intelligence Systems, 2020, 13 : 638 - 644
  • [39] A Hybrid Photovoltaic Power Prediction Model Based on Multi-source Data Fusion and Deep Learning
    Si, Zhiyuan
    Yang, Ming
    Yu, Yixiao
    Ding, Tingting
    Li, Menglin
    2020 IEEE STUDENT CONFERENCE ON ELECTRIC MACHINES AND SYSTEMS (SCEMS 2020), 2020, : 608 - 613
  • [40] A multi-source heterogeneous spatial big data fusion method based on multiple similarity and voting decision
    Zeqiu Chen
    Jianghui Zhou
    Ruizhi Sun
    Soft Computing, 2023, 27 : 2479 - 2492