A model fusion method based on multi-source heterogeneous data for stock trading signal prediction

被引:2
|
作者
Chen, Xi [1 ,2 ]
Hirota, Kaoru [1 ]
Dai, Yaping [1 ]
Jia, Zhiyang [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Fujian Normal Univ, Coll Phys & Energy, Fuzhou 350117, Peoples R China
关键词
Stock trading signal prediction; Model fusion; Multi-source heterogeneous data; Sentiment analysis; PIECEWISE-LINEAR REPRESENTATION; SUPPORT VECTOR MACHINE; DIRECTION;
D O I
10.1007/s00500-022-07714-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the prediction of turning points (TPs) of time series, the improved model of integrating piecewise linear representation and weighted support vector machine (IPLR-WSVM) has achieved good performance. However, due to the single data source and the limitation of algorithm, IPLR-WSVM has encountered challenges in profitability. In this paper, a model fusion method based on multi-source heterogeneous data and different learning algorithms is proposed for the prediction of TPs (MF-MSHD). Multi-source heterogeneous data include weighted unstructured and structured information with different granularities. RF, WSVM, BPNN, GBDT, and LSTM are selected to be the learning algorithms. The differences among meta-models are constructed by different inputs and algorithms as much as possible, and a model fusion rule is designed to determine the final TPs. Moreover, the TPs are generated based on the characteristics of individual stock. For sentiment analysis, a more accurate sentiment dictionary of stock market comments is established. Specifically, the fine-grained data is introduced to jointly determine the accurate trading moment. The prediction level of the proposal improves the accuracy and profitability, and also outperforms the composite indexes. Experimental results show that the profit rate of randomly selected stocks in MF-MSHD reaches 0.5172, while the highest value is 0.2841 in single meta-model and 0.0992 in buy and hold strategy, respectively. The other indicators including the accuracy are also modified. Compared with the increases of 0.1648, 0.4051, and 0.3397 in Shanghai Composite Index, Shenzhen Composite Index, and CSI 300 Index, MF-MSHD shows higher profitability in stock trading signal prediction.
引用
收藏
页码:6587 / 6611
页数:25
相关论文
共 50 条
  • [41] Dynamic quantitative assessment method of chemical safety risk based on multi-source heterogeneous data fusion
    Liu Q.
    Qu Q.
    Zhao D.
    Liu S.
    Wang J.
    Huagong Xuebao/CIESC Journal, 2021, 72 (03): : 1769 - 1777
  • [42] A multi-source heterogeneous spatial big data fusion method based on multiple similarity and voting decision
    Chen, Zeqiu
    Zhou, Jianghui
    Sun, Ruizhi
    SOFT COMPUTING, 2023, 27 (05) : 2479 - 2492
  • [43] Multi-source fusion-based security detection method for heterogeneous networks
    Wu, Hao
    Wang, Zhonghua
    COMPUTERS & SECURITY, 2018, 74 : 55 - 70
  • [44] Bearing fault diagnosis method based on multi-source heterogeneous information fusion
    Zhang, Ke
    Gao, Tianhao
    Shi, Huaitao
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2022, 33 (07)
  • [45] Multi-source Heterogeneous Data Fusion for Toxin Level Quantification
    Strelet, Eugeniu
    Wang, Zhenyu
    Peng, You
    Castillo, Ivan
    Rendall, Ricardo
    Braun, Bea
    Joswiak, Mark
    Chiang, Leo
    Reis, Marco S.
    IFAC PAPERSONLINE, 2021, 54 (03): : 67 - 72
  • [46] Configurable acquisition method of multi-source heterogeneous data based on FPGA
    Li, Zhanpeng
    Zou, Xiaofu
    Su, Yonghe
    Zhang, Changzhi
    Tao, Fei
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2021, 27 (04): : 1008 - 1020
  • [47] Analysis of Substation Joint Safety Control System and Model Based on Multi-Source Heterogeneous Data Fusion
    Wu, Bo
    Hu, Yifan
    IEEE ACCESS, 2023, 11 : 35281 - 35297
  • [48] A multi-source data fusion model for traffic flow prediction in smart cities
    Li, Xiaoqin
    Nie, Dalu
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [49] Tourism Information Data Processing Method Based on Multi-Source Data Fusion
    Li, YaoGuang
    Gan, HeChi
    JOURNAL OF SENSORS, 2021, 2021
  • [50] Prediction of Cancellation Probability of Online Car-Hailing Orders Based on Multi-source Heterogeneous Data Fusion
    Sun, Haokai
    Lv, Zhiqiang
    Li, Jianbo
    Xu, Zhihao
    Sheng, Zhaoyu
    Ma, Zhaobin
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2022), PT II, 2022, 13472 : 168 - 180