Multi-source aggregated classification for stock price movement prediction

Cited by: 52
Authors
Ma, Yu [1]
Mao, Rui [2]
Lin, Qika [3]
Wu, Peng [4]
Cambria, Erik [2]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Econ & Management, 200 Xiaolingwei Rd, Nanjing 210094, Jiangsu, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, 50 Nanyang Ave, Singapore 639798, Singapore
[3] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Intelligent Mfg, 200 Xiaolingwei Rd, Nanjing 210094, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Stock prediction; Event-driven investing; Multi-source aggregating; Sentiment analysis; Market prediction; Neural network; Public mood; Spillover; Media; News
DOI
10.1016/j.inffus.2022.10.025
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Predicting stock price movements is a challenging task. Previous studies mostly used numerical features and news sentiments of target stocks to predict stock price movements. However, their semantics-based sentiment analysis is sub-optimal to represent real market sentiments. Moreover, only considering the information of target companies is insufficient because the stock prices of target companies can be affected by their related companies. Thus, we propose a novel Multi-source Aggregated Classification (MAC) method for stock price movement prediction. MAC incorporates the numerical features and market-driven news sentiments of target stocks, as well as the news sentiments of their related stocks. To better represent real market sentiments from the news, we pre-train an embedding feature generator by fitting the news to real stock price movements. Embeddings given by the pre-trained sentiment classifier can represent the sentiment information in vector space. Moreover, MAC introduces a graph convolutional network to capture the news effects of related companies on the target stock. Finally, MAC can predict stock price movements for the next trading day based on the aforementioned features. Extensive experiments prove that MAC outperforms state-of-the-art baselines in stock price movement prediction, Sharpe Ratio, and backtesting trading incomes.
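The abstract says that MAC uses a graph convolutional network to carry the news effects of related companies over to the target stock, but it does not give the layer definition. The following is a minimal sketch of one graph-convolution step, assuming the standard symmetric-normalized propagation rule rather than the authors' exact architecture. The function name `gcn_layer`, the three-stock toy graph, the 2-dimensional sentiment embeddings, and the identity weight matrix are all hypothetical and chosen only for illustration.

```python
import numpy as np


def gcn_layer(adjacency, features, weights):
    """One graph-convolution step: ReLU(D^-1/2 (A + I) D^-1/2 X W).

    This is the widely used normalized propagation rule, assumed here;
    the abstract does not state which variant MAC uses.
    """
    a_hat = adjacency + np.eye(adjacency.shape[0])  # add self-loops
    degree = a_hat.sum(axis=1)                      # node degrees incl. self-loop
    d_inv_sqrt = 1.0 / np.sqrt(degree)
    # Symmetric normalization: scale rows and columns by D^-1/2.
    a_norm = a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(a_norm @ features @ weights, 0.0)


# Hypothetical toy graph: stock 0 is the target, linked to stocks 1 and 2.
adjacency = np.array([[0.0, 1.0, 1.0],
                      [1.0, 0.0, 0.0],
                      [1.0, 0.0, 0.0]])

# Hypothetical 2-d news-sentiment embeddings, one row per stock.
sentiment = np.array([[0.2, 0.1],
                      [0.9, 0.0],
                      [0.0, 0.8]])

weights = np.eye(2)  # identity weights keep the toy example easy to inspect

aggregated = gcn_layer(adjacency, sentiment, weights)
```

In this sketch, row 0 of `aggregated` mixes the target stock's own embedding with those of its two neighbours, which is the kind of related-company aggregation the abstract describes. A trained model would learn `weights` and would typically stack such layers before the final classifier.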
Pages: 515-528
Page count: 14