Prediction of stock price movement using an improved NSGA-II-RF algorithm with a three-stage feature engineering process

被引:1
|
作者
Zeng, Xiaohua [1 ]
Cai, Jieping [1 ]
Liang, Changzhou [1 ]
Yuan, Chiping [1 ,2 ]
机构
[1] Guangzhou Xinhua Univ, Sch Econ & Trade, Dongguan, Peoples R China
[2] Sun Yat Sen Univ, Lingnan Coll, Guangzhou, Peoples R China
来源
PLOS ONE | 2023年 / 18卷 / 06期
关键词
PARTICLE SWARM OPTIMIZATION; MICRO PLANAR COMBUSTOR; FEATURE-SELECTION; NEURAL-NETWORK; TUBE OUTLET; INFORMATION; PERFORMANCE; RELEVANCE; DIRECTION; MODEL;
D O I
10.1371/journal.pone.0287754
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prediction of stock price has been a hot topic in artificial intelligence field. Computational intelligent methods such as machine learning or deep learning are explored in the prediction system in recent years. However, making accurate predictions of stock price direction is still a big challenge because stock prices are affected by nonlinear, nonstationary, and high dimensional features. In previous works, feature engineering was overlooked. How to select the optimal feature sets that affect stock price is a prominent solution. Hence, our motivation for this article is to propose an improved many-objective optimization algorithm integrating random forest (I-NSGA-II-RF) algorithm with a three-stage feature engineering process in order to decrease the computational complexity and improve the accuracy of prediction system. Maximizing accuracy and minimizing the optimal solution set are the optimization directions of the model in this study. The integrated information initialization population of two filtered feature selection methods is used to optimize the I-NSGA-II algorithm, using multiple chromosome hybrid coding to synchronously select features and optimize model parameters. Finally, the selected feature subset and parameters are input to the RF for training, prediction, and iterative optimization. Experimental results show that the I-NSGA-II-RF algorithm has the highest average accuracy, the smallest optimal solution set, and the shortest running time compared to the unmodified multi-objective feature selection algorithm and the single target feature selection algorithm. Compared to the deep learning model, this model has interpretability, higher accuracy, and less running time.
引用
收藏
页数:30
相关论文
共 17 条
  • [1] Prediction of stock price direction using a hybrid GA-XGBoost algorithm with a three-stage feature engineering process
    Yun, Kyung Keun
    Yoon, Sang Won
    Won, Daehan
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
  • [2] Deep learning-based feature engineering for stock price movement prediction
    Long, Wen
    Lu, Zhichen
    Cui, Lingxiao
    KNOWLEDGE-BASED SYSTEMS, 2019, 164 : 163 - 173
  • [3] Hybrid model using three-stage algorithm for simultaneous load and price forecasting
    Nazar, Mehrdad Setayesh
    Fard, Ashkan Eslami
    Heidari, Alireza
    Shafie-khah, Miadreza
    Catalao, Joao P. S.
    ELECTRIC POWER SYSTEMS RESEARCH, 2018, 165 : 214 - 228
  • [4] Air quality index prediction based on three-stage feature engineering, model matching, and optimized ensemble
    Yucheng Yin
    Hui Liu
    Air Quality, Atmosphere & Health, 2023, 16 : 1871 - 1890
  • [5] Air quality index prediction based on three-stage feature engineering, model matching, and optimized ensemble
    Yin, Yucheng
    Liu, Hui
    AIR QUALITY ATMOSPHERE AND HEALTH, 2023, 16 (09): : 1871 - 1890
  • [6] COVID-19 vaccine prediction based on an interpretable CNN-LSTM model with three-stage feature engineering
    Altarawneh, Lubna
    Wang, Hao
    Jin, Yu
    HEALTH AND TECHNOLOGY, 2024, 14 (06) : 1241 - 1261
  • [7] A new algorithm using front prediction and NSGA-II for solving two and three-objective optimization problems
    Salim Fettaka
    Jules Thibault
    Yash Gupta
    Optimization and Engineering, 2015, 16 : 713 - 736
  • [8] A new algorithm using front prediction and NSGA-II for solving two and three-objective optimization problems
    Fettaka, Salim
    Thibault, Jules
    Gupta, Yash
    OPTIMIZATION AND ENGINEERING, 2015, 16 (04) : 713 - 736
  • [9] Multi-objective optimization method using an improved NSGA-II algorithm for oil-gas production process
    Liu, Tan
    Gao, Xianwen
    Wang, Lina
    JOURNAL OF THE TAIWAN INSTITUTE OF CHEMICAL ENGINEERS, 2015, 57 : 42 - 53
  • [10] An Improved Three-Stage Inversion Algorithm in Forest Height Estimation Using Single-Baseline Polarimetric SAR Interferometry Data
    Managhebi, Tayebe
    Maghsoudi, Yasser
    Zoej, Mohammad Javad Valadan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (06) : 887 - 891