Innovative Sentiment Analysis and Prediction of Stock Price Using FinBERT, GPT-4 and Logistic Regression: A Data-Driven Approach

被引:1
|
作者
Shobayo, Olamilekan [1 ]
Adeyemi-Longe, Sidikat [1 ]
Popoola, Olusogo [1 ]
Ogunleye, Bayode [2 ]
机构
[1] Sheffield Hallam Univ, Sch Comp & Digital Technol, Sheffield S1 2NU, England
[2] Univ Brighton, Dept Comp & Math, Brighton BN2 4GJ, England
关键词
FinBERT model; logistic regression; FinBERT; Optuna; time series cross-validation;
D O I
10.3390/bdcc8110143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study explores the comparative performance of cutting-edge AI models, i.e., Finaance Bidirectional Encoder representations from Transsformers (FinBERT), Generatice Pre-trained Transformer GPT-4, and Logistic Regression, for sentiment analysis and stock index prediction using financial news and the NGX All-Share Index data label. By leveraging advanced natural language processing models like GPT-4 and FinBERT, alongside a traditional machine learning model, Logistic Regression, we aim to classify market sentiment, generate sentiment scores, and predict market price movements. This research highlights global AI advancements in stock markets, showcasing how state-of-the-art language models can contribute to understanding complex financial data. The models were assessed using metrics such as accuracy, precision, recall, F1 score, and ROC AUC. Results indicate that Logistic Regression outperformed the more computationally intensive FinBERT and predefined approach of versatile GPT-4, with an accuracy of 81.83% and a ROC AUC of 89.76%. The GPT-4 predefined approach exhibited a lower accuracy of 54.19% but demonstrated strong potential in handling complex data. FinBERT, while offering more sophisticated analysis, was resource-demanding and yielded a moderate performance. Hyperparameter optimization using Optuna and cross-validation techniques ensured the robustness of the models. This study highlights the strengths and limitations of the practical applications of AI approaches in stock market prediction and presents Logistic Regression as the most efficient model for this task, with FinBERT and GPT-4 representing emerging tools with potential for future exploration and innovation in AI-driven financial analytics.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Using data-driven approach for wind power prediction: A comparative study
    Renani, Ehsan Taslimi
    Elias, Mohamad Fathi Mohamad
    Rahim, Nasrudin Abd.
    ENERGY CONVERSION AND MANAGEMENT, 2016, 118 : 193 - 203
  • [32] Degradation Estimation and Prediction of Electronic Packages Using Data-Driven Approach
    Prisacaru, Alexandru
    Gromala, Przemyslaw Jakub
    Han, Bongtae
    Zhang, Gui Qi
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (03) : 2996 - 3006
  • [33] Data-driven price trends prediction of Ethereum: A hybrid machine learning and signal processing approach
    Atta Mills, Ebenezer Fiifi Emire
    Liao, Yuexin
    Deng, Zihui
    Blockchain: Research and Applications, 2024, 5 (04):
  • [34] Housing Price Analysis Using Linear Regression and Logistic Regression: A Comprehensive Explanation Using Melbourne Real Estate Data
    He, Keren
    He, Cuiwei
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING (ICOCO), 2021, : 241 - 246
  • [35] Drag prediction of rough-wall turbulent flow using data-driven regression
    Shi, Zhaoyu
    Habibi Khorasani, Seyed Morteza
    Shin, Heesoo
    Yang, Jiasheng
    Lee, Sangseung
    Bagheri, Shervin
    FLOW, 2025, 5
  • [36] An optimal deep learning-based LSTM for stock price prediction using twitter sentiment analysis
    Swathi, T.
    Kasiviswanath, N.
    Rao, A. Ananda
    APPLIED INTELLIGENCE, 2022, 52 (12) : 13675 - 13688
  • [37] A Hybrid Framework Using PCA, EMD and LSTM Methods for Stock Market Price Prediction with Sentiment Analysis
    Srijiranon, Krittakom
    Lertratanakham, Yoskorn
    Tanantong, Tanatorn
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [38] A Novel Dynamic Data-Driven Algorithmic Trading Strategy Using Joint Forecasts of Volatility and Stock Price
    Liang, You
    Thavaneswaran, Aerambamoorthy
    Paseka, Alexander
    Zhu, Zimo
    Thulasiram, Ruppa K.
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 225 - 234
  • [39] A data-driven approach for flood prediction using grid-based meteorological data
    Wang, Yizhi
    Liu, Jia
    Li, Chuanzhe
    Liu, Yuchen
    Xu, Lin
    Yu, Fuliang
    HYDROLOGICAL PROCESSES, 2023, 37 (03)
  • [40] BERT-Driven stock price trend prediction utilizing tokenized stock data and multi-step optimization approach
    Teng, Xiaojian
    Zhang, Liang
    Gao, Peiwen
    Yu, Chuanwei
    Sun, Song
    APPLIED SOFT COMPUTING, 2025, 170