Innovative Sentiment Analysis and Prediction of Stock Price Using FinBERT, GPT-4 and Logistic Regression: A Data-Driven Approach

被引:1
|
作者
Shobayo, Olamilekan [1 ]
Adeyemi-Longe, Sidikat [1 ]
Popoola, Olusogo [1 ]
Ogunleye, Bayode [2 ]
机构
[1] Sheffield Hallam Univ, Sch Comp & Digital Technol, Sheffield S1 2NU, England
[2] Univ Brighton, Dept Comp & Math, Brighton BN2 4GJ, England
关键词
FinBERT model; logistic regression; FinBERT; Optuna; time series cross-validation;
D O I
10.3390/bdcc8110143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study explores the comparative performance of cutting-edge AI models, i.e., Finaance Bidirectional Encoder representations from Transsformers (FinBERT), Generatice Pre-trained Transformer GPT-4, and Logistic Regression, for sentiment analysis and stock index prediction using financial news and the NGX All-Share Index data label. By leveraging advanced natural language processing models like GPT-4 and FinBERT, alongside a traditional machine learning model, Logistic Regression, we aim to classify market sentiment, generate sentiment scores, and predict market price movements. This research highlights global AI advancements in stock markets, showcasing how state-of-the-art language models can contribute to understanding complex financial data. The models were assessed using metrics such as accuracy, precision, recall, F1 score, and ROC AUC. Results indicate that Logistic Regression outperformed the more computationally intensive FinBERT and predefined approach of versatile GPT-4, with an accuracy of 81.83% and a ROC AUC of 89.76%. The GPT-4 predefined approach exhibited a lower accuracy of 54.19% but demonstrated strong potential in handling complex data. FinBERT, while offering more sophisticated analysis, was resource-demanding and yielded a moderate performance. Hyperparameter optimization using Optuna and cross-validation techniques ensured the robustness of the models. This study highlights the strengths and limitations of the practical applications of AI approaches in stock market prediction and presents Logistic Regression as the most efficient model for this task, with FinBERT and GPT-4 representing emerging tools with potential for future exploration and innovation in AI-driven financial analytics.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Energy price prediction using data-driven models: A decade review Preface
    Diaz, Josep
    Nesetril, Jarik
    COMPUTER SCIENCE REVIEW, 2021, 39
  • [22] Analysis and prediction in SCR experiments using GPT-4 with an effective chain-of-thought prompting strategy
    Lu, Muyu
    Gao, Fengyu
    Tang, Xiaolong
    Chen, Linjiang
    ISCIENCE, 2024, 27 (04)
  • [23] Sentiment Analysis of Unstructured Data Using Spark for Predicting Stock Market Price Movement
    Darji, Miss Dhara N.
    Parikh, Satyen M.
    Patel, Hiral R.
    INVENTIVE COMPUTATION AND INFORMATION TECHNOLOGIES, ICICIT 2021, 2022, 336 : 521 - 530
  • [24] Identifying new innovative services using M&A data: An integrated approach of data-driven morphological analysis
    Ha, Sohee
    Geum, Youngjung
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2022, 174
  • [25] A Deep Learning-Based LSTM for Stock Price Prediction Using Twitter Sentiment Analysis
    Ouf, Shimaa
    Hawary, Mona El
    Aboutabl, Amal
    Adel, Sherif
    International Journal of Advanced Computer Science and Applications, 2024, 15 (12) : 207 - 218
  • [26] Survey on Sentiment Analysis based Stock Prediction using Big data Analytics
    Balaji, S. Naveen
    Paul, P. Victer
    Saravanan, R.
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [27] B3 Stock Price Prediction Using LSTM Neural Networks and Sentiment Analysis
    Vargas, Gabriel M.
    Silvestre, Leonardo J.
    Rigo Jr, Luis O.
    Rocha, Helder R. O.
    IEEE LATIN AMERICA TRANSACTIONS, 2021, 20 (07) : 1067 - 1074
  • [28] Stock Price Prediction Using a Multivariate Multistep LSTM: A Sentiment and Public Engagement Analysis Model
    Aasi, Bipin
    Imtiaz, Syeda Aniqa
    Qadeer, Hamzah Arif
    Singarajah, Magdalean
    Kashef, Rasha
    2021 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2021, : 161 - 168
  • [29] S_I_LSTM: stock price prediction based on multiple data sources and sentiment analysis
    Wu, Shengting
    Liu, Yuling
    Zou, Ziran
    Weng, Tien-Hsiung
    CONNECTION SCIENCE, 2022, 34 (01) : 44 - 62
  • [30] Data-driven Approach for Equipment Reliability Prediction Using Neural Network
    Ding, Feng
    Han, Xingben
    PRECISION ENGINEERING AND NON-TRADITIONAL MACHINING, 2012, 411 : 563 - 566