FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models

被引:1
|
作者
Delgadillo, Josiel [1 ]
Kinyua, Johnson [2 ]
Mutigwe, Charles [3 ]
机构
[1] Univ Penn, Sch Engn & Appl Sci, Philadelphia, PA 19104 USA
[2] Penn State Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA
[3] Western New England Univ, Coll Business, Springfield, MA 01119 USA
关键词
BERT; financial markets; Twitter/X; StockTwits; sentiment analysis; LLM; social media;
D O I
10.3390/bdcc8080087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting the directions of financial markets has been performed using a variety of approaches, and the large volume of unstructured data generated by traders and other stakeholders on social media microblog platforms provides unique opportunities for analyzing financial markets using additional perspectives. Pretrained large language models (LLMs) have demonstrated very good performance on a variety of sentiment analysis tasks in different domains. However, it is known that sentiment analysis is a very domain-dependent NLP task that requires knowledge of the domain ontology, and this is particularly the case with the financial domain, which uses its own unique vocabulary. Recent developments in NLP and deep learning including LLMs have made it possible to generate actionable financial sentiments using multiple sources including financial news, company fundamentals, technical indicators, as well social media microblogs posted on platforms such as StockTwits and X (formerly Twitter). We developed a financial social media sentiment analyzer (FinSoSent), which is a domain-specific large language model for the financial domain that was pretrained on financial news articles and fine-tuned and tested using several financial social media corpora. We conducted a large number of experiments using different learning rates, epochs, and batch sizes to yield the best performing model. Our model outperforms current state-of-the-art FSA models based on over 860 experiments, demonstrating the efficacy and effectiveness of FinSoSent. We also conducted experiments using ensemble models comprising FinSoSent and the other current state-of-the-art FSA models used in this research, and a slight performance improvement was obtained based on majority voting. Based on the results obtained across all models in these experiments, the significance of this study is that it highlights the fact that, despite the recent advances of LLMs, sentiment analysis even in domain-specific contexts remains a difficult research problem.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Sentiment and Emotion Analysis with Large Language Models for Political Security Prediction Framework
    Zaabar, Liyana Safra
    Yacob, Adriana Arul
    Isa, Mohd Rizal Mohd
    Wook, Muslihah
    Abdullah, Nor Asiakin
    Ramli, Suzaimah
    Razali, Noor Afiza Mat
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 954 - 960
  • [32] Entropy Based Measure Sentiment Analysis in the Financial Market
    Song, Qiang
    Almahdi, Saud
    Yang, Steve Y.
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 301 - 305
  • [33] Sentiment Analysis Based on Financial Tweets and Market Information
    Ao, Shen
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 321 - 326
  • [34] Sentiment as a shipping market predictor: Testing market-specific language models
    Sui, Cong
    Wang, Shuhan
    Zheng, Wei
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 189
  • [35] Application of the Deep Pretrained Language Model Processing Method in Social Network Sentiment Analysis
    Yana, A.
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [36] Financial Context News Sentiment Analysis for the Lithuanian Language
    Strimaitis, Rokas
    Stefanovic, Pavel
    Ramanauskaite, Simona
    Slotkiene, Asta
    APPLIED SCIENCES-BASEL, 2021, 11 (10):
  • [37] Application of the Deep Pretrained Language Model Processing Method in Social Network Sentiment Analysis
    Yana, A.
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [38] Training Large-Scale News Recommenders with Pretrained Language Models in the Loop
    Xiao, Shitao
    Liu, Zheng
    Shao, Yingxia
    Di, Tao
    Middha, Bhuvan
    Wu, Fangzhao
    Xie, Xing
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4215 - 4225
  • [39] Multimodal Clinical Prediction with Unified Prompts and Pretrained Large-Language Models
    Winston, Caleb
    Winston, Chloe
    Winston, Cailin
    Winston, Claris
    Winston, Cleah
    2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 679 - 683
  • [40] Large language models for newspaper sentiment analysis during COVID-19: The Guardian
    Chandra, Rohitash
    Zhu, Baicheng
    Fang, Qingying
    Shinjikashvili, Eka
    APPLIED SOFT COMPUTING, 2025, 171