LLMs to the Moon? Reddit Market Sentiment Analysis with Large Language Models

被引:13
|
作者
Deng, Xiang [1 ,4 ]
Bashlovkina, Vasilisa [2 ]
Han, Feng [2 ]
Baumgartner, Simon [2 ]
Bendersky, Michael [3 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Google Res, NYC, New York, NY USA
[3] Google Res, Mountain View, CA USA
[4] Google, Mountain View, CA 94043 USA
关键词
Sentiment Analysis; Social Media; Finance; Large Language Model; Natural Language Processing; TEXTUAL ANALYSIS;
D O I
10.1145/3543873.3587605
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Market sentiment analysis on social media content requires knowledge of both financial markets and social media jargon, which makes it a challenging task for human raters. The resulting lack of high-quality labeled data stands in the way of conventional supervised learning methods. In this work, we conduct a case study approaching this problem with semi-supervised learning using a large language model (LLM). We select Reddit as the target social media platform due to its broad coverage of topics and content types. Our pipeline first generates weak financial sentiment labels for Reddit posts with an LLM and then uses that data to train a small model that can be served in production. We find that prompting the LLM to produce Chain-of-Thought summaries and forcing it through several reasoning paths helps generate more stable and accurate labels, while training the student model using a regression loss further improves distillation quality. With only a handful of prompts, the final model performs on par with existing supervised models. Though production applications of our model are limited by ethical considerations, the model's competitive performance points to the great potential of using LLMs for tasks that otherwise require skill-intensive annotation.
引用
收藏
页码:1014 / 1019
页数:6
相关论文
共 50 条
  • [1] What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis
    Deng, Xiang
    Bashlovkina, Vasilisa
    Han, Feng
    Baumgartner, Simon
    Bendersky, Michael
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 107 - 110
  • [2] What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis
    Deng, Xiang
    Bashlovkina, Vasilisa
    Han, Feng
    Baumgartner, Simon
    Bendersky, Michael
    ACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023, 2023, : 107 - 110
  • [3] linguagem grande (LLMs) Linguistic ambiguity analysis in large language models (LLMs)
    Moraes, Lavinia de Carvalho
    Silverio, Irene Cristina
    Marques, Rafael Alexandre Sousa
    Anaia, Bianca de Castro
    de Paula, Dandara Freitas
    Faria, Maria Carolina Schincariol de
    Cleveston, Iury
    Correia, Alana de Santana
    Freitag, Raquel Meister Ko
    TEXTO LIVRE-LINGUAGEM E TECNOLOGIA, 2025, 18
  • [4] FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models
    Delgadillo, Josiel
    Kinyua, Johnson
    Mutigwe, Charles
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (08)
  • [5] Sentiment Analysis of Song Dynasty Classical Poetry Using Fine-Tuned Large Language Models: A Study with LLMs
    Ihnaini, Baha
    Sun, Weiyi
    Cai, Yingchao
    Xu, Zhijun
    Sangi, Rashid
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 590 - 597
  • [6] Lower Energy Large Language Models (LLMs)
    Lin, Hsiao-Ying
    Voas, Jeffrey
    COMPUTER, 2023, 56 (10) : 14 - 16
  • [7] Towards Safer Large Language Models (LLMs)
    Lawrence, Carolin
    Bifulco, Roberto
    Gashteovski, Kiril
    Hung, Chia-Chien
    Ben Rim, Wiem
    Shaker, Ammar
    Oyamada, Masafumi
    Sadamasa, Kunihiko
    Enomoto, Masafumi
    Takeoka, Kunihiro
    NEC Technical Journal, 2024, 17 (02): : 64 - 74
  • [8] LARGE LANGUAGE MODELS (LLMS) AND CHATGPT FOR BIOMEDICINE
    Arighi, Cecilia
    Brenner, Steven
    Lu, Zhiyong
    BIOCOMPUTING 2024, PSB 2024, 2024, : 641 - 644
  • [9] Large language models (LLMs) and the institutionalization of misinformation
    Garry, Maryanne
    Chan, Way Ming
    Foster, Jeffrey
    Henkel, Linda A.
    TRENDS IN COGNITIVE SCIENCES, 2024, 28 (12) : 1078 - 1088
  • [10] Large Language Models in Targeted Sentiment Analysis for Russian
    Rusnachenko, N.
    Golubev, A.
    Loukachevitch, N.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (07) : 3148 - 3158