LLMs to the Moon? Reddit Market Sentiment Analysis with Large Language Models

被引：13

作者：

Deng, Xiang ^{[1
,4
]}

Bashlovkina, Vasilisa ^{[2
]}

Han, Feng ^{[2
]}

Baumgartner, Simon ^{[2
]}

Bendersky, Michael ^{[3
]}

机构：

[1] Ohio State Univ, Columbus, OH 43210 USA

[2] Google Res, NYC, New York, NY USA

[3] Google Res, Mountain View, CA USA

[4] Google, Mountain View, CA 94043 USA

来源：

COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023 | 2023年

关键词：

Sentiment Analysis; Social Media; Finance; Large Language Model; Natural Language Processing; TEXTUAL ANALYSIS;

D O I：

10.1145/3543873.3587605

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Market sentiment analysis on social media content requires knowledge of both financial markets and social media jargon, which makes it a challenging task for human raters. The resulting lack of high-quality labeled data stands in the way of conventional supervised learning methods. In this work, we conduct a case study approaching this problem with semi-supervised learning using a large language model (LLM). We select Reddit as the target social media platform due to its broad coverage of topics and content types. Our pipeline first generates weak financial sentiment labels for Reddit posts with an LLM and then uses that data to train a small model that can be served in production. We find that prompting the LLM to produce Chain-of-Thought summaries and forcing it through several reasoning paths helps generate more stable and accurate labels, while training the student model using a regression loss further improves distillation quality. With only a handful of prompts, the final model performs on par with existing supervised models. Though production applications of our model are limited by ethical considerations, the model's competitive performance points to the great potential of using LLMs for tasks that otherwise require skill-intensive annotation.

引用

页码：1014 / 1019

页数：6

共 50 条

[1] What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis
Deng, Xiang
Bashlovkina, Vasilisa
Han, Feng
Baumgartner, Simon
Bendersky, Michael
COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 107 - 110
[2] What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis
Deng, Xiang
Bashlovkina, Vasilisa
Han, Feng
Baumgartner, Simon
Bendersky, Michael
ACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023, 2023, : 107 - 110
[3] linguagem grande (LLMs) Linguistic ambiguity analysis in large language models (LLMs)
Moraes, Lavinia de Carvalho
Silverio, Irene Cristina
Marques, Rafael Alexandre Sousa
Anaia, Bianca de Castro
de Paula, Dandara Freitas
Faria, Maria Carolina Schincariol de
Cleveston, Iury
Correia, Alana de Santana
Freitag, Raquel Meister Ko
TEXTO LIVRE-LINGUAGEM E TECNOLOGIA, 2025, 18
[4] FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models
Delgadillo, Josiel
Kinyua, Johnson
Mutigwe, Charles
BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (08)
[5] Sentiment Analysis of Song Dynasty Classical Poetry Using Fine-Tuned Large Language Models: A Study with LLMs
Ihnaini, Baha
Sun, Weiyi
Cai, Yingchao
Xu, Zhijun
Sangi, Rashid
2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 590 - 597
[6] Lower Energy Large Language Models (LLMs)
Lin, Hsiao-Ying
Voas, Jeffrey
COMPUTER, 2023, 56 (10) : 14 - 16
[7] Towards Safer Large Language Models (LLMs)
Lawrence, Carolin
Bifulco, Roberto
Gashteovski, Kiril
Hung, Chia-Chien
Ben Rim, Wiem
Shaker, Ammar
Oyamada, Masafumi
Sadamasa, Kunihiko
Enomoto, Masafumi
Takeoka, Kunihiro
NEC Technical Journal, 2024, 17 (02): : 64 - 74
[8] LARGE LANGUAGE MODELS (LLMS) AND CHATGPT FOR BIOMEDICINE
Arighi, Cecilia
Brenner, Steven
Lu, Zhiyong
BIOCOMPUTING 2024, PSB 2024, 2024, : 641 - 644
[9] Large language models (LLMs) and the institutionalization of misinformation
Garry, Maryanne
Chan, Way Ming
Foster, Jeffrey
Henkel, Linda A.
TRENDS IN COGNITIVE SCIENCES, 2024, 28 (12) : 1078 - 1088
[10] Large Language Models in Targeted Sentiment Analysis for Russian
Rusnachenko, N.
Golubev, A.
Loukachevitch, N.
LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (07) : 3148 - 3158

← 1 2 3 4 5 →