Multi-dimensional Classification on Social Media Data for Detailed Reporting with Large Language Models

被引:2
|
作者
Cantini, Riccardo [1 ]
Cosentino, Cristian [1 ]
Marozzo, Fabrizio [1 ]
机构
[1] Univ Calabria, Arcavacata Di Rende, CS, Italy
关键词
Large Language Models; Deep Learning; Natural Language Processing; ChatGPT; Social media data; Reporting;
D O I
10.1007/978-3-031-63215-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Every day, more and more people harness the power of social media platforms to express their thoughts, share information and personal experiences, and engage with others. All this knowledge can then be transformed into informative reports with the assistance of Large Language Models (LLMs), like ChatGPT, which leverage deep learning techniques to analyze data and generate comprehensive analyses. By effectively classifying user-generated posts based on dimensions such as topic, sentiment, and emotion, it is possible to create even more detailed reports by carefully condensing large amounts of data collected along the different dimensions considered. To tackle this challenge, we have developed an automated approach with two primary goals: (i) categorizing posts across different dimensions using ready-to-use and fine-tuned classifiers; and (ii) generating detailed reports via LLMs that summarize posts with similar characteristics along the defined dimensions. In our analysis, we examined a large and varied set of posts about COVID, classifying them along several dimensions, including topic, content type, expressed sentiment and emotions, and reliability of information. Specifically, by choosing to generate a report for the main discussion topics present in the dataset, such as allergic reactions or school issues, and using the remaining dimensions for post classification, we successfully created highly detailed and informative reports with ChatGPT. These reports outperformed those generated directly by ChatGPT, in both quantitative measures such as linguistic scores and qualitative evaluations by field experts.
引用
收藏
页码:100 / 114
页数:15
相关论文
共 50 条
  • [1] A Multi-Dimensional Analysis and Data Cube for Unstructured Text and Social Media
    Lee, Suan
    Kim, Namsoo
    Kim, Jinho
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 761 - 764
  • [2] Understanding Sarcoidosis Using Large Language Models and Social Media Data
    Xi, Nan Miles
    Ji, Hong-Long
    Wang, Lin
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2024,
  • [3] Parallel indexing of large multi-dimensional data
    Funaki, Kenta
    Hochin, Teruhisa
    Nomiya, Hiroki
    Nakanishi, Hideya
    Kojima, Mamoru
    2013 SECOND IIAI INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2013), 2013, : 324 - 329
  • [4] Evaluating large language models for health-related text classification tasks with public social media data
    Guo, Yuting
    Ovadje, Anthony
    Al-Garadi, Mohammed Ali
    Sarker, Abeed
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2181 - 2189
  • [5] A multi-dimensional data organization for natural language processing
    Cheng, Kam-Hoi
    Faris, Waleed
    Journal of Computational Methods in Sciences and Engineering, 2009, 9 (SUPPL.1)
  • [6] Exploring register variation on Reddit A multi-dimensional study of language use on a social media website
    Liimatta, Aatu
    REGISTER STUDIES, 2019, 1 (02) : 269 - 295
  • [7] Exploring ChatGPT as a virtual tutor: A multi-dimensional analysis of large language models in academic support
    Al-Abri, Abdullah
    EDUCATION AND INFORMATION TECHNOLOGIES, 2025,
  • [8] Kicking Prejudice: Large Language Models for Racism Classification in Soccer Discourse on Social Media
    Santos, Guto Leoni
    dos Santos, Vitor Gaboardi
    Kearns, Colm
    Sinclair, Gary
    Black, Jaack
    Doidge, Mark
    Fletcher, Thomas
    Kilvington, Dan
    Endo, Patricia Takako
    Listod, Katie
    Lynn, Theo
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2024, 2024, 14663 : 547 - 562
  • [9] A Multi-dimensional study on Bias in Vision-Language models
    Ruggeri, Gabriele
    Nozza, Debora
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 6445 - 6455
  • [10] MDER: Multi-Dimensional Event Recommendation in Social Media Context
    Troudi, Abir
    Ghorbel, Leila
    Zayani, Corinne Amel
    Jamoussi, Salma
    Amous, Ikram
    COMPUTER JOURNAL, 2021, 64 (03): : 369 - 382