Estimating Distributed Representation Performance in Disaster-Related Social Media Classification

被引：11

作者：

Jain, Pallavi ^{[1
]}

Ross, Robert ^{[1
]}

Schoen-Phelan, Bianca ^{[1
]}

机构：

[1] Technol Univ Dublin, Sch Comp Sci, Dublin, Ireland

来源：

PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2019) | 2019年

关键词：

Text classification; Twitter; Word Embedding; ELMo; BERT;

D O I：

10.1145/3341161.3343680

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper examines the effectiveness of a range of pre-trained language representations in order to determine the informativeness and information type of social media in the event of natural or man-made disasters. Within the context of disaster tweet analysis, we aim to accurately analyse tweets while minimising both false positive and false negatives in the automated information analysis. The investigation is performed across a number of well known disaster-related twitter datasets. Models that are built from pre-trained word embeddings from Word2Vec, GloVe, ELMo and BERT are used for performance evaluation. Given the relative ubiquity of BERT as a standout language representation in recent times it was expected that BERT dominates results. However, results are more diverse, with classical Word2Vec and GloVe both displaying strong results. As part of the analysis, we discuss some challenges related to automated twitter analysis including the fine-tuning of language models to disaster-related scenarios.

引用

页码：723 / 727

页数：5

共 50 条

[31] The performance of publicness in social media: tracing patterns in tweets after a disaster
Matheson, Donald
MEDIA CULTURE & SOCIETY, 2018, 40 (04) : 584 - 599
[32] Sentiment Classification of Cryptocurrency-Related Social Media Posts
Kulakowski, Mikolaj
Frasincar, Flavius
IEEE INTELLIGENT SYSTEMS, 2023, 38 (04) : 5 - 9
[33] MAM: Multimodel Attention Mechanism for Social Media Natural Disaster Management Tweet Classification
Sangeetha, M.
Devi, R. Manjula
Sharma, Bhisham
Chowdhury, Subrata
Ben Dhaou, Imed
2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
[34] Classification without (Proper) Representation: Political Heterogeneity in Social Media and Its Implications for Classification and Behavioral Analysis
Alkiek, Kenan
Zhang, Bohan
Jurgens, David
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 504 - 522
[35] Semantic labeling of social big media using distributed online robust classification
Sadigh, Alireza Naeimi
Bahraini, Tahereh
Yazdi, Hadi Sadoghi
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
[36] Sentiment Analysis for Code-Mixed Indian Social Media Text With Distributed Representation
Shalini, K.
Ganesh, Barathi H. B.
Kumar, Anand M.
Soman, K. P.
2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1126 - 1131
[37] Empirical Investigation of Work-Related Social Media Usage and Social-Related Social Media Usage on Employees' Work Performance
Dantas, Rui Miguel
Aftab, Hira
Aslam, Sumaira
Majeed, Muhammad Ussama
Correia, Anabela Batista
Qureshi, Hamza Ahmad
Lucas, Joao Luis
BEHAVIORAL SCIENCES, 2022, 12 (08)
[38] Internal or external social media? The effects of work-related and social-related use of social media on improving employee performance
Chen, Xiayu
Ou, Carol Xiaojuan
Davison, Robert M.
INTERNET RESEARCH, 2022, 32 (03) : 680 - 707
[39] Performance evaluation of NLP and CNN models for disaster detection using social media data
Islam, Md. Azharul
Rabbi, Fazla
Hossain, Niamat Ullah Ibne
SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
[40] user2Vec: Social Media User Representation Based on Distributed Document Embeddings
Hallac, Ibrahim R.
Makinist, Semiha
Ay, Betul
Aydin, Galip
2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,

← 1 2 3 4 5 →