Estimating Distributed Representation Performance in Disaster-Related Social Media Classification

Cited by: 11
Authors
Jain, Pallavi [1]
Ross, Robert [1]
Schoen-Phelan, Bianca [1]
Affiliations
[1] Technol Univ Dublin, Sch Comp Sci, Dublin, Ireland
Source
PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2019) | 2019
Keywords
Text classification; Twitter; Word Embedding; ELMo; BERT;
DOI
10.1145/3341161.3343680
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper examines the effectiveness of a range of pre-trained language representations for determining the informativeness and information type of social media posts during natural or man-made disasters. Within the context of disaster tweet analysis, we aim to analyse tweets accurately while minimising both false positives and false negatives in the automated information analysis. The investigation is performed across a number of well-known disaster-related Twitter datasets. Models built from pre-trained word embeddings from Word2Vec, GloVe, ELMo and BERT are used for performance evaluation. Given BERT's recent prominence as a language representation, it was expected to dominate the results. However, the results are more diverse, with the classical Word2Vec and GloVe embeddings both performing strongly. As part of the analysis, we discuss some challenges related to automated Twitter analysis, including the fine-tuning of language models to disaster-related scenarios.
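The abstract's goal of keeping both false positives and false negatives low is commonly measured with precision, recall and F1. The sketch below is only an illustration under that assumption: the record does not state which metrics the paper reports, and the function name `precision_recall_f1` and the toy labels are hypothetical, not taken from the paper.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall and F1 for one class (illustrative helper)."""
    pairs = list(zip(y_true, y_pred))
    # True positives: informative tweets correctly flagged as informative.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    # False positives: uninformative tweets wrongly flagged as informative.
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # False negatives: informative tweets the classifier missed.
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy labels (1 = informative, 0 = not informative); assumed data, not from the paper.
y_true = [1, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 0]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(precision, recall, f1)
```

Precision falls as false positives rise and recall falls as false negatives rise, so F1 (their harmonic mean) is one way to compare embedding-based classifiers on both error types at once.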
Pages: 723-727
Number of pages: 5