A Machine Learning-Sentiment Analysis on Monkeypox Outbreak: An Extensive Dataset to Show the Polarity of Public Opinion From Twitter Tweets

被引:22
|
作者
Bengesi, Staphord [1 ]
Oladunni, Timothy [2 ]
Olusegun, Ruth [1 ]
Audu, Halima [1 ]
机构
[1] Bowie State Univ, Dept Comp Sci, Bowie, MD 20715 USA
[2] Morgan State Univ, Dept Comp Sci, Baltimore, MD 21251 USA
关键词
Social networking (online); Sentiment analysis; Blogs; Classification algorithms; Computational modeling; Machine learning; Count vectorizer; machine learning algorithm; monkeypox; sentiment analysis; twitter; TF-IDF; TextBlob; Vader;
D O I
10.1109/ACCESS.2023.3242290
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research on sentiment analysis has proven to be very useful in public health, particularly in analyzing infectious diseases. As the world recovers from the onslaught of the COVID-19 pandemic, concerns are rising that another pandemic, known as monkeypox, might hit the world again. Monkeypox is an infectious disease reported in over 73 countries across the globe. This sudden outbreak has become a major concern for many individuals and health authorities. Different social media channels have presented discussions, views, opinions, and emotions about the monkeypox outbreak. Social media sentiments often result in panic, misinformation, and stigmatization of some minority groups. Therefore, accurate information, guidelines, and health protocols related to this virus are critical. We aim to analyze public sentiments on the recent monkeypox outbreak, with the purpose of helping decision-makers gain a better understanding of the public perceptions of the disease. We hope that government and health authorities will find the work useful in crafting health policies and mitigating strategies to control the spread of the disease, and guide against its misrepresentations. Our study was conducted in two stages. In the first stage, we collected over 500,000 multilingual tweets related to the monkeypox post on Twitter and then performed sentiment analysis on them using VADER and TextBlob, to annotate the extracted tweets into positive, negative, and neutral sentiments. The second stage of our study involved the design, development, and evaluation of 56 classification models. Stemming and lemmatization techniques were used for vocabulary normalization. Vectorization was based on CountVectorizer and TF-IDF methodologies. K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Random Forest, Logistic Regression, Multilayer Perceptron (MLP), Naive Bayes, and XGBoost were deployed as learning algorithms. Performance evaluation was based on accuracy, F1 Score, Precision, and Recall. Our experimental results showed that the model developed using TextBlob annotation + Lemmatization + CountVectorizer + SVM yielded the highest accuracy of about 0.9348.
引用
收藏
页码:11811 / 11826
页数:16
相关论文
共 24 条
  • [1] Public sentiment on the global outbreak of monkeypox: an unsupervised machine learning analysis of 352,182 twitter posts
    Ng, Q. X.
    Yau, C. E.
    Lim, Y. L.
    Wong, L. K. T.
    Liew, T. M.
    PUBLIC HEALTH, 2022, 213 : 1 - 4
  • [2] Opinion Mining and Sentiment Study of Tweets Polarity Using Machine Learning
    Mridula, A.
    Kavitha, C. R.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 621 - 626
  • [4] Sentiment Analysis of Sindhi Tweets Dataset using Supervised Machine Learning Techniques
    Hammad, Muhammad
    Anwar, Haris
    2019 22ND IEEE INTERNATIONAL MULTI TOPIC CONFERENCE (INMIC), 2019, : 108 - 113
  • [5] Machine Learning, Sentiment Analysis, and Tweets: An Examination of Alzheimer's Disease Stigma on Twitter
    Oscar, Nels
    Fox, Pamela A.
    Croucher, Racheal
    Wernick, Riana
    Keune, Jessica
    Hooker, Karen
    JOURNALS OF GERONTOLOGY SERIES B-PSYCHOLOGICAL SCIENCES AND SOCIAL SCIENCES, 2017, 72 (05): : 742 - 751
  • [6] Public opinion and Chinese exports: evidence from Twitter sentiment analysis
    Deng, Yuping
    Wang, Haicheng
    Wu, Yanrui
    JOURNAL OF THE ASIA PACIFIC ECONOMY, 2024,
  • [7] Public perspectives of monkeypox in Twitter: A social media analysis using machine learning
    Farahat, Ramadan Abdelmoez
    Yassin, Mohammed Abdelwahab
    Al-Tawfiq, Jaffar A.
    Bejan, Cosmin A.
    Abdelazeem, Basel
    NEW MICROBES AND NEW INFECTIONS, 2022, 49-50
  • [8] Monkeypox Outbreak Analysis: An Extensive Study Using Machine Learning Models and Time Series Analysis
    Priyadarshini, Ishaani
    Mohanty, Pinaki
    Kumar, Raghvendra
    Taniar, David
    COMPUTERS, 2023, 12 (02)
  • [9] Detecting Cyberbullying from Tweets Through Machine Learning Techniques with Sentiment Analysis
    Atoum, Jalal Omer
    ADVANCES IN INFORMATION AND COMMUNICATION, FICC, VOL 2, 2023, 652 : 25 - 38
  • [10] Twitter Sentiment Analysis Based Public Emotion Detection using Machine Learning Algorithms
    Fahim, Safa
    Imran, Azhar
    Alzahrani, Abdulkareem
    Fahim, Marwa
    Alheeti, Khattab M. Ali
    Alfateh, Muhammad
    2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 107 - 112