Leveraging machine learning to analyze sentiment from COVID-19 tweets: A global perspective

被引:7
|
作者
Rahman, Md Mahbubar [1 ]
Khan, Nafiz Imtiaz [1 ]
Sarker, Iqbal H. [2 ]
Ahmed, Mohiuddin [3 ]
Islam, Muhammad Nazrul [1 ]
机构
[1] Mil Inst Sci & Technol MIST, Dept Comp Sci & Engn, Dhaka 1216, Bangladesh
[2] Chittagong Univ Engn & Technol, Dept Comp Sci & Engn, Chittagong, Bangladesh
[3] Edith Cowan Univ, Sch Sci, Joondalup, WA, Australia
关键词
coronavirus; COVID-19; deep neural network; machine learning; outbreak; pandemic; prediction; sentiment analysis; social media; INTERRATER RELIABILITY;
D O I
10.1002/eng2.12572
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Since the advent of the worldwide COVID-19 pandemic, analyzing public sentiment has become one of the major concerns for policy and decision-makers. While the priority is to curb the spread of the virus, mass population (user) sentiment analysis is equally important. Though sentiment analysis using different state-of-the-art technologies has been focused on during the COVID-19 pandemic, the reasons behind the variations in public sentiment are yet to be explored. Moreover, how user sentiment varies due to the COVID-19 pandemic from a cross-country perspective has been less focused on. Therefore, the objectives of this study are: to identify the most effective machine learning (ML) technique for classifying public sentiments, to analyze the variations of public sentiment across the globe, and to find the critical contributing factors to sentiment variations. To attain the objectives, 12,000 tweets, 3000 each from the USA, UK, and Bangladesh, were rigorously annotated by three independent reviewers. Based on the labeled tweets, four different boosting ML models, namely, CatBoost, gradient boost, AdaBoost, and XGBoost, are investigated. Next, the top performed ML model predicted sentiment of 300,000 data (100,000 from each country). The public perceptions have been analyzed based on the labeled data. As an outcome, the CatBoost model showed the highest (85.8%) F1-score, followed by gradient boost (84.3%), AdaBoost (78.9%), and XGBoost (83.1%). Second, it was revealed that during the time of the COVID-19 pandemic, the sentiments of the people of the three countries mainly were negative, followed by positive and neutral. Finally, this study identified a few critical concerns that impact primarily varying public sentiment around the globe: lockdown, quarantine, hospital, mask, vaccine, and the like.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Sentiment analysis of Indian Tweets about Covid-19 vaccines
    Mir, Aasif Ahmad
    Sevukan, Rathinam
    JOURNAL OF INFORMATION SCIENCE, 2024, 50 (05) : 1308 - 1320
  • [32] Machine Learning and Deep Learning Approaches to Analyze and Detect COVID-19: A Review
    Aishwarya T.
    Ravi Kumar V.
    SN Computer Science, 2021, 2 (3)
  • [33] Comparative analysis of machine learning-based classification models using sentiment classification of tweets related to COVID-19 pandemic
    Gulati, Kamal
    Kumar, S. Saravana
    Boddu, Raja Sarath Kumar
    Sarvakar, Ketan
    Sharma, Dilip Kumar
    Nomani, M. Z. M.
    MATERIALS TODAY-PROCEEDINGS, 2022, 51 : 38 - 41
  • [34] Sentiment Analysis of COVID-19 Tweets Using Deep Learning and Lexicon-Based Approaches
    Ainapure, Bharati Sanjay
    Pise, Reshma Nitin
    Reddy, Prathiba
    Appasani, Bhargav
    Srinivasulu, Avireni
    Khan, Mohammad S. S.
    Bizon, Nicu
    SUSTAINABILITY, 2023, 15 (03)
  • [35] Arabic Tweets Sentiment Analysis about Online Learning during COVID-19 in Saudi Arabia
    Althagafi, Asma
    Althobaiti, Ghofran
    Alhakami, Hosam
    Alsubait, Tahani
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 620 - 625
  • [36] A novel fusion-based deep learning model for sentiment analysis of COVID-19 tweets
    Basiri, Mohammad Ehsan
    Nemati, Shahla
    Abdar, Moloud
    Asadi, Somayeh
    Acharrya, U. Rajendra
    KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [37] Modeling the Spread of COVID-19 by Leveraging Machine and Deep Learning Models
    Adnan, Muhammad
    Altalhi, Maryam
    Alarood, Ala Abdulsalam
    Uddin, M. Irfan
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (03): : 1857 - 1872
  • [38] Sentiment analysis of tweets about COVID-19 disease during pandemic
    Matosevic, Goran
    Bevanda, Vanja
    2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 1290 - 1295
  • [39] Sentiment and emotion trends in nurses' tweets about the COVID-19 pandemic
    Xavier, Teenu
    Lambert, Joshua
    JOURNAL OF NURSING SCHOLARSHIP, 2022, 54 (05) : 613 - 622
  • [40] Analysis and Prediction of User Sentiment on COVID-19 Pandemic Using Tweets
    Yeasmin, Nilufa
    Mahbub, Nosin Ibna
    Baowaly, Mrinal Kanti
    Singh, Bikash Chandra
    Alom, Zulfikar
    Aung, Zeyar
    Azim, Mohammad Abdul
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (02)