Twitter Discussions and Emotions About the COVID-19 Pandemic: Machine Learning Approach

被引:207
|
作者
Xue, Jia [1 ,2 ]
Chen, Junxiang [3 ]
Hu, Ran [1 ]
Chen, Chen [4 ]
Zheng, Chengda [2 ]
Su, Yue [5 ,6 ]
Zhu, Tingshao [5 ]
机构
[1] Univ Toronto, Factor Inwentash Fac Social Work, Toronto, ON, Canada
[2] Univ Toronto, Fac Informat, Toronto, ON, Canada
[3] Univ Pittsburgh, Sch Med, Pittsburgh, PA USA
[4] Univ Toronto, Middleware Syst Res Grp, Toronto, ON, Canada
[5] Chinese Acad Sci, Inst Psychol, CAS Key Lab Behav Sci, 16 Lincui Rd, Beijing 100101, Peoples R China
[6] Univ Chinese Acad Sci, Dept Psychol, Beijing, Peoples R China
关键词
machine learning; Twitter data; COVID-19; infodemic; infodemiology; infoveillance; public discussion; public sentiment; Twitter; social media; virus; SOCIAL MEDIA; SENTIMENT;
D O I
10.2196/20550
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: It is important to measure the public response to the COVID-19 pandemic. Twitter is an important data source for infodemiology studies involving public response monitoring. Objective: The objective of this study is to examine COVID-19-related discussions, concerns, and sentiments using tweets posted by Twitter users. Methods: We analyzed 4 million Twitter messages related to the COVID-19 pandemic using a list of 20 hashtags (eg, "coronavirus," "COVID-19," "quarantine") from March 7 to April 21, 2020. We used a machine learning approach, Latent Dirichlet Allocation (LDA), to identify popular unigrams and bigrams, salient topics and themes, and sentiments in the collected tweets. Results: Popular unigrams included "virus," "lockdown," and "quarantine." Popular bigrams included "COVID-19," "stay home," "corona virus," "social distancing," and "new cases." We identified 13 discussion topics and categorized them into 5 different themes: (1) public health measures to slow the spread of COVID-19, (2) social stigma associated with COVID-19, (3) COVID-19 news, cases, and deaths, (4) COVID-19 in the United States, and (5) COVID-19 in the rest of the world. Across all identified topics, the dominant sentiments for the spread of COVID-19 were anticipation that measures can be taken, followed by mixed feelings of trust, anger, and fear related to different topics. The public tweets revealed a significant feeling of fear when people discussed new COVID-19 cases and deaths compared to other topics. Conclusions: This study showed that Twitter data and machine learning approaches can be leveraged for an infodemiology study, enabling research into evolving public discussions and sentiments during the COVID-19 pandemic. As the situation rapidly evolves, several topics are consistently dominant on Twitter, such as confirmed cases and death rates, preventive measures, health authorities and government policies, COVID-19 stigma, and negative psychological reactions (eg, fear). Real-time monitoring and assessment of Twitter discussions and concerns could provide useful data for public health emergency responses and planning Pandemic-related fear, stigma, and mental health concerns are already evident and may continue to influence public trust when a second wave of COVID-19 occurs or there is a new surge of the current pandemic.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Machine Learning Approach to Detect Fake News, Misinformation in COVID-19 Pandemic
    Bojjireddy, Sirisha
    Chun, Soon Ae
    Geller, James
    PROCEEDINGS OF THE 22ND ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2021, 2021, : 575 - 578
  • [22] Psychotherapists' acceptance of telepsychotherapy during the COVID-19 pandemic: A machine learning approach
    Bekes, Vera
    Aafjes-van Doorn, Katie
    Zilcha-Mano, Sigal
    Prout, Tracy
    Hoffman, Leon
    CLINICAL PSYCHOLOGY & PSYCHOTHERAPY, 2021, 28 (06) : 1403 - 1415
  • [23] Twitter data sentiment analysis of tourism in Thailand during the COVID-19 pandemic using machine learning
    Leelawat, Natt
    Jariyapongpaiboon, Sirawit
    Promjun, Arnon
    Boonyarak, Samit
    Saengtabtim, Kumpol
    Laosunthara, Ampan
    Yudha, Alfan Kurnia
    Tang, Jing
    HELIYON, 2022, 8 (10)
  • [24] Prediction of the COVID-19 pandemic with Machine Learning Models
    Sruthi, P. Lakshmi
    Raju, K. Butchi
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 474 - 481
  • [25] Significant Applications of Machine Learning for COVID-19 Pandemic
    Kushwaha, Shashi
    Bahl, Shashi
    Bagha, Ashok Kumar
    Parmar, Kulwinder Singh
    Javaid, Mohd
    Haleem, Abid
    Singh, Ravi Pratap
    JOURNAL OF INDUSTRIAL INTEGRATION AND MANAGEMENT-INNOVATION AND ENTREPRENEURSHIP, 2020, 5 (04): : 453 - 479
  • [26] Analyzing Discussions Around Rural Health on Twitter During the COVID-19 Pandemic: Social Network Analysis of Twitter Data
    Ahmed, Wasim
    Vidal-Alaball, Josep
    Llobet, Josep Maria Vilaseca
    JMIR INFODEMIOLOGY, 2023, 3 (01):
  • [27] Resuscitation discussions during the COVID-19 pandemic
    Lodwick, E.
    ANAESTHESIA, 2021, 76 : 151 - 151
  • [28] Twitter Opinion Mining on COVID-19 Vaccinations by Machine Learning Presence
    Islam, Md Babul
    Hasibunnahar, Swarna
    Shukla, Piyush Kumar
    Shukla, Prashant Kumar
    Rawat, Paresh
    Dange, Jyoti
    PROCEEDINGS OF THIRD DOCTORAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE, DOSCI 2022, 2023, 479 : 37 - 55
  • [29] The COVID-19 pandemic, Twitter and science communication
    Berro, Maximiliano
    REVISTA MEDICA DEL URUGUAY, 2021, 37 (03):
  • [30] Political Partisanship and Antiscience Attitudes in Online Discussions About COVID-19: Twitter Content Analysis
    Rao, Ashwin
    Morstatter, Fred
    Hu, Minda
    Chen, Emily
    Burghardt, Keith
    Ferrara, Emilio
    Lerman, Kristina
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (06)