Trend and Co-occurrence Network of COVID-19 Symptoms From Large-Scale Social Media Data: Infoveillance Study

Cited by: 6
Authors
Wu, Jiageng [1,2,3]
Wang, Lumin [1,2,3]
Hua, Yining [4,5]
Li, Minghui [1,2,3]
Zhou, Li [4,5]
Bates, David W. [4,5]
Yang, Jie [1,2,3]
Affiliations
[1] Zhejiang Univ, Sch Publ Hlth, Sch Med, 866 Yuhangtang Rd, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, Affiliated Hosp 2, Sch Med, 866 Yuhangtang Rd, Hangzhou 310058, Peoples R China
[3] Key Lab Intelligent Prevent Med Zhejiang Prov, Hangzhou, Peoples R China
[4] Harvard Med Sch, Dept Biomed Informat, Boston, MA USA
[5] Brigham & Womens Hosp, Div Gen Internal Med & Primary Care, Boston, MA USA
Funding
National Institutes of Health (United States);
Keywords
social media; network analysis; public health; data mining; COVID-19;
DOI
10.2196/45419
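Chinese Library Classification number
R19 [Health care organization and services (health services administration)];
Discipline classification code
Abstract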
Background: For an emergent pandemic, such as COVID-19, the statistics of symptoms based on hospital data may be biased or delayed due to the high proportion of asymptomatic or mild-symptom infections that are not recorded in hospitals. Meanwhile, the difficulty in accessing large-scale clinical data also limits many researchers from conducting timely research.

Objective: Given the wide coverage and promptness of social media, this study aimed to present an efficient workflow to track and visualize the dynamic characteristics and co-occurrence of symptoms for the COVID-19 pandemic from large-scale and long-term social media data.

Methods: This retrospective study included 471,553,966 COVID-19-related tweets from February 1, 2020, to April 30, 2022. We curated a hierarchical symptom lexicon for social media containing 10 affected organs/systems, 257 symptoms, and 1808 synonyms. The dynamic characteristics of COVID-19 symptoms over time were analyzed from the perspectives of weekly new cases, overall distribution, and temporal prevalence of reported symptoms. The symptom evolutions between virus strains (Delta and Omicron) were investigated by comparing the symptom prevalence during their dominant periods. A co-occurrence symptom network was developed and visualized to investigate inner relationships among symptoms and affected body systems.

Results: This study identified 201 COVID-19 symptoms and grouped them into 10 affected body systems. There was a significant correlation between the weekly quantity of self-reported symptoms and new COVID-19 infections (Pearson correlation coefficient=0.8528; P<.001). We also observed a 1-week leading trend (Pearson correlation coefficient=0.8802; P<.001) between them. The frequency of symptoms showed dynamic changes as the pandemic progressed, from typical respiratory symptoms in the early stage to more musculoskeletal and nervous symptoms in the later stages. We identified the difference in symptoms between the Delta and Omicron periods. There were fewer severe symptoms (coma and dyspnea), more flu-like symptoms (throat pain and nasal congestion), and fewer typical COVID symptoms (anosmia and taste altered) in the Omicron period than in the Delta period (all P<.001). Network analysis revealed co-occurrences among symptoms and systems corresponding to specific disease progressions, including palpitations (cardiovascular) and dyspnea (respiratory), and alopecia (musculoskeletal) and impotence (reproductive).

Conclusions: This study identified more and milder COVID-19 symptoms than clinical research and characterized the dynamic symptom evolution based on 400 million tweets over 27 months. The symptom network revealed potential comorbidity risk and prognostic disease progression. These findings demonstrate that the cooperation of social media and a well-designed workflow can depict a holistic picture of pandemic symptoms to complement clinical studies.
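
The abstract describes three computational steps: matching tweets against a symptom lexicon, counting symptom co-occurrences, and correlating weekly symptom reports with new cases at a 1-week lead. A minimal sketch of those steps follows. It is not the authors' implementation: the toy `LEXICON`, the substring matching, and every function name are hypothetical, and the study's own matching and statistical procedures may differ.

```python
from collections import Counter
from itertools import combinations
from math import sqrt

# Hypothetical toy lexicon mapping a synonym to a canonical symptom.
# The study's lexicon (257 symptoms, 1808 synonyms) is not reproduced here.
LEXICON = {
    "sore throat": "throat pain",
    "throat pain": "throat pain",
    "stuffy nose": "nasal congestion",
    "short of breath": "dyspnea",
    "racing heart": "palpitations",
}


def extract_symptoms(text):
    """Return the canonical symptoms whose synonyms occur in the text.

    Assumption: plain case-insensitive substring matching.
    """
    lowered = text.lower()
    return {symptom for synonym, symptom in LEXICON.items() if synonym in lowered}


def cooccurrence_counts(texts):
    """Count how often each unordered symptom pair appears in the same text."""
    counts = Counter()
    for text in texts:
        symptoms = sorted(extract_symptoms(text))
        counts.update(combinations(symptoms, 2))
    return counts


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def lagged_pearson(symptom_counts, case_counts, lead_weeks):
    """Correlate symptom reports in week t with cases in week t + lead_weeks."""
    if lead_weeks == 0:
        return pearson(symptom_counts, case_counts)
    return pearson(symptom_counts[:-lead_weeks], case_counts[lead_weeks:])
```

The pair counts returned by `cooccurrence_counts` can serve as edge weights of a co-occurrence network, with symptoms as nodes. Calling `lagged_pearson` with `lead_weeks=0` and `lead_weeks=1` mirrors the comparison between the same-week correlation and the 1-week leading trend reported in the Results.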
Pages: 16
Related papers
50 in total
  • [21] Revealing the spatial co-occurrence patterns of multi-emotions from social media data
    Wang, Dongyang
    Wang, Yandong
    Fu, Xiaokang
    Dou, Mingxuan
    Dong, Shihai
    Zhang, Duocai
    TELEMATICS AND INFORMATICS, 2023, 84
  • [22] Considering social inequalities in health in large-scale testing for COVID-19 in Montreal: a qualitative case study
    Gagnon-Dufresne, Marie-Catherine
    Gautier, Lara
    Beaujoin, Camille
    Lamothe, Ashley Savard
    Mikanagu, Rachel
    Cloos, Patrick
    Ridde, Valery
    Zinszer, Kate
    BMC PUBLIC HEALTH, 2022, 22 (01)
  • [23] Using Co-occurrence Analysis to Expand Consumer Health Vocabularies from Social Media Data
    Jiang, Ling
    Yang, Christopher C.
    2013 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2013), 2013, : 74 - 81
  • [24] Understanding Urban Park-Based Social Interaction in Shanghai During the COVID-19 Pandemic: Insights from Large-Scale Social Media Analysis
    Wang, Haotian
    Su, Tianyu
    Zhao, Wanting
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2025, 14 (02)
  • [25] Mining of Opinions on COVID-19 Large-Scale Social Restrictions in Indonesia: Public Sentiment and Emotion Analysis on Online Media
    Sakti, Andi Muhammad Tri
    Mohamad, Emma
    Azlan, Arina Anis
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (08)
  • [26] Effects of Antidepressants on COVID-19 Outcomes: Retrospective Study on Large-Scale Electronic Health Record Data
    Rahman, Mahmudur
    Mahi, Atqiya Munawara
    Melamed, Rachel
    Alam, Mohammad Arif Ul
    INTERACTIVE JOURNAL OF MEDICAL RESEARCH, 2023, 12
  • [27] Approach of a Japanese co-occurrence words collection method for construction of linked open data for COVID-19
    Nagai, Yuki
    Oda, Tetsuya
    Saito, Nobuki
    Hirata, Aoto
    Hirota, Masaharu
    Katayama, Kengo
    2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020, 2020, : 478 - 479
  • [28] COS2: Detecting Large-Scale COVID-19 Misinformation in Social Networks
    Xu, Hailu
    Curci, Macro
    Ek, Sophanna
    Liu, Pinchao
    Li, Zhengxiong
    Xu, Shuai
    CLOUD COMPUTING, CLOUD 2021, 2022, 12989 : 97 - 104
  • [29] Large-scale decrease in the social salience of climate change during the COVID-19 pandemic
    Spisak, Brian R.
    State, Bogdan
    van de Leemput, Ingrid
    Scheffer, Marten
    Liu, Yuwei
    PLOS ONE, 2022, 17 (01)
  • [30] Changes in Public Response Associated With Various COVID-19 Restrictions in Ontario, Canada: Observational Infoveillance Study Using Social Media Time Series Data
    Chum, Antony
    Nielsen, Andrew
    Bellows, Zachary
    Farrell, Eddie
    Durette, Pierre-Nicolas
    Banda, Juan M.
    Cupchik, Gerald
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (08)