Trend and Co-occurrence Network of COVID-19 Symptoms From Large-Scale Social Media Data: Infoveillance Study

被引:6
|
作者
Wu, Jiageng [1 ,2 ,3 ]
Wang, Lumin [1 ,2 ,3 ]
Hua, Yining [4 ,5 ]
Li, Minghui [1 ,2 ,3 ]
Zhou, Li [4 ,5 ]
Bates, David W. [4 ,5 ]
Yang, Jie [1 ,2 ,3 ]
机构
[1] Zhejiang Univ, Sch Publ Hlth, Sch Med, 866 Yuhangtang Rd, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, Affiliated Hosp 2, Sch Med, 866 Yuhangtang Rd, Hangzhou 310058, Peoples R China
[3] Key Lab Intelligent Prevent Med Zhejiang Prov, Hangzhou, Peoples R China
[4] Harvard Med Sch, Dept Biomed Informat, Boston, MA USA
[5] Brigham & Womens Hosp, Div Gen Internal Med & Primary Care, Boston, MA USA
基金
美国国家卫生研究院;
关键词
social media; network analysis; public health; data mining; COVID-19;
D O I
10.2196/45419
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: For an emergent pandemic, such as COVID-19, the statistics of symptoms based on hospital data may be biased or delayed due to the high proportion of asymptomatic or mild-symptom infections that are not recorded in hospitals. Meanwhile, the difficulty in accessing large-scale clinical data also limits many researchers from conducting timely research. Objective: Given the wide coverage and promptness of social media, this study aimed to present an efficient workflow to track and visualize the dynamic characteristics and co-occurrence of symptoms for the COVID-19 pandemic from large-scale and long-term social media data. Methods: This retrospective study included 471,553,966 COVID-19-related tweets from February 1, 2020, to April 30, 2022. We curated a hierarchical symptom lexicon for social media containing 10 affected organs/systems, 257 symptoms, and 1808 synonyms. The dynamic characteristics of COVID-19 symptoms over time were analyzed from the perspectives of weekly new cases, overall distribution, and temporal prevalence of reported symptoms. The symptom evolutions between virus strains (Delta and Omicron) were investigated by comparing the symptom prevalence during their dominant periods. A co-occurrence symptom network was developed and visualized to investigate inner relationships among symptoms and affected body systems. Results: This study identified 201 COVID-19 symptoms and grouped them into 10 affected body systems. There was a significant correlation between the weekly quantity of self-reported symptoms and new COVID-19 infections (Pearson correlation coefficient=0.8528; P<.001). We also observed a 1-week leading trend (Pearson correlation coefficient=0.8802; P<.001) between them. The frequency of symptoms showed dynamic changes as the pandemic progressed, from typical respiratory symptoms in the early stage to more musculoskeletal and nervous symptoms in the later stages. We identified the difference in symptoms between the Delta and Omicron periods. There were fewer severe symptoms (coma and dyspnea), more flu-like symptoms (throat pain and nasal congestion), and fewer typical COVID symptoms (anosmia and taste altered) in the Omicron period than in the Delta period (all P<.001). Network analysis revealed co-occurrences among symptoms and systems corresponding to specific disease progressions, including palpitations (cardiovascular) and dyspnea (respiratory), and alopecia (musculoskeletal) and impotence (reproductive). Conclusions: This study identified more and milder COVID-19 symptoms than clinical research and characterized the dynamic symptom evolution based on 400 million tweets over 27 months. The symptom network revealed potential comorbidity risk and prognostic disease progression. These findings demonstrate that the cooperation of social media and a well-designed workflow can depict a holistic picture of pandemic symptoms to complement clinical studies.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Understanding Side Effects of Antidepressants: Large-scale Longitudinal Study on Social Media Data
    Saha, Koustuv
    Torous, John
    Kiciman, Emre
    De Choudhury, Munmun
    JMIR MENTAL HEALTH, 2021, 8 (03):
  • [42] Loneliness and Social Isolation During the COVID-19 Pandemic A Systematic Review Enriched With Empirical Evidence From a Large-Scale Diary Study
    Buecker, Susanne
    Horstmann, Kai T.
    EUROPEAN PSYCHOLOGIST, 2021, 26 (04) : 272 - 284
  • [43] A Network Analysis of the Fear of COVID-19 Scale (FCV-19S): A Large-Scale Cross-Cultural Study in Iran, Bangladesh, and Norway
    Lecuona, Oscar
    Lin, Chung-Ying
    Rozgonjuk, Dmitri
    Norekval, Tone M.
    Iversen, Marjolein M.
    Mamun, Mohammed A.
    Griffiths, Mark D.
    Lin, Ting-, I
    Pakpour, Amir H.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (11)
  • [44] A large-scale machine learning study of sociodemographic factors contributing to COVID-19 severity
    Tumbas, Marko
    Markovic, Sofija
    Salom, Igor
    Djordjevic, Marko
    FRONTIERS IN BIG DATA, 2023, 6
  • [45] COVID-19 Vaccine Hesitancy Among Chinese Population: A Large-Scale National Study
    Wu, Jian
    Li, Quanman
    Silver Tarimo, Clifford
    Wang, Meiyun
    Gu, Jianqin
    Wei, Wei
    Ma, Mingze
    Zhao, Lipei
    Mu, Zihan
    Miao, Yudong
    FRONTIERS IN IMMUNOLOGY, 2021, 12
  • [46] Assessing consumer complaints during COVID-19 in Mexico using large-scale Twitter data
    Gutierrez Leefmans, Manuela
    Garcia Miramon, Fiorentina
    David Murck, Maximilian
    CUADERNOS INFO, 2023, (55) : 162 - 185
  • [47] Leveraging large-scale genetic data to assess the causal impact of COVID-19 on multisystemic diseases
    Zhang, Xiangyang
    Jiang, Zhaohui
    Ma, Jiayao
    Qi, Yaru
    Li, Yin
    Zhang, Yan
    Liu, Yihan
    Wei, Chaochao
    Chen, Yihong
    Liu, Ping
    Peng, Yinghui
    Tan, Jun
    Han, Ying
    Zeng, Shan
    Cai, Changjing
    Shen, Hong
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [48] Evolution of Public Opinion on COVID-19 Vaccination in Japan: Large-Scale Twitter Data Analysis
    Kobayashi, Ryota
    Takedomi, Yuka
    Nakayama, Yuri
    Suda, Towa
    Uno, Takeaki
    Hashimoto, Takako
    Toyoda, Masashi
    Yoshinaga, Naoki
    Kitsuregawa, Masaru
    Rocha, Luis E. C.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (12)
  • [49] Dynamic Associations Between Centers for Disease Control and Prevention Social Media Contents and Epidemic Measures During COVID-19: Infoveillance Study
    Yin, Shuhua
    Chen, Shi
    Ge, Yaorong
    JMIR INFODEMIOLOGY, 2024, 4 (01):
  • [50] COVID-19 and labour market resilience: evidence from large-scale recruitment behaviour
    Yu, Xinguo
    Song, Hengxu
    Ren, Ting
    Xue, Yanbo
    REGIONAL STUDIES, 2024, 58 (01) : 45 - 60