Trend and Co-occurrence Network of COVID-19 Symptoms From Large-Scale Social Media Data: Infoveillance Study

被引:6
|
作者
Wu, Jiageng [1 ,2 ,3 ]
Wang, Lumin [1 ,2 ,3 ]
Hua, Yining [4 ,5 ]
Li, Minghui [1 ,2 ,3 ]
Zhou, Li [4 ,5 ]
Bates, David W. [4 ,5 ]
Yang, Jie [1 ,2 ,3 ]
机构
[1] Zhejiang Univ, Sch Publ Hlth, Sch Med, 866 Yuhangtang Rd, Hangzhou 310058, Peoples R China
[2] Zhejiang Univ, Affiliated Hosp 2, Sch Med, 866 Yuhangtang Rd, Hangzhou 310058, Peoples R China
[3] Key Lab Intelligent Prevent Med Zhejiang Prov, Hangzhou, Peoples R China
[4] Harvard Med Sch, Dept Biomed Informat, Boston, MA USA
[5] Brigham & Womens Hosp, Div Gen Internal Med & Primary Care, Boston, MA USA
基金
美国国家卫生研究院;
关键词
social media; network analysis; public health; data mining; COVID-19;
D O I
10.2196/45419
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: For an emergent pandemic, such as COVID-19, the statistics of symptoms based on hospital data may be biased or delayed due to the high proportion of asymptomatic or mild-symptom infections that are not recorded in hospitals. Meanwhile, the difficulty in accessing large-scale clinical data also limits many researchers from conducting timely research. Objective: Given the wide coverage and promptness of social media, this study aimed to present an efficient workflow to track and visualize the dynamic characteristics and co-occurrence of symptoms for the COVID-19 pandemic from large-scale and long-term social media data. Methods: This retrospective study included 471,553,966 COVID-19-related tweets from February 1, 2020, to April 30, 2022. We curated a hierarchical symptom lexicon for social media containing 10 affected organs/systems, 257 symptoms, and 1808 synonyms. The dynamic characteristics of COVID-19 symptoms over time were analyzed from the perspectives of weekly new cases, overall distribution, and temporal prevalence of reported symptoms. The symptom evolutions between virus strains (Delta and Omicron) were investigated by comparing the symptom prevalence during their dominant periods. A co-occurrence symptom network was developed and visualized to investigate inner relationships among symptoms and affected body systems. Results: This study identified 201 COVID-19 symptoms and grouped them into 10 affected body systems. There was a significant correlation between the weekly quantity of self-reported symptoms and new COVID-19 infections (Pearson correlation coefficient=0.8528; P<.001). We also observed a 1-week leading trend (Pearson correlation coefficient=0.8802; P<.001) between them. The frequency of symptoms showed dynamic changes as the pandemic progressed, from typical respiratory symptoms in the early stage to more musculoskeletal and nervous symptoms in the later stages. We identified the difference in symptoms between the Delta and Omicron periods. There were fewer severe symptoms (coma and dyspnea), more flu-like symptoms (throat pain and nasal congestion), and fewer typical COVID symptoms (anosmia and taste altered) in the Omicron period than in the Delta period (all P<.001). Network analysis revealed co-occurrences among symptoms and systems corresponding to specific disease progressions, including palpitations (cardiovascular) and dyspnea (respiratory), and alopecia (musculoskeletal) and impotence (reproductive). Conclusions: This study identified more and milder COVID-19 symptoms than clinical research and characterized the dynamic symptom evolution based on 400 million tweets over 27 months. The symptom network revealed potential comorbidity risk and prognostic disease progression. These findings demonstrate that the cooperation of social media and a well-designed workflow can depict a holistic picture of pandemic symptoms to complement clinical studies.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Learning from Large-Scale Wearable Device Data for Predicting Epidemics Trend of COVID-19
    Zhu, Guokang
    Li, Jia
    Meng, Zi
    Yu, Yi
    Li, Yanan
    Tang, Xiao
    Dong, Yuling
    Sun, Guangxin
    Zhou, Rui
    Wang, Hui
    Wang, Kongqiao
    Huang, Wang
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2020, 2020
  • [2] The large-scale organization of the bacterial network of ecological co-occurrence interactions
    Freilich, Shiri
    Kreimer, Anat
    Meilijson, Isacc
    Gophna, Uri
    Sharan, Roded
    Ruppin, Eytan
    NUCLEIC ACIDS RESEARCH, 2010, 38 (12) : 3857 - 3868
  • [3] Psychosocial Effects of the COVID-19 Pandemic: Large-scale Quasi-Experimental Study on Social Media
    Saha, Koustuv
    Torous, John
    Caine, Eric D.
    De Choudhury, Munmun
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (11)
  • [4] Predicting COVID-19 Spread from Large-Scale Mobility Data
    Schwabe, Amray
    Persson, Joel
    Feuerriegel, Stefan
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3531 - 3539
  • [5] Using Reports of Symptoms and Diagnoses on Social Media to Predict COVID-19 Case Counts in Mainland China: Observational Infoveillance Study
    Shen, Cuihua
    Chen, Anfan
    Luo, Chen
    Zhang, Jingwen
    Feng, Bo
    Liao, Wang
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (05)
  • [6] Visual Co-occurrence Network: Using Context for Large-Scale Object Recognition in Retail
    Advani, Siddharth
    Smith, Brigid
    Tanabe, Yasuki
    Irick, Kevin
    Cotter, Matthew
    Sampson, Jack
    Narayanan, Vijaykrishnan
    2015 13TH IEEE SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA, 2015, : 103 - 112
  • [7] An In-Depth Analysis of COVID-19 Symptoms Considering the Co-Occurrence of Symptoms Using Clustering Algorithms
    Benito, Diego Javier
    Robles, Jesus Rufino
    Ramirez, Juan
    Anta, Antonio Fernandez
    Aguilar, Jose
    IEEE ACCESS, 2024, 12 : 127792 - 127804
  • [8] COVID-19 Mask Usage and Social Distancing in Social Media Images: Large-scale Deep Learning Analysis
    Singh, Asmit Kumar
    Mehan, Paras
    Sharma, Divyanshu
    Pandey, Rohan
    Sethi, Tavpritesh
    Kumaraguru, Ponnurangam
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2022, 8 (01):
  • [9] Lessons from a large-scale COVID-19 vaccine trial
    Mahla, Ranjeet Singh
    Dustin, Lynn B.
    JOURNAL OF CLINICAL INVESTIGATION, 2022, 132 (18):
  • [10] The Impact of Public Health Events on COVID-19 Vaccine Hesitancy on Chinese Social Media: National Infoveillance Study
    Zhang, Zizheng
    Feng, Guanrui
    Xu, Jiahong
    Zhang, Yimin
    Li, Jinhui
    Huang, Jian
    Akinwunmi, Babatunde
    Zhang, Casper J. P.
    Ming, Wai-kit
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2021, 7 (11):