Cost-effective time-efficient subnational-level surveillance using Twitter: Kingdom of Saudi Arabia case study

被引:1
|
作者
Elteir, Marwa K. [1 ]
机构
[1] City Sci Res & Technol Applicat SRTA City, Informat Res Inst IRI, Alexandria, Egypt
关键词
Twitter; Surveillance; Subnational; Dataset; COVID-19; KSAGeoCOV; TRACK;
D O I
10.1007/s42452-024-06425-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
An effective Twitter-based surveillance system should provide insights at national and subnational levels. The literature identifies two methodologies for geolocating tweets: using only geotagged tweets or retrieving and geolocating all relevant tweets, then filtering out those not belonging to the target geographical region.The first methodology is accurate, cost-effective, and time-efficient but has limited coverage. The second offers better coverage but is less accurate, particularly for informal Arabic text, and is neither cost-effective nor time-efficient due to Twitter's new policies. There is a gap in the literature for an accurate, cost-effective, and time-efficient solution with reasonable coverage at national and subnational levels. To fill this gap, we propose a methodology that uses an underutilized feature in the Twitter backend to geolocate tweets during data collection.This retrieves both geotagged and geolocated tweets, ensuring accuracy and better coverage. It is also cost-effective and time-efficient as only the target tweets are retrieved. Applying this to Saudi Arabia for COVID-19, we generated a dataset, KSAGeoCOV, with 4.25 times more tweets than a geotagged-only dataset. It successfully predicted two COVID-19 outbreaks in June 2021 and January 2022. The Pearson correlation coefficient between WHO weekly reported cases and weekly returned tweets, with a 1-week lag, is r = 0.733; p < 0.001 for Arabic tweets and r = 0.814; p < 0.001 when including English tweets, indicating a very strong correlation at the national level. At the subnational level, top-populated provinces show strong correlations ( r = 0.64 to 0.74; p < 0.003).
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Polo-Oltra et al. Cost-Effective and Time-Efficient Molecular Assisted Selection for PPV Resistance in Apricot Based on ParPMC2 Allele-Specific PCR (vol 10, 1292, 2020)
    Polo-Oltra, Angela
    Romero, Carlos
    Lopez, Inmaculada
    Badenes, Maria Luisa
    Zuriaga, Elena
    AGRONOMY-BASEL, 2022, 12 (03):
  • [42] Just-in-Time Cost-Effective Off-the-Shelf Remote Telementoring of Paramedical Personnel in Bedside Lung Sonography-A Technical Case Study
    Biegler, Nancy
    McBeth, Paul B.
    Tevez-Molina, Martha C.
    McMillan, Janelle
    Crawford, Innes
    Hamilton, Douglas R.
    Kirkpatrick, Andrew W.
    TELEMEDICINE AND E-HEALTH, 2012, 18 (10) : 807 - 809
  • [43] Cost-effective building renovation at district level combining energy efficiency & renewables - Methodology assessment proposed in IEA EBC Annex 75 and a demonstration case study
    Teres-Zubiaga, Jon
    Bolliger, Roman
    Almeida, Manuela G.
    Barbosa, Ricardo
    Rose, Jorgen
    Thomsen, Kirsten E.
    Montero, Eduardo
    Briones-Llorente, Raul
    ENERGY AND BUILDINGS, 2020, 224
  • [44] A cost-effective and efficient strategy for Illumina sequencing of fungal communities: A case study of beech endophytes identified elevation as main explanatory factor for diversity and community composition
    Siddique, A. B.
    Unterseher, M.
    FUNGAL ECOLOGY, 2016, 20 : 175 - 185
  • [45] Green AI-Driven Concept for the Development of Cost-Effective and Energy-Efficient Deep Learning Method: Application in the Detection of Eimeria Parasites as a Case Study
    Acmali, Suheda Semih
    Ortakci, Yasin
    Seker, Huseyin
    ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (07)
  • [46] Urban Land Cover Change Modelling Using Time-Series Satellite Images: A Case Study of Urban Growth in Five Cities of Saudi Arabia
    Alqurashi, Abdullah E.
    Kumar, Lalit
    Sinha, Priyakant
    REMOTE SENSING, 2016, 8 (10)
  • [47] Using fuzzy cost-time profile for effective implementation of lean programmes; SAIPA automotive manufacturer, case study
    Keykavoussi, Ashkan
    Ebrahimi, Ahmad
    TOTAL QUALITY MANAGEMENT & BUSINESS EXCELLENCE, 2020, 31 (13-14) : 1519 - 1543
  • [48] A cost-effective and efficient strategy for illumina sequencing of fungal communities: A case study of beech endophytes identified elevation as main explanatory factor for diversity and community composition (vol 20, pg 175, 2016)
    Siddique, A. B.
    Unterseher, M.
    FUNGAL ECOLOGY, 2016, 22 : 1 - 1
  • [49] Time Series Forecasting Using a Two-Level Multi-Objective Genetic Algorithm: A Case Study of Maintenance Cost Data for Tunnel Fans
    Al-Douri, Yamur K.
    Hamodi, Hussan
    Lundberg, Jan
    ALGORITHMS, 2018, 11 (08)
  • [50] Optimal, Reliable and Cost-Effective Framework of Photovoltaic-Wind-Battery Energy System Design Considering Outage Concept Using Grey Wolf Optimizer Algorithm-Case Study for Iran
    Naderipour, Amirreza
    Abdul-Malek, Zulkurnain
    Vahid, Masoud Zahedi
    Seifabad, Zahra Mirzaei
    Hajivand, Mohammad
    Arabi-Nowdeh, Saber
    IEEE ACCESS, 2019, 7 : 182611 - 182623