Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review

被引:24
|
作者
Sylvestre, Emmanuelle [1 ,2 ]
Joachim, Clarisse [3 ,4 ]
Cecilia-Joseph, Elsa [2 ]
Bouzille, Guillaume [1 ]
Campillo-Gimenez, Boris [1 ,5 ]
Cuggia, Marc [1 ]
Cabie, Andre [6 ,7 ,8 ]
机构
[1] Univ Rennes, CHU Rennes, INSERM, LTSI,UMR 1099, Rennes, France
[2] CHU Martinique, Ctr Donnees Clin, Martinique, France
[3] CHU Martinique, Pole Cancerol Hematol Urol, Registre Gen Canc Martinique, Martinique, France
[4] CHU Martinique, Pole Cancerol Hematol Urol, Martinique Canc Data Hub, Martinique, France
[5] Ctr Lutte Canc Eugene Marquis, Rennes, France
[6] CHU Martinique, Infect & Trop Dis Unit, Martinique, France
[7] CHU Martinique, INSERM, CIC 1424, Martinique, France
[8] Univ Montpellier, INSERM, EFS, Univ Antilles,PCCEI, Montpellier, France
来源
PLOS NEGLECTED TROPICAL DISEASES | 2022年 / 16卷 / 01期
关键词
SOCIAL MEDIA SURVEILLANCE; FEVER SURVEILLANCE; DISEASE OUTBREAKS; MODEL; CLASSIFICATION; EPIDEMICS; PROGNOSIS; NETWORKS; CLIMATE; TRENDS;
D O I
10.1371/journal.pntd.0010056
中图分类号
R51 [传染病];
学科分类号
100401 ;
摘要
BackgroundTraditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, based on non-traditional and non-clinical data sources. The aim of this systematic review was to identify studies that used real-world data, Big Data and/or machine learning methods to monitor and predict dengue-related outcomes. Methodology/Principal findingsWe performed a search in PubMed, Scopus, Web of Science and grey literature between January 1, 2000 and August 31, 2020. The review (ID: CRD42020172472) focused on data-driven studies. Reviews, randomized control trials and descriptive studies were not included. Among the 119 studies included, 67% were published between 2016 and 2020, and 39% used at least one novel data stream. The aim of the included studies was to predict a dengue-related outcome (55%), assess the validity of data sources for dengue surveillance (23%), or both (22%). Most studies (60%) used a machine learning approach. Studies on dengue prediction compared different prediction models, or identified significant predictors among several covariates in a model. The most significant predictors were rainfall (43%), temperature (41%), and humidity (25%). The two models with the highest performances were Neural Networks and Decision Trees (52%), followed by Support Vector Machine (17%). We cannot rule out a selection bias in our study because of our two main limitations: we did not include preprints and could not obtain the opinion of other international experts. Conclusions/SignificanceCombining real-world data and Big Data with machine learning methods is a promising approach to improve dengue prediction and monitoring. Future studies should focus on how to better integrate all available data sources and methods to improve the response and dengue management by stakeholders. Author summaryDengue is one of the most important arbovirus infections in the world and its public health, societal and economic burden is increasing. Although the majority of dengue cases are asymptomatic or mild, severe disease forms can lead to death. For this reason, early diagnosis and monitoring of dengue are crucial to decrease mortality. However, most endemic regions still rely on traditional monitoring methods, despite the growing availability of novel data sources and data-driven methods based on real-world data, Big Data, and machine learning algorithms. In this systematic review, we identified and analyzed studies that used these novel approaches for dengue monitoring and/or prediction. We found that novel data streams, such as Internet search engines and social media platforms, and machine learning methods can be successfully used to improve dengue management, but are still vastly ignored in real life. These approaches should be combined with traditional methods to help stakeholders better prepare for each outbreak and improve early responsiveness.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Life Prediction Methods Based on Data-driven: Review and Trend
    Zhu, Lisha
    Jiang, Bin
    Cheng, Yuehua
    2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 1682 - 1686
  • [22] A Systematic Review of Asthma Phenotypes Derived by Data-Driven Methods
    Cunha, Francisco
    Amaral, Rita
    Jacinto, Tiago
    Sousa-Pinto, Bernardo
    Fonseca, Joao A.
    DIAGNOSTICS, 2021, 11 (04)
  • [23] The Role of Heterogenous Real-world Data for Dengue Surveillance in Martinique: Observational Retrospective Study
    Sylvestre, Emmanuelle
    Cecilia-Joseph, Elsa
    Bouzille, Guillaume
    Najioullah, Fatiha
    Etienne, Manuel
    Malouines, Fabrice
    Rosine, Jacques
    Julie, Sandrine
    Cabie, Andre
    Cuggia, Marc
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2022, 8 (12):
  • [24] Emulations of oncology trials using real-world data: a systematic literature review
    Rider, Jennifer R.
    Wasserman, Asher
    Slipski, Lukas
    Carrigan, Gillis
    Harvey, Raymond
    Jiao, Xiaolong
    Mcroy, Lynn
    Pace, Nelson D.
    Becnel, Lauren
    Bruno, Amanda
    Eckert, Joy C.
    Hodgkins, Priscilla
    Jain, Purva
    Merola, David
    Ovbiosa, Osayi E.
    Natanzon, Yanina
    Pinheiro, Simone
    Quinn, Jameson
    Rodriguez-Watson, Carla
    Campbell, Ulka
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2025,
  • [25] Real-world data-driven risk prediction of hospitalization for heart failure in non-diabetic CKD
    Kleinjung, F.
    Schuchhardt, J.
    Bauer, C.
    Lindemann, S.
    Brinker, M.
    Kong, S.
    Horvat-Broecker, A.
    Vaitsiakhovich, T.
    Wanner, C.
    EUROPEAN HEART JOURNAL, 2021, 42 : 3057 - 3057
  • [26] A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data
    Buczak, Anna L.
    Koshute, Phillip T.
    Babin, Steven M.
    Feighner, Brian H.
    Lewis, Sheryl H.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [27] A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data
    Anna L Buczak
    Phillip T Koshute
    Steven M Babin
    Brian H Feighner
    Sheryl H Lewis
    BMC Medical Informatics and Decision Making, 12
  • [28] A data-driven methodology to define and visualise line of therapy for real-world data epidemiology of cancers
    Abbasi, Ali
    Hermans, Ruben
    Patel, Dony
    Grimson, Fiona
    McMahon, Peter
    Kim, Joseph
    Groves, Eric S.
    Layton, Deborah
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2018, 27 : 412 - 413
  • [29] Translational Informatics Connects Real-World Information to Knowledge in an Increasingly Data-Driven World
    McDonough, Caitrin W.
    Breitenstein, Matthew K.
    Shahin, Mohamed
    Empey, Philip E.
    Freimuth, Robert R.
    Li, Lang
    Liebman, Michael
    Tuteja, Sony
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2020, 107 (04) : 738 - 741
  • [30] Data-driven techniques for temperature data prediction: big data analytics approach
    Adamson Oloyede
    Simeon Ozuomba
    Philip Asuquo
    Lanre Olatomiwa
    Omowunmi Mary Longe
    Environmental Monitoring and Assessment, 2023, 195