Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review

被引:24
|
作者
Sylvestre, Emmanuelle [1 ,2 ]
Joachim, Clarisse [3 ,4 ]
Cecilia-Joseph, Elsa [2 ]
Bouzille, Guillaume [1 ]
Campillo-Gimenez, Boris [1 ,5 ]
Cuggia, Marc [1 ]
Cabie, Andre [6 ,7 ,8 ]
机构
[1] Univ Rennes, CHU Rennes, INSERM, LTSI,UMR 1099, Rennes, France
[2] CHU Martinique, Ctr Donnees Clin, Martinique, France
[3] CHU Martinique, Pole Cancerol Hematol Urol, Registre Gen Canc Martinique, Martinique, France
[4] CHU Martinique, Pole Cancerol Hematol Urol, Martinique Canc Data Hub, Martinique, France
[5] Ctr Lutte Canc Eugene Marquis, Rennes, France
[6] CHU Martinique, Infect & Trop Dis Unit, Martinique, France
[7] CHU Martinique, INSERM, CIC 1424, Martinique, France
[8] Univ Montpellier, INSERM, EFS, Univ Antilles,PCCEI, Montpellier, France
来源
PLOS NEGLECTED TROPICAL DISEASES | 2022年 / 16卷 / 01期
关键词
SOCIAL MEDIA SURVEILLANCE; FEVER SURVEILLANCE; DISEASE OUTBREAKS; MODEL; CLASSIFICATION; EPIDEMICS; PROGNOSIS; NETWORKS; CLIMATE; TRENDS;
D O I
10.1371/journal.pntd.0010056
中图分类号
R51 [传染病];
学科分类号
100401 ;
摘要
BackgroundTraditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, based on non-traditional and non-clinical data sources. The aim of this systematic review was to identify studies that used real-world data, Big Data and/or machine learning methods to monitor and predict dengue-related outcomes. Methodology/Principal findingsWe performed a search in PubMed, Scopus, Web of Science and grey literature between January 1, 2000 and August 31, 2020. The review (ID: CRD42020172472) focused on data-driven studies. Reviews, randomized control trials and descriptive studies were not included. Among the 119 studies included, 67% were published between 2016 and 2020, and 39% used at least one novel data stream. The aim of the included studies was to predict a dengue-related outcome (55%), assess the validity of data sources for dengue surveillance (23%), or both (22%). Most studies (60%) used a machine learning approach. Studies on dengue prediction compared different prediction models, or identified significant predictors among several covariates in a model. The most significant predictors were rainfall (43%), temperature (41%), and humidity (25%). The two models with the highest performances were Neural Networks and Decision Trees (52%), followed by Support Vector Machine (17%). We cannot rule out a selection bias in our study because of our two main limitations: we did not include preprints and could not obtain the opinion of other international experts. Conclusions/SignificanceCombining real-world data and Big Data with machine learning methods is a promising approach to improve dengue prediction and monitoring. Future studies should focus on how to better integrate all available data sources and methods to improve the response and dengue management by stakeholders. Author summaryDengue is one of the most important arbovirus infections in the world and its public health, societal and economic burden is increasing. Although the majority of dengue cases are asymptomatic or mild, severe disease forms can lead to death. For this reason, early diagnosis and monitoring of dengue are crucial to decrease mortality. However, most endemic regions still rely on traditional monitoring methods, despite the growing availability of novel data sources and data-driven methods based on real-world data, Big Data, and machine learning algorithms. In this systematic review, we identified and analyzed studies that used these novel approaches for dengue monitoring and/or prediction. We found that novel data streams, such as Internet search engines and social media platforms, and machine learning methods can be successfully used to improve dengue management, but are still vastly ignored in real life. These approaches should be combined with traditional methods to help stakeholders better prepare for each outbreak and improve early responsiveness.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Data-driven techniques for temperature data prediction: big data analytics approach
    Oloyede, Adamson
    Ozuomba, Simeon
    Asuquo, Philip
    Olatomiwa, Lanre
    Longe, Omowunmi Mary
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (02)
  • [32] Telemonitoring of Real-World Health Data in Cardiology: A Systematic Review
    Kinast, Benjamin
    Lutz, Matthias
    Schreiweis, Bjorn
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (17)
  • [33] Big Data Analytics in Education: A Data-Driven Literature Review
    Shabihi, Negar
    Kim, Mi Song
    IEEE 21ST INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2021), 2021, : 154 - 156
  • [34] PREDICTION OF FLOOD IN KARKHEH BASIN USING DATA-DRIVEN METHODS
    Kamali, S.
    Saedi, F.
    Asghari, K.
    ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 349 - 354
  • [35] Beyond data sharing: Using real-world data for teaching real-world computational workflows and for benchmarking new methods
    Jansen, Johanna
    Amaro, Rommie
    Tseng, Y. Jane
    Cornell, Wendy
    Esposito, Emilio
    Walters, Pat
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
  • [36] SRCON: A Data-Driven Network Performance Simulator for Real-World Wireless Networks
    Luo, Zhi-Quan
    Zheng, Xi
    Lopez-Perez, David
    Yan, Qi
    Chen, Xin
    Wang, Nanbin
    Shi, Qingjiang
    Chang, Tsung-Hui
    Garcia-Rodriguez, Adrian
    IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (06) : 96 - 102
  • [37] A Data-Driven Identification Procedure for HVAC Processes with Laboratory and Real-World Validation
    Minarcik, Peter
    Prochazka, Hynek
    Gulan, Martin
    PROCESSES, 2022, 10 (01)
  • [38] A data-driven workflow to improve energy efficient operation of commercial buildings: A review with real-world examples
    Abuimara, Tareq
    Hobson, Brodie W.
    Gunay, Burak
    O'Brien, William
    BUILDING SERVICES ENGINEERING RESEARCH & TECHNOLOGY, 2022, 43 (04): : 517 - 534
  • [39] A Real-World Data-Driven approach for estimating environmental impacts of traffic accidents
    Liao, Xishun
    Wu, Guoyuan
    Yang, Lan
    Barth, Matthew J.
    TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2023, 117
  • [40] Data-Driven Fault Detection and Diagnosis: Challenges and Opportunities in Real-World Scenarios
    Calabrese, Francesca
    Regattieri, Alberto
    Bortolini, Marco
    Galizia, Francesco Gabriele
    APPLIED SCIENCES-BASEL, 2022, 12 (18):