Leptospirosis modelling using hydrometeorological indices and random forest machine learning

被引:0
|
作者
Veianthan Jayaramu
Zed Zulkafli
Simon De Stercke
Wouter Buytaert
Fariq Rahmat
Ribhan Zafira Abdul Rahman
Asnor Juraiza Ishak
Wardah Tahir
Jamalludin Ab Rahman
Nik Mohd Hafiz Mohd Fuzi
机构
[1] Universiti Putra Malaysia,Department of Civil Engineering
[2] Imperial College London,Department of Civil and Environmental Engineering
[3] Universiti Putra Malaysia,Department of Electrical and Electronic Engineering
[4] Universiti Teknologi Mara,Flood Control Research Group, Faculty of Civil Engineering
[5] International Islamic University Malaysia,Department of Community Medicine, Kulliyyah of Medicine
[6] Ministry of Health Malaysia,Kelantan State Health Department
来源
International Journal of Biometeorology | 2023年 / 67卷
关键词
Leptospirosis; Hydrometeorological indices; Cross-correlation analysis; Random forest; Variable importance; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
Leptospirosis is a zoonosis that has been linked to hydrometeorological variability. Hydrometeorological averages and extremes have been used before as drivers in the statistical prediction of disease. However, their importance and predictive capacity are still little known. In this study, the use of a random forest classifier was explored to analyze the relative importance of hydrometeorological indices in developing the leptospirosis model and to evaluate the performance of models based on the type of indices used, using case data from three districts in Kelantan, Malaysia, that experience annual monsoonal rainfall and flooding. First, hydrometeorological data including rainfall, streamflow, water level, relative humidity, and temperature were transformed into 164 weekly average and extreme indices in accordance with the Expert Team on Climate Change Detection and Indices (ETCCDI). Then, weekly case occurrences were classified into binary classes “high” and “low” based on an average threshold. Seventeen models based on “average,” “extreme,” and “mixed” indices were trained by optimizing the feature subsets based on the model computed mean decrease Gini (MDG) scores. The variable importance was assessed through cross-correlation analysis and the MDG score. The average and extreme models showed similar prediction accuracy ranges (61.5–76.1% and 72.3–77.0%) while the mixed models showed an improvement (71.7–82.6% prediction accuracy). An extreme model was the most sensitive while an average model was the most specific. The time lag associated with the driving indices agreed with the seasonality of the monsoon. The rainfall variable (extreme) was the most important in classifying the leptospirosis occurrence while streamflow was the least important despite showing higher correlations with leptospirosis.
引用
收藏
页码:423 / 437
页数:14
相关论文
共 50 条
  • [21] Investigation of hydrometeorological influences on reservoir releases using explainable machine learning methods
    Fan, Ming
    Zhang, Lujun
    Liu, Siyan
    Yang, Tiantian
    Lu, Dan
    FRONTIERS IN WATER, 2023, 5
  • [22] Pier scour modelling using random forest regression
    Pal, M. (mpce_pal@yahoo.co.uk), 1600, Taylor and Francis Ltd. (19):
  • [23] Research on Machine Learning Framework Based on Random Forest Algorithm
    Ren, Qiong
    Cheng, Hui
    Han, Hai
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS I, 2017, 1820
  • [24] A random forest machine learning model to detect fluvial hazards
    Gava, Marco
    Biron, Pascale M.
    Buffin-Belanger, Thomas
    RIVER RESEARCH AND APPLICATIONS, 2024, 40 (10) : 1837 - 1854
  • [25] Identifying Hydrometeorological Factors Influencing Reservoir Releases Using Machine Learning Methods
    Fan, Ming
    Zhang, Lujun
    Liu, Siyan
    Yang, Tiantian
    Lu, Dan
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 1102 - 1110
  • [26] Post-typhoon forest damage estimation using multiple vegetation indices and machine learning models
    Chen, Xinyu
    Avtar, Ram
    Umarhadi, Deha Agus
    Louw, Albertus Stephanus
    Shrivastava, Sourabh
    Yunus, Ali P.
    Khedher, Khaled Mohamed
    Takemi, Tetsuya
    Shibata, Hideaki
    WEATHER AND CLIMATE EXTREMES, 2022, 38
  • [27] Process parameters based machine learning model for bead profile prediction in activated TIG Welding using random forest machine learning
    Munghate, Abhinav Arun
    Thapliyal, Shivraman
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2024, 238 (12) : 1761 - 1768
  • [28] Power Transformer Fault Diagnosis Using Random Forest and Optimized Kernel Extreme Learning Machine
    Kari, Tusongjiang
    He, Zhiyang
    Rouzi, Aisikaer
    Zhang, Ziwei
    Ma, Xiaojing
    Du, Lin
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (01): : 691 - 705
  • [29] Construction of a Diagnostic Algorithm for Diagnosis of Adult Asthma Using Machine Learning with Random Forest and XGBoost
    Tomita, Katsuyuki
    Yamasaki, Akira
    Katou, Ryohei
    Ikeuchi, Tomoyuki
    Touge, Hirokazu
    Sano, Hiroyuki
    Tohda, Yuji
    DIAGNOSTICS, 2023, 13 (19)
  • [30] Predicting Global Marine Sediment Density Using the Random Forest Regressor Machine Learning Algorithm
    Graw, J. H.
    Wood, W. T.
    Phrampus, B. J.
    JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2021, 126 (01)