Explainable machine learning models for estimating daily dissolved oxygen concentration of the Tualatin River

被引:4
|
作者
Li, Shuguang [1 ]
Qasem, Sultan Noman [2 ,3 ]
Band, Shahab S. [4 ,5 ,9 ,10 ]
Ameri, Rasoul [6 ]
Pai, Hao-Ting [7 ,11 ]
Mehdizadeh, Saeid [8 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] Imam Mohammad Ibn Saud Islamic Univ IMSIU, Coll Comp & Informat Sci, Comp Sci Dept, Riyadh, Saudi Arabia
[3] Taiz Univ, Fac Appl Sci, Comp Sci Dept, Taizi, Yemen
[4] Natl Yunlin Univ Sci & Technol, Future Technol Res Ctr, Touliu, Taiwan
[5] Natl Yunlin Univ Sci & Technol, Int Grad Sch Artificial Intelligence, Dept Informat Management, Touliu, Taiwan
[6] Natl Yunlin Univ Sci & Technol, Dept Informat Management, Touliu, Taiwan
[7] Natl Pingtung Univ, Big Data Applicat Business, Pingtung, Taiwan
[8] Urmia Univ, Water Engn Dept, Orumiyeh, Iran
[9] Natl Yunlin Univ Sci & Technol, Future Technol Res Ctr, 123 Univ Rd,Sect 3, Touliu 64002, Yunlin, Taiwan
[10] Natl Yunlin Univ Sci & Technol, Int Grad Sch Artificial Intelligence, Dept Informat Management, 123 Univ Rd,Sect 3, Touliu 64002, Yunlin, Taiwan
[11] Natl Pingtung Univ, Bachelor Program Big Data Applicat Business, 51 Minsheng E Rd, Pingtung 900392, Pingtung, Taiwan
关键词
Explainable machine learning; dissolved oxygen concentration; estimation; SHapley additive explanations; SUPPORT VECTOR MACHINE; WATER-QUALITY; PREDICTION; REGRESSION;
D O I
10.1080/19942060.2024.2304094
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Monitoring the quality of river water is of fundamental importance and needs to be taken into consideration when it comes to the research into the hydrological field. In this context, the concentration of the dissolved oxygen (DO) is one of the most significant indicators of the quality of river water. The current study aimed to estimate the minimum, maximum, and mean DO concentrations (DO min, DO max, DO mean) at a gauging station located on Tualatin River, United States. To that end, four machine learning models, such as support vector regression (SVR), multi-layer perceptron (MLP), random forest (RF), and gradient boosting (GB) were established. Root mean square error (RMSE), mean absolute error (MAE), coefficient of correlation (R), and Nash-Sutcliffe efficiency (NSE) metrics were employed to better assess the accuracies of these models. The modeling results demonstrated that the SVR and MLP surpassed the RF and GB models. Despite this, the SVR was concluded to be the best-performing method when used to estimate the DO min, DO max, and DO mean. The best error statistics in the testing phase were related to the SVR model with full (four) inputs to estimate DO mean concentration (RMSE = 0.663 mg/l, MAE = 0.508 mg/l, R = 0.945, NSE = 0.875). Finally, the explainability of the superior models (i.e. SVR models) was conducted using SHapley Additive exPlanations (SHAP) for the first time to estimate DO concentration. In fact, evaluating the explainability of machine learning models can provide useful information about the impact of each of the input estimators used in the procedure of models development. It was concluded that the specific conductance (SC) and followed by water temperature (WT) could provide the most contributions for estimating the DO min, DO max, and DO mean concentrations.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Using Machine Learning Models for Short-Term Prediction of Dissolved Oxygen in a Microtidal Estuary
    Gachloo, Mina
    Liu, Qianqian
    Song, Yang
    Wang, Guozhi
    Zhang, Shuhao
    Hall, Nathan
    WATER, 2024, 16 (14)
  • [42] River flooding mechanisms and their changes in Europe revealed by explainable machine learning
    Jiang, Shijie
    Bevacqua, Emanuele
    Zscheischler, Jakob
    HYDROLOGY AND EARTH SYSTEM SCIENCES, 2022, 26 (24) : 6339 - 6359
  • [43] Explainable machine learning models for early gastric cancer diagnosis
    Du, Hongyang
    Yang, Qingfen
    Ge, Aimin
    Zhao, Chenhao
    Ma, Yunhua
    Wang, Shuyu
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [44] Use of explainable machine learning models in blast load prediction
    Widanage, C.
    Mohotti, D.
    Lee, C. K.
    Wijesooriya, K.
    Meddage, D. P. P.
    ENGINEERING STRUCTURES, 2024, 312
  • [45] Explainable prediction of loan default based on machine learning models
    Zhu X.
    Chu Q.
    Song X.
    Hu P.
    Peng L.
    Data Science and Management, 2023, 6 (03): : 123 - 133
  • [46] Explainable Debugger for Black-box Machine Learning Models
    Rasouli, Peyman
    Yu, Ingrid Chieh
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [47] Explainable machine learning models for defects detection in industrial processes
    Oliveira, Rodrigo Marcel Araujo
    Sant'Anna, Angelo Marcio Oliveira
    da Silva, Paulo Henrique Ferreira
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 192
  • [48] Explainable Online Validation of Machine Learning Models for Practical Applications
    Fuhl, Wolfgang
    Rong, Yao
    Motz, Thomas
    Scheidt, Michael
    Hartel, Andreas
    Koch, Andreas
    Kasneci, Enkelejda
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3304 - 3311
  • [49] Dissolved oxygen concentration in river sediment of the Lake Biwa tributaries, Japan
    Yoshihiro Yamada
    Yugo Mito
    Akitake Igeta
    Eitaro Wada
    Limnology, 2012, 13 : 149 - 154
  • [50] Dissolved oxygen concentration in river sediment of the Lake Biwa tributaries, Japan
    Yamada, Yoshihiro
    Mito, Yugo
    Igeta, Akitake
    Wada, Eitaro
    LIMNOLOGY, 2012, 13 (01) : 149 - 154