Large-scale prediction of stream water quality using an interpretable deep learning approach

Cited by: 20
Authors
Zheng, Hang [1]
Liu, Yueyi [1]
Wan, Wenhua [1]
Zhao, Jianshi [2]
Xie, Guanti [3]
Affiliations
[1] Dongguan Univ Technol, Sch Environm & Civil Engn, Dongguan 523808, Peoples R China
[2] Tsinghua Univ, Dept Hydraul Engn, Beijing 100084, Peoples R China
[3] Dongguan Shigu Sewage Treatment Co Ltd, Dongguan 523808, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Water quality; Deep learning; Prediction; Interpretable; Large scale; LAND-USE; SPATIOTEMPORAL VARIABILITY; RIVER-BASIN; MODEL; REGRESSION; TURBIDITY; COVER; TIME; FLOW;
DOI
10.1016/j.jenvman.2023.117309
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanisms to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Meteorological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments covering a total area of over 575,250 km² in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and the direction of their influence on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen (NH3-N), TN, TP, and turbidity in the stream water in the case area, respectively.
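The attribution idea behind SHapley Additive exPlanations can be illustrated with a minimal sketch. This is not the authors' implementation, and it does not use the SHAP library: it computes exact Shapley values by enumerating feature subsets, which is only feasible for a handful of features. The function `toy_model`, its coefficients, the feature names, and the all-zero baseline are hypothetical choices made for illustration only.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline).

    Features outside a coalition are replaced by their baseline value.
    The cost grows as 2**n, so this is only practical for small n.
    """
    n = len(x)
    features = range(n)

    def value(subset):
        # Coalition members keep the observed value; the rest use the baseline.
        mixed = [x[i] if i in subset else baseline[i] for i in features]
        return model(mixed)

    phi = [0.0] * n
    for i in features:
        others = [j for j in features if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size that exclude feature i.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_model(v):
    # Hypothetical stand-in for a trained predictor (assumed, not from the paper).
    rain_3d, forest_frac, pop_density = v
    return (2.0 * rain_3d - 3.0 * forest_frac
            + 0.5 * pop_density + 1.0 * rain_3d * pop_density)


x = [1.0, 0.5, 2.0]          # one hypothetical sample
baseline = [0.0, 0.0, 0.0]   # hypothetical reference input
phi = shapley_values(toy_model, x, baseline)

for name, contribution in zip(["rain_3d", "forest_frac", "pop_density"], phi):
    print(f"{name}: {contribution:+.3f}")
```

The sign of each value gives the direction of a feature's influence on this one prediction, and the values sum to the difference between the prediction and the baseline output. In this toy case the interaction term is split equally between the two features involved.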
Pages: 14