Large-scale prediction of stream water quality using an interpretable deep learning approach

被引:20
|
作者
Zheng, Hang [1 ]
Liu, Yueyi [1 ]
Wan, Wenhua [1 ]
Zhao, Jianshi [2 ]
Xie, Guanti [3 ]
机构
[1] Dongguan Univ Technol, Sch Environm & Civil Engn, Dongguan 523808, Peoples R China
[2] Tsinghua Univ, Dept Hydraul Engn, Beijing 100084, Peoples R China
[3] Dongguan Shigu Sewage Treatment Co Ltd, Dongguan 523808, Peoples R China
基金
中国国家自然科学基金;
关键词
Water quality; Deep learning; Prediction; Interpretable; Large scale; LAND-USE; SPATIOTEMPORAL VARIABILITY; RIVER-BASIN; MODEL; REGRESSION; TURBIDITY; COVER; TIME; FLOW;
D O I
10.1016/j.jenvman.2023.117309
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3-N), TN, TP, and turbidity in the stream water in the case area, respectively.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Large-scale prediction of tropical stream water quality using Rough Sets Theory
    Albuquerque, Laysson Guillen
    Roque, Fabio de Oliveira
    Valente-Neto, Francisco
    Koroiva, Ricardo
    Buss, Daniel Forsin
    Baptista, Darcilio Fernandes
    Hepp, Luiz Ubiratan
    Kuhlmann, Monica Luisa
    Sundar, S.
    Covich, Alan P.
    Pereira Pinto, Joao Onofre
    ECOLOGICAL INFORMATICS, 2021, 61
  • [2] Prediction of estuarine water quality using interpretable machine learning approach
    Wang, Shuo
    Peng, Hui
    Liang, Shengkang
    JOURNAL OF HYDROLOGY, 2022, 605
  • [3] Rich Punctuations Prediction Using Large-scale Deep Learning
    Wu, Xueyang
    Zhu, Su
    Wu, Yue
    Yu, Kai
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [4] An Integrated Deep Neural Network Approach for Large-Scale Water Quality Time Series Prediction
    Dong, QuanXi
    Lin, YongZhe
    Bi, Jing
    Yuan, Haitao
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3537 - 3542
  • [5] Large-scale water quality prediction with integrated deep neural network
    Bi, Jing
    Lin, Yongze
    Dong, Quanxi
    Yuan, Haitao
    Zhou, MengChu
    INFORMATION SCIENCES, 2021, 571 (571) : 191 - 205
  • [6] Large-Scale Water Quality Prediction With Deep Decomposition Architecture and Auto-Correlation
    Bi, Jing
    Yuan, Mingxing
    Yuan, Haitao
    Qiao, Junfei
    Zhang, Jia
    Zhou, Mengchu
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025,
  • [7] Interpretable deep learning for consistent large-scale urban population estimation using Earth observation data
    Doda, Sugandha
    Kahl, Matthias
    Ouan, Kim
    Obadic, Ivica
    Wang, Yuanyuan
    Taubenboeck, Hannes
    Zhu, Xiao Xiang
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 128
  • [8] Large-Scale Transportation Network Congestion Evolution Prediction Using Deep Learning Theory
    Ma, Xiaolei
    Yu, Haiyang
    Wang, Yunpeng
    Wang, Yinhai
    PLOS ONE, 2015, 10 (03):
  • [9] Effective interpretable learning for large-scale categorical data
    Zhang, Yishuo
    Zaidi, Nayyar
    Zhou, Jiahui
    Wang, Tao
    Li, Gang
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 2223 - 2251
  • [10] A Network Traffic Flow Prediction with Deep Learning Approach for Large-scale Metropolitan Area Network
    Wang, Weitao
    Bai, Yuebin
    Yu, Chao
    Gu, Yuhao
    Feng, Peng
    Wang, Xiaojing
    Wang, Rui
    NOMS 2018 - 2018 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2018,