Predicting few disinfection byproducts in the water distribution systems using machine learning models

被引:0
|
作者
Shakhawat Chowdhury [1 ]
Karim Asif Sattar [4 ]
Syed Masiur Rahman [2 ]
机构
[1] King Fahd University of Petroleum & Minerals,Department of Civil and Environmental Engineering
[2] Research Engineer I,undefined
[3] Interdisciplinary Research Center for Smart Mobility and Logistics. King Fahd University of Petroleum & Minerals,undefined
[4] Research Engineer I,undefined
[5] Applied Research Center for Environment & Marine Studies,undefined
[6] Research Institute,undefined
[7] King Fahd University of Petroleum & Minerals,undefined
[8] IRC CBM,undefined
[9] King Fahd University of Petroleum & Minerals,undefined
关键词
Machine learning models; Drinking water; Water distribution system; Disinfection byproducts; Model training and testing; Risk reduction;
D O I
10.1007/s11356-025-35933-3
中图分类号
学科分类号
摘要
Concerns regarding disinfection byproducts (DBPs) in drinking water persist, with measurements in water treatment plants (WTPs) being relatively easier than those in water distribution systems (WDSs) due to accessibility challenges, especially during adverse weather conditions. Machine learning (ML) models offer improved predictions of DBPs in WDSs. This study developed multiple ML models to predict Trihalomethanes (THMs), Haloacetic Acids (HAAs), Dichloroacetonitrile (DCAN), and N-nitrosodimethylamine (NDMA) in WDSs using data collected over 13 years (2008–2020) from 113 water supply systems (WSS) in Ontario. Data were collected tri-monthly (four times/year) following Ontario's regulatory requirements. Four common ML models—linear regressor (LR), random forest regressor (RFR), support vector regressor (SVR), and artificial neural networks with multiple folds cross-validation (ANN-MV) and single fold validation (ANN-SV)—were trained and tested using different datasets. R2 values for training datasets of THMs, HAAs, DCAN, and NDMA models ranged from 0.533 to 0.976, 0.560 to 0.980, 0.602 to 0.993, and 0.449 to 0.858, respectively. For testing datasets, R2 ranged from 0.517 to 0.939, 0.437 to 0.945, 0.565 to 0.973, and 0.517 to 0.718, respectively. Among THMs, HAAs, and DCAN, ANN-SV models were identified as the best, followed by the RFR model, whereas for NDMA, SVR was the superior model, followed by the LR model. Some models reliably predicted DBPs, suggesting they could replace costly sampling and experimental analysis for DBPs in the WDSs, thereby enhancing DBPs control in WDSs and reducing human exposure and associated risks.
引用
收藏
页码:3776 / 3794
页数:18
相关论文
共 50 条
  • [21] Predicting hydrolysis kinetics for multiple types of halogenated disinfection byproducts via QSAR models
    Wang, Lei
    Chen, Baiyang
    Zhang, Tian
    CHEMICAL ENGINEERING JOURNAL, 2018, 342 : 372 - 385
  • [22] Intrusion Detection in Water Distribution Systems using Machine Learning Techniques: A Survey
    Mabunda, Hlayisani D.
    Ramotsoela, Daniel T.
    Abu-Mahfouz, Adnan M.
    2022 IEEE 31ST INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2022, : 418 - 423
  • [23] Suspect and Nontarget Screening of Coexisting Emerging Contaminants and Aromatic Halogenated Disinfection Byproducts in Drinking Water Distribution Systems
    Gao, Quan
    Wang, Zhenyu
    Long, Wenqing
    Huang, Qiuyun
    Zhang, Jinna
    Zhang, Jin
    Hua, Pei
    Ying, Guang-Guo
    ACS ES&T WATER, 2024, 4 (08): : 3380 - 3390
  • [24] Impacts of bacteria and corrosion on removal of natural organic matter and disinfection byproducts in different drinking water distribution systems
    Wang, Haibo
    Zhu, Ying
    Hu, Chun
    INTERNATIONAL BIODETERIORATION & BIODEGRADATION, 2017, 117 : 52 - 59
  • [25] Water distribution pipe lifespans: Predicting when to repair the pipes in municipal water distribution networks using machine learning techniques
    Farajzadeh, Nacer
    Sadeghzadeh, Nima
    Jokar, Nastaran
    PLOS WATER, 2024, 3 (01):
  • [26] Using Machine Learning Models for Predicting the Water Quality Index in the La Buong River, Vietnam
    Khoi, Dao Nguyen
    Quan, Nguyen Trong
    Linh, Do Quang
    Nhi, Pham Thi Thao
    Thuy, Nguyen Thi Diem
    WATER, 2022, 14 (10)
  • [27] Predicting the formation of disinfection by-products using multiple linear and machine learning regression
    Peng, Fangyuan
    Lu, Yi
    Wang, Yingyang
    Yang, Long
    Yang, Zhaoguang
    Li, Haipu
    JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2023, 11 (05):
  • [28] Comparison of ANN models for predicting water quality in distribution systems
    D'Souza, Celia D.
    Kumar, M. S. Mohan
    JOURNAL AMERICAN WATER WORKS ASSOCIATION, 2010, 102 (07): : 92 - +
  • [29] Comparison of ANN models for predicting water quality in distribution systems
    D'Souza C.D.
    Kumar M.S.M.
    Journal / American Water Works Association, 2010, 102 (07): : 92 - 106
  • [30] Predicting the Occurrence of Metabolic Syndrome Using Machine Learning Models
    Trigka, Maria
    Dritsas, Elias
    Lahoz-Beltra, Rafael
    Zhang, Yudong
    COMPUTATION, 2023, 11 (09)