Application of machine learning and deep learning methods for hydrated electron rate constant prediction

被引:8
|
作者
Zheng, Shanshan [1 ]
Guo, Wanqian [1 ]
Li, Chao [2 ]
Sun, Yongbin [3 ]
Zhao, Qi [1 ]
Lu, Hao [1 ]
Si, Qishi [1 ]
Wang, Huazhe [1 ]
机构
[1] Harbin Inst Technol, State Key Lab Urban Water Resource & Environm, Harbin 150090, Peoples R China
[2] Northeast Normal Univ, Sch Environm, State Environm Protect Key Lab Wetland Ecol & Vege, 2555 Jingyue St, Changchun 130117, Jilin, Peoples R China
[3] Shandong First Med Univ & Shandong Acad Med Sci, Sch Chem & Pharmaceut Engn, Tai An 271016, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Rate constant prediction; Machine learning; Deep learning; Hydrated electron(e(aq)(-) ); SHAP; Grad-CAM; REDUCTIVE DEFLUORINATION; PERFLUOROOCTANOIC ACID; MODELS; DEGRADATION;
D O I
10.1016/j.envres.2023.115996
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurately determining the second-order rate constant with e(aq)(-) (k(eaq-)) for organic compounds (OCs) is crucial in the e(aq)(-) induced advanced reduction processes (ARPs). In this study, we collected 867 k(eaq-) values at different pHs from peer-reviewed publications and applied machine learning (ML) algorithm-XGBoost and deep learning (DL) algorithm-convolutional neural network (CNN) to predict k(eaq-). Our results demonstrated that the CNN model with transfer learning and data augmentation (CNN-TL&DA) greatly improved the prediction results and over-came over-fitting. Furthermore, we compared the ML/DL modeling methods and found that the CNN-TL&DA, which combined molecular images (MI), achieved the best overall performance (R-test(2) = 0.896, RMSEtest = 0.362, MAE(test) = 0.261) when compared to the XGBoost algorithm combined with Mordred descriptors (MD) (0.692, RMSEtest = 0.622, MAE(test) = 0.399) and Morgan fingerprint (MF) (R-test(2) = 0.512, RMSEtest = 0.783, MAE(test )= 0.520). Moreover, the interpretation of the MD-XGBoost and MF-XGBoost models using the SHAP method revealed the significance of MDs (e.g., molecular size, branching, electron distribution, polarizability, and bond types), MFs (e.g, aromatic carbon, carbonyl oxygen, nitrogen, and halogen) and environmental conditions (e.g., pH) that effectively influence the k(eaq-) prediction. The interpretation of the 2D molecular image-CNN (MI-CNN) models using the Grad-CAM method showed that they correctly identified key functional groups such as -CN, -NO2, and -X functional groups that can increase the k(eaq-) values. Additionally, almost all electron-withdrawing groups and a small part of electron-donating groups for the MI-CNN model can be highlighted for estimating k(eaq-). Overall, our results suggest that the CNN approach has smaller errors when compared to ML algorithms, making it a promising candidate for predicting other rate constants.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Application of machine learning and deep learning for the prediction of HIV/AIDS
    Alehegn, Minyechil
    HIV & AIDS REVIEW, 2022, 21 (01): : 17 - 23
  • [2] Machine Learning Methods for Smartphone Application Prediction
    Lu, Enze
    Zhang, Long
    2022 IEEE 31ST INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2022, : 1174 - 1179
  • [3] Machine learning and deep learning methods that use omics data for metastasis prediction
    Albaradei, Somayah
    Thafar, Maha
    Alsaedi, Asim
    Van Neste, Christophe
    Gojobori, Takashi
    Essack, Magbubah
    Gao, Xin
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 5008 - 5018
  • [4] News-based Machine Learning and Deep Learning Methods for Stock Prediction
    Guo, Junjie
    Tuckfield, Bradford
    4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2020), 2020, 1642
  • [5] Monthly streamflow prediction and performance comparison of machine learning and deep learning methods
    Ayana, Omer
    Kanbak, Deniz Furkan
    Keles, Muemine Kaya
    Turhan, Evren
    ACTA GEOPHYSICA, 2023, 71 (06) : 2905 - 2922
  • [6] Monthly streamflow prediction and performance comparison of machine learning and deep learning methods
    Ömer Ayana
    Deniz Furkan Kanbak
    Mümine Kaya Keleş
    Evren Turhan
    Acta Geophysica, 2023, 71 : 2905 - 2922
  • [7] An application of learning machine methods in prediction of wear rate of wear resistant casting parts
    Slavkovic, Radomir
    Jugovic, Zvonimir
    Dragicevic, Snezana
    Jovicic, Aleksandar
    Slavkovic, Vladimir
    COMPUTERS & INDUSTRIAL ENGINEERING, 2013, 64 (03) : 850 - 857
  • [8] Machine Learning and Deep Learning for Throughput Prediction
    Lee, Dongwon
    Lee, Joohyun
    12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2021), 2021, : 452 - 454
  • [9] Machine Learning and Deep Learning Methods for Cybersecurity
    Xin, Yang
    Kong, Lingshuang
    Liu, Zhi
    Chen, Yuling
    Li, Yanmiao
    Zhu, Hongliang
    Gao, Mingcheng
    Hou, Haixia
    Wang, Chunhua
    IEEE ACCESS, 2018, 6 : 35365 - 35381
  • [10] Deep Learning Versus Traditional Machine Learning Methods for Aggregated Energy Demand Prediction
    Paterakis, Nikolaos G.
    Mocanu, Elena
    Gibescu, Madeleine
    Stappers, Bart
    van Alst, Walter
    2017 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE EUROPE (ISGT-EUROPE), 2017,