Development of models predicting biodegradation rate rating with multiple linear regression and support vector machine algorithms

Cited by: 45
Authors
Tang, Weihao [1]
Li, Yanying [1]
Yu, Yang [2]
Wang, Zhongyu [1]
Xu, Tong [1]
Chen, Jingwen [1]
Lin, Jun [2]
Li, Xuehua [1]
Affiliations
[1] Dalian Univ Technol, Sch Environm Sci & Technol, Key Lab Ind Ecol & Environm Engn MOE, Dalian 116024, Peoples R China
[2] Minist Ecol & Environm MEE, Solid Waste & Chem Management Ctr, Beijing 100029, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Biodegradability; Quantitative structure-activity relationship; Multiple linear regression; Support vector machine; Molecular structure descriptors; AEROBIC BIODEGRADATION; READY BIODEGRADABILITY; BIOACCUMULATIVE ORGANICS; CHEMICALS; PERSISTENT; QSAR; POLLUTANTS
DOI
10.1016/j.chemosphere.2020.126666
Chinese Library Classification
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Biodegradation is a significant process for removing organic chemicals from water, soil and sediment environments, and biodegradability is therefore critical for evaluating the environmental persistence of organic chemicals. In this study, based on a dataset of 171 compounds, four quantitative structure-activity relationship (QSAR) models were developed for predicting primary and ultimate biodegradation rate ratings with multiple linear regression (MLR) and support vector machine (SVM) algorithms. Two MLR models were built on the subset of compounds with carbon atom number <= 9, and two SVM models were built on the subset with carbon atom number > 9. In the MLR models, n_ArX (number of X on aromatic ring) is the most important descriptor governing primary and ultimate biodegradation of organic chemicals. For the SVM models, the determination coefficient (R^2), leave-one-out cross-validated coefficient (Q^2_LOO) and external validation coefficient (Q^2_ext) values are all over 0.9, indicating that the SVM models have satisfactory goodness-of-fit, robustness and external predictive ability. The applicability domains of these models were visualized with Williams plots. The developed models can be used as effective tools to predict the biodegradability of organic chemicals. © 2020 Elsevier Ltd. All rights reserved.
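The validation statistics named in the abstract (R^2, Q^2_LOO, Q^2_ext) and the leverage values used in a Williams plot can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the data are synthetic, the two descriptor columns are hypothetical stand-ins for molecular descriptors, the helper names (fit_mlr, q2_loo, q2_ext, leverages) are invented here, Q^2_ext follows one common definition (referenced to the training-set mean), and the warning leverage h* = 3(p + 1)/n is a widely used convention that the abstract does not state. It is not the authors' code, and it covers only the MLR case; the same statistics can be computed for any regressor, including an SVM.

```python
import numpy as np

def fit_mlr(X, y):
    """Ordinary least squares with an intercept; returns the coefficients."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_mlr(coef, X):
    """Predict with coefficients produced by fit_mlr."""
    return np.column_stack([np.ones(len(X)), X]) @ coef

def r2(y, y_hat):
    """Determination coefficient on the data used for fitting."""
    return 1.0 - np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)

def q2_loo(X, y):
    """Leave-one-out Q^2: refit without each compound, then predict it."""
    n = len(y)
    press = 0.0
    for i in range(n):
        keep = np.arange(n) != i
        coef = fit_mlr(X[keep], y[keep])
        press += (y[i] - predict_mlr(coef, X[i:i + 1])[0]) ** 2
    return 1.0 - press / np.sum((y - y.mean()) ** 2)

def q2_ext(y_ext, y_ext_hat, y_train_mean):
    """External Q^2 referenced to the training-set mean (one common definition)."""
    return 1.0 - np.sum((y_ext - y_ext_hat) ** 2) / np.sum((y_ext - y_train_mean) ** 2)

def leverages(X_train, X_query):
    """Leverage h_i = x_i^T (X^T X)^-1 x_i, the x-axis of a Williams plot."""
    A = np.column_stack([np.ones(len(X_train)), X_train])
    Q = np.column_stack([np.ones(len(X_query)), X_query])
    G = np.linalg.pinv(A.T @ A)
    return np.einsum("ij,jk,ik->i", Q, G, Q)

# Synthetic stand-in data: 40 "compounds", 2 hypothetical descriptors.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 2))
y = 3.0 - 0.8 * X[:, 0] + 0.3 * X[:, 1] + rng.normal(scale=0.1, size=40)
X_tr, y_tr, X_ex, y_ex = X[:30], y[:30], X[30:], y[30:]

coef = fit_mlr(X_tr, y_tr)
r2_train = r2(y_tr, predict_mlr(coef, X_tr))
q2_loo_train = q2_loo(X_tr, y_tr)
q2_ext_val = q2_ext(y_ex, predict_mlr(coef, X_ex), y_tr.mean())

h_train = leverages(X_tr, X_tr)
h_star = 3 * (X_tr.shape[1] + 1) / len(X_tr)  # assumed warning leverage

print(f"R2={r2_train:.3f}  Q2_LOO={q2_loo_train:.3f}  Q2_ext={q2_ext_val:.3f}")
print("training compounds above h*:", int(np.sum(h_train > h_star)))
```

In a Williams plot, these leverages are plotted against standardized residuals; compounds with leverage above h* or with large standardized residuals are usually treated as lying outside the applicability domain.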
Pages: 7