Development of models predicting biodegradation rate rating with multiple linear regression and support vector machine algorithms

被引:45
|
作者
Tang, Weihao [1 ]
Li, Yanying [1 ]
Yu, Yang [2 ]
Wang, Zhongyu [1 ]
Xu, Tong [1 ]
Chen, Jingwen [1 ]
Lin, Jun [2 ]
Li, Xuehua [1 ]
机构
[1] Dalian Univ Technol, Sch Environm Sci & Technol, Key Lab Ind Ecol & Environm Engn MOE, Dalian 116024, Peoples R China
[2] Minist Ecol & Environm MEE, Solid Waste & Chem Management Ctr, Beijing 100029, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Biodegradability; Quantitative structure-activity relationship; Multiple linear regression; Support vector machine; Molecular structure descriptors; AEROBIC BIODEGRADATION; READY BIODEGRADABILITY; BIOACCUMULATIVE ORGANICS; CHEMICALS; PERSISTENT; QSAR; POLLUTANTS;
D O I
10.1016/j.chemosphere.2020.126666
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Biodegradation is a significant process for removing organic chemicals from water, soil and sediment environments, and therefore biodegradability is critical to evaluate the environmental persistence of organic chemicals. In this study, based on a dataset with 171 compounds, four quantitative structure-activity relationship (QSAR) models were developed for predicting primary and ultimate biodegradation rate rating with multiple linear regression (MLR) and support vector machine (SVM) algorithms. Two MLR models were built with a dataset with carbon atom number <= 9, and two SVM models were built with a dataset with carbon atom number >9. In the MLR models, n(ArX) (number of X on aromatic ring) is the most important descriptor governing primary and ultimate biodegradation of organic chemicals. For the SVM models, determination coefficient (R-2) values, cross-validated coefficients (Q(LOO)(2)) and external validation coefficient (Q(ext)(2)) values are over 0.9, indicating the SVM models have satisfactory goodness-of-fit, robustness and external predictive abilities. The applicability domains of these models were visualized by the Williams plot. The developed models can be used as effective tools to predict biodegradability of organic chemicals. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Predicting EPBM advance rate performance using support vector regression modeling
    Mokhtari, Soroush
    Mooney, Michael A.
    TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2020, 104
  • [32] Hidden Logistic Linear Regression for Support Vector Machine based Phone Verification
    Li, Bo
    Sim, Khe Chai
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2622 - 2625
  • [33] Iterated time series prediction with multiple support vector regression models
    Zhang, Li
    Zhou, Wei-Da
    Chang, Pei-Chann
    Yang, Ji-Wen
    Li, Fan-Zhang
    NEUROCOMPUTING, 2013, 99 : 411 - 422
  • [34] Improved support vector regression models for predicting rock mass parameters using tunnel boring machine driving data
    Liu, Bin
    Wang, Ruirui
    Guan, Zengda
    Li, Jianbin
    Xu, Zhenhao
    Guo, Xu
    Wang, Yaxu
    TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2019, 91
  • [35] Robust and Distributionally Robust Optimization Models for Linear Support Vector Machine
    Faccini, Daniel
    Maggioni, Francesca
    Potra, Florian A.
    COMPUTERS & OPERATIONS RESEARCH, 2022, 147
  • [36] Robust and Distributionally Robust Optimization Models for Linear Support Vector Machine
    Faccini, Daniel
    Maggioni, Francesca
    Potra, Florian A.
    Computers and Operations Research, 2022, 147
  • [37] A Comparative Study of Slope Failure Prediction Using Logistic Regression, Support Vector Machine and Least Square Support Vector Machine Models
    Zhou, Lim Yi
    Shan, Fam Pei
    Shimizu, Kunio
    Imoto, Tomoaki
    Lateh, Habibah
    Peng, Koay Swee
    PROCEEDINGS OF THE 24TH NATIONAL SYMPOSIUM ON MATHEMATICAL SCIENCES (SKSM24): MATHEMATICAL SCIENCES EXPLORATION FOR THE UNIVERSAL PRESERVATION, 2017, 1870
  • [38] Age Estimation using Active Appearance Models and Support Vector Machine Regression
    Luu, Khoa
    Ricanek, Karl, Jr.
    Bui, Tien D.
    Suen, Ching Y.
    2009 IEEE 3RD INTERNATIONAL CONFERENCE ON BIOMETRICS: THEORY, APPLICATIONS AND SYSTEMS, 2009, : 314 - +
  • [39] Identification of ARX Hammerstein Models Based on Twin Support Vector Machine Regression
    Aldhaifallah, Mujahed
    Nisar, K. S.
    2016 13TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2016, : 571 - 578
  • [40] Predicting Factor of Safety of Slope Using an Improved Support Vector Machine Regression Model
    Lei, Daxing
    Zhang, Yaoping
    Lu, Zhigang
    Lin, Hang
    Jiang, Zheyuan
    MATHEMATICS, 2024, 12 (20)