Water quality prediction and classification based on principal component regression and gradient boosting classifier approach

被引:76
|
作者
Khan, Md. Saikat Islam [1 ,4 ]
Islam, Nazrul [2 ,4 ,5 ]
Uddin, Jia [3 ]
Islam, Sifatul [1 ,4 ]
Nasir, Mostofa Kamal [1 ,4 ]
机构
[1] Dept Comp Sci & Engn, Santosh 1902, Tangail, Bangladesh
[2] Dept Informat & Commun & Technol, Santosh 1902, Tangail, Bangladesh
[3] Woosong Univ, Endicott Coll, Dept Technol Studies, Daejeon, South Korea
[4] Mawlana Bhashani Sci & Technol Univ, Santosh 1902, Tangail, Bangladesh
[5] Mawlana Bhashani Sci & Technol Univ, Dept Informat & Commun & Technol, Santosh 1902, Tangail, Bangladesh
关键词
Water quality index; Principal component regression; Classification algorithm; Boxplot analysis; MULTIPLE LINEAR-REGRESSION; GROUNDWATER QUALITY; INDEX; MODEL; DISTRICT; NETWORK;
D O I
10.1016/j.jksuci.2021.06.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model utilizing the principal component regression tech-nique. Firstly, the water quality index (WQI) is calculated using the weighted arithmetic index method. Secondly, the principal component analysis (PCA) is applied to the dataset, and the most dominant WQI parameters have been extracted. Thirdly, to predict the WQI, different regression algorithms are used to the PCA output. Finally, the Gradient Boosting Classifier is utilized to classify the water quality status. The proposed system is experimentally evaluated on a Gulshan Lake-related dataset. The results demonstrate 95% prediction accuracy for the principal component regression method and 100% classification accuracy for the Gradient Boosting Classifier method, which show credible performance compared with the state -of-art models. (c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:4773 / 4781
页数:9
相关论文
共 50 条
  • [21] Prediction and classification of solar photovoltaic power generation using extreme gradient boosting regression model
    Rinesh, S.
    Deepa, S.
    Nandan, R. T.
    Sachin, R. S.
    Thamil, S., V
    Akash, R.
    Arun, M.
    Prajitha, C.
    Kumar, A. P. Senthil
    INTERNATIONAL JOURNAL OF LOW-CARBON TECHNOLOGIES, 2024, 19 : 2420 - 2430
  • [22] Prediction-based variable selection for component-wise gradient boosting
    Potts, Sophie
    Bergherr, Elisabeth
    Reinke, Constantin
    Griesbach, Colin
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024, 20 (01): : 293 - 314
  • [23] Classification by Principal Component Regression in the Real and Hypercomplex Domains
    El-Melegy, Moumen T.
    Kamal, Aliaa T.
    Hussain, Khaled F.
    El-Hawary, H. M.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 10099 - 10108
  • [24] Classification by Principal Component Regression in the Real and Hypercomplex Domains
    Moumen T. El-Melegy
    Aliaa T. Kamal
    Khaled F. Hussain
    H. M. El-Hawary
    Arabian Journal for Science and Engineering, 2023, 48 : 10099 - 10108
  • [25] A principal component approach to dynamic regression models
    delMoral, MJ
    Valderrama, MJ
    INTERNATIONAL JOURNAL OF FORECASTING, 1997, 13 (02) : 237 - 244
  • [26] A recommendation system based on eXtreme Gradient Boosting Classifier
    Xu, Longteng
    Liu, Jiwei
    Gu, Yu
    PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC), 2018,
  • [27] Principal curve classifier - A nonlinear approach to pattern classification
    Chang, KY
    Ghosh, J
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 695 - 700
  • [28] Prediction of protein-protein interaction sites through eXtreme gradient boosting with kernel principal component analysis
    Wang, Xue
    Zhang, Yaqun
    Yu, Bin
    Salhi, Adil
    Chen, Ruixin
    Wang, Lin
    Liu, Zengfeng
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 134 (134)
  • [29] Spectroscopic prediction of nonlinear properties by principal component regression
    LitaniBarzilai, I
    Schechter, I
    ANALYTICA CHIMICA ACTA, 1997, 348 (1-3) : 345 - 356
  • [30] Minimax rate in prediction for functional principal component regression
    Yang, Guangren
    Lin, Hongmei
    Lian, Heng
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (05) : 1240 - 1249