Water quality prediction and classification based on principal component regression and gradient boosting classifier approach

被引:76
|
作者
Khan, Md. Saikat Islam [1 ,4 ]
Islam, Nazrul [2 ,4 ,5 ]
Uddin, Jia [3 ]
Islam, Sifatul [1 ,4 ]
Nasir, Mostofa Kamal [1 ,4 ]
机构
[1] Dept Comp Sci & Engn, Santosh 1902, Tangail, Bangladesh
[2] Dept Informat & Commun & Technol, Santosh 1902, Tangail, Bangladesh
[3] Woosong Univ, Endicott Coll, Dept Technol Studies, Daejeon, South Korea
[4] Mawlana Bhashani Sci & Technol Univ, Santosh 1902, Tangail, Bangladesh
[5] Mawlana Bhashani Sci & Technol Univ, Dept Informat & Commun & Technol, Santosh 1902, Tangail, Bangladesh
关键词
Water quality index; Principal component regression; Classification algorithm; Boxplot analysis; MULTIPLE LINEAR-REGRESSION; GROUNDWATER QUALITY; INDEX; MODEL; DISTRICT; NETWORK;
D O I
10.1016/j.jksuci.2021.06.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model utilizing the principal component regression tech-nique. Firstly, the water quality index (WQI) is calculated using the weighted arithmetic index method. Secondly, the principal component analysis (PCA) is applied to the dataset, and the most dominant WQI parameters have been extracted. Thirdly, to predict the WQI, different regression algorithms are used to the PCA output. Finally, the Gradient Boosting Classifier is utilized to classify the water quality status. The proposed system is experimentally evaluated on a Gulshan Lake-related dataset. The results demonstrate 95% prediction accuracy for the principal component regression method and 100% classification accuracy for the Gradient Boosting Classifier method, which show credible performance compared with the state -of-art models. (c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:4773 / 4781
页数:9
相关论文
共 50 条
  • [31] Prediction of Blast-Induced Ground Vibration Using Principal Component Analysis–Based Classification and Logarithmic Regression Technique
    Vivek K. Himanshu
    A. K. Mishra
    Ashish K. Vishwakarma
    M. P. Roy
    P. K. Singh
    Mining, Metallurgy & Exploration, 2022, 39 : 2065 - 2074
  • [32] Prediction of demand for iron ores in China based on principal component regression analysis
    Niu, Jing-Kao
    Beijing Keji Daxue Xuebao/Journal of University of Science and Technology Beijing, 2011, 33 (10): : 1177 - 1181
  • [33] Application of Principal Component Regression and Partial Least Squares Regression in Ultraviolet Spectrum Water Quality Detection
    Li, Jiangtong
    Luo, Yongdao
    Dai, Honglin
    2017 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTOELECTRONIC IMAGING/SPECTROSCOPY AND SIGNAL PROCESSING TECHNOLOGY, 2017, 10620
  • [34] A combined water quality classification model based on kernel principal component analysis and machine learning techniques
    Dilmi, Smail
    DESALINATION AND WATER TREATMENT, 2022, 279 : 61 - 67
  • [35] Evaluation of principal component selection methods to form a global prediction model by principal component regression
    Xie, YL
    Kalivas, JH
    ANALYTICA CHIMICA ACTA, 1997, 348 (1-3) : 19 - 27
  • [36] River water quality assessment based on principal component analysis
    Li, Guiping
    Yu, Zhongbo
    HYDROLOGICAL CYCLE AND WATER RESOURCES SUSTAINABILITY IN CHANGING ENVIRONMENTS, 2011, 350 : 430 - 435
  • [37] Prediction of higher heating value of coal based on gradient boosting regression tree model
    Xu, Na
    Wang, Zhiwei
    Dai, Yuchen
    Li, Qiang
    Zhu, Wei
    Wang, Ru
    Finkelman, Robert B.
    INTERNATIONAL JOURNAL OF COAL GEOLOGY, 2023, 274
  • [38] Prediction of phthalate in dust in children's bedroom based on gradient boosting regression tree
    Sun, Chanjuan
    Wang, Qinghao
    Zhang, Jialing
    Liu, Wei
    Zhang, Yinping
    Li, Baizhan
    Zhao, Zhuohui
    Deng, Qihong
    Zhang, Xin
    Qian, Hua
    Zou, Zhijun
    Yang, Xu
    Sun, Yuexia
    Chen, Huang
    BUILDING AND ENVIRONMENT, 2024, 251
  • [39] Iceberg draft prediction using gradient boosting regression algorithm
    Azimi H.
    Shiri H.
    Mahdianpari M.
    Marine Systems & Ocean Technology, 2023, 18 (3-4) : 151 - 166
  • [40] Support vector classifier based on principal component analysis
    Zheng Chunhong
    Jiao Licheng
    Li Yongzhao
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2008, 19 (01) : 184 - 190