Water quality prediction and classification based on principal component regression and gradient boosting classifier approach

被引:76
|
作者
Khan, Md. Saikat Islam [1 ,4 ]
Islam, Nazrul [2 ,4 ,5 ]
Uddin, Jia [3 ]
Islam, Sifatul [1 ,4 ]
Nasir, Mostofa Kamal [1 ,4 ]
机构
[1] Dept Comp Sci & Engn, Santosh 1902, Tangail, Bangladesh
[2] Dept Informat & Commun & Technol, Santosh 1902, Tangail, Bangladesh
[3] Woosong Univ, Endicott Coll, Dept Technol Studies, Daejeon, South Korea
[4] Mawlana Bhashani Sci & Technol Univ, Santosh 1902, Tangail, Bangladesh
[5] Mawlana Bhashani Sci & Technol Univ, Dept Informat & Commun & Technol, Santosh 1902, Tangail, Bangladesh
关键词
Water quality index; Principal component regression; Classification algorithm; Boxplot analysis; MULTIPLE LINEAR-REGRESSION; GROUNDWATER QUALITY; INDEX; MODEL; DISTRICT; NETWORK;
D O I
10.1016/j.jksuci.2021.06.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating water quality has been one of the significant challenges faced by the world in recent decades. This paper presents a water quality prediction model utilizing the principal component regression tech-nique. Firstly, the water quality index (WQI) is calculated using the weighted arithmetic index method. Secondly, the principal component analysis (PCA) is applied to the dataset, and the most dominant WQI parameters have been extracted. Thirdly, to predict the WQI, different regression algorithms are used to the PCA output. Finally, the Gradient Boosting Classifier is utilized to classify the water quality status. The proposed system is experimentally evaluated on a Gulshan Lake-related dataset. The results demonstrate 95% prediction accuracy for the principal component regression method and 100% classification accuracy for the Gradient Boosting Classifier method, which show credible performance compared with the state -of-art models. (c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:4773 / 4781
页数:9
相关论文
共 50 条
  • [41] Time series regression and prediction based on boosting regression
    Gu, Wen
    Li, Baifeng
    Niu, Baolong
    Wei, Wei
    Zheng, Zhiming
    PROCEEDINGS OF 2014 IEEE WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS (WARTIA), 2014, : 251 - 254
  • [42] Support vector classifier based on principal component analysis
    Zheng Chunhong
    Journal of Systems Engineering and Electronics, 2008, (01) : 184 - 190
  • [43] Quality Variable Prediction for Dynamic Process Based on Adaptive Principal Component Regression with Selective Integration of Multiple Local Models
    Tian, Ying
    Zhu, Yuting
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (04): : 1193 - 1215
  • [44] PCB PAD DETECTION ALGORITHM BASED ON PRINCIPAL COMPONENT ANALYSIS AND CLASSIFICATION REGRESSION TREE
    Ye, Xuhui
    Tang, Yuxuan
    Zhang, Daode
    Hu, Xinyu
    JOURNAL OF FLOW VISUALIZATION AND IMAGE PROCESSING, 2022, 29 (01) : 89 - 107
  • [45] Epilepsy EEG signals classification based on sparse principal component logistic regression model
    Li, Xi
    Qiao, Yuanhua
    Duan, Lijuan
    Miao, Jun
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2024,
  • [46] Face Expression Recognition Based on Equable Principal Component Analysis and Linear Regression Classification
    Zhu, Yani
    Li, Xiaoxin
    Wu, Guohua
    2016 3RD INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2016, : 876 - 880
  • [47] Cancer classification by kernel principal component self-regression
    Zhang, Bai-ling
    AI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4304 : 719 - +
  • [48] Prediction of Blast-Induced Ground Vibration Using Principal Component Analysis-Based Classification and Logarithmic Regression Technique
    Himanshu, Vivek K.
    Mishra, A. K.
    Vishwakarma, Ashish K.
    Roy, M. P.
    Singh, P. K.
    MINING METALLURGY & EXPLORATION, 2022, 39 (05) : 2065 - 2074
  • [49] Principal component regression approach for QT variability estimation
    Karjalainen, P. A.
    Tarvainen, M. P.
    Laitinen, T.
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 1145 - 1147
  • [50] A weighted principal component regression approach for system identification
    Xiao, XS
    Mukkamala, R
    Cohen, RJ
    PROCEEDINGS OF THE 2003 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING, 2003, : 206 - 209