Toward Design of Internet of Things and Machine Learning-Enabled Frameworks for Analysis and Prediction of Water Quality

Cited by: 8
Authors
Rahu, Mushtaque Ahmed [1]
Chandio, Abdul Fattah [1]
Aurangzeb, Khursheed [2]
Karim, Sarang [3]
Alhussein, Musaed [2]
Anwar, Muhammad Shahid [4]
Affiliations
[1] Quaid e Awam Univ Engn Sci & Technol, Dept Elect Engn, Nawabshah 67450, Pakistan
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, POB 51178, Riyadh 11543, Saudi Arabia
[3] Quaid e Awam Univ Engn Sci & Technol, Dept Telecommun Engn, Nawabshah 67450, Pakistan
[4] Gachon Univ, Dept AI & Software, Seongnam Si 13120, South Korea
Keywords
Data collection; environmental monitoring; Internet of Things (IoT); machine learning; water quality analysis; water quality class (WQC); water quality index (WQI); F-SCORE; INDEX; IOT; TECHNOLOGIES; NETWORKS; MALAYSIA; SYSTEMS; MODEL;
DOI
10.1109/ACCESS.2023.3315649
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The degradation of water quality has become a critical concern worldwide, necessitating innovative approaches for monitoring and predicting water quality. This paper proposes an integrated framework that combines the Internet of Things (IoT) and machine learning paradigms for comprehensive water quality analysis and prediction. The IoT-enabled framework comprises four modules: sensing, coordinator, data processing, and decision. It is equipped with temperature, pH, turbidity, and Total Dissolved Solids (TDS) sensors to collect data from Rohri Canal, SBA, Pakistan. The acquired data is preprocessed and then analyzed by a machine learning-enabled framework that predicts the Water Quality Index (WQI) and Water Quality Class (WQC). Preprocessing consists of data cleaning, normalization using the Z-score technique, correlation analysis, and data splitting. Four regression models, LSTM (Long Short-Term Memory), SVR (Support Vector Regression), MLP (Multilayer Perceptron), and NARNet (Nonlinear Autoregressive Network), are employed to predict the WQI, and four classification models, SVM (Support Vector Machine), XGBoost (eXtreme Gradient Boosting), Decision Trees, and Random Forest, are employed to predict the WQC. The dataset used to evaluate the models is arranged into two sets: Dataset 1, with 600 values for each parameter, and Dataset 2, the complete set of 6000 values for each parameter. This division enables a comparison of the models' performance across dataset sizes. The results indicate that the MLP regression model has strong predictive performance, with the lowest Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE) values and the highest R-squared (0.93). In contrast, the SVR model performs worse, with higher errors and a lower R-squared (0.73). Among the classification algorithms, Random Forest achieves the highest metrics: accuracy (0.91), precision (0.93), recall (0.92), and F1-score (0.91). The models are also observed to perform better on the dataset with fewer values than on the dataset with more values. Moreover, comparisons with existing studies show improved regression performance in this study, with consistently lower errors and higher R-squared values. For classification, the Random Forest model outperforms the other models on accuracy, precision, recall, and F1-score.
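The abstract names Z-score normalization and the evaluation metrics (MAE, MSE, RMSE, R-squared, accuracy, precision, recall, F1-score) without defining them. The following is a minimal sketch of their standard definitions in plain Python, not the authors' implementation. The function names are hypothetical, the Z-score uses the population standard deviation as an assumption, and the classification metrics are computed for a single designated positive class, since the abstract does not state how multi-class WQC results were averaged.

```python
import math
from statistics import mean, pstdev


def z_score(values):
    """Standardize values to zero mean and unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant column carries no spread; map it to zeros.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, RMSE, and R-squared for paired observations."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    y_bar = mean(y_true)
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    # R-squared is undefined when the true values have no variance.
    r2 = 1 - ss_res / ss_tot if ss_tot else float("nan")
    return {"MAE": mae, "MSE": mse, "RMSE": rmse, "R2": r2}


def classification_metrics(y_true, y_pred, positive):
    """Return accuracy plus precision, recall, and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "F1": f1}
```

Under these definitions, lower MAE, MSE, and RMSE and an R-squared closer to 1 indicate a better regression fit, which is the basis for the MLP versus SVR comparison reported above.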
Pages: 101055-101086
Page count: 32
Related papers
50 in total
  • [21] Machine Learning-Enabled Genome Mining and Bioactivity Prediction of Natural Products
    Yuan, Yujie
    Shi, Chengyou
    Zhao, Huimin
    ACS SYNTHETIC BIOLOGY, 2023, 12 (09): 2650-2662
  • [22] Intelligent Prediction and Continuous Monitoring of Water Quality in Aquaculture: Integration of Machine Learning and Internet of Things for Sustainable Management
    Baena-Navarro, Ruben
    Carriazo-Regino, Yulieth
    Torres-Hoyos, Francisco
    Pinedo-Lopez, Jhon
    WATER, 2025, 17 (01)
  • [23] Machine Learning-Enabled Optical Architecture Design of Perovskite Solar Cells
    Li, Zong-Zheng
    Guo, Chaorong
    Lv, Wenlei
    Huang, Peng
    Zhang, Yongyou
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2024, 15 (14): 3835-3842
  • [24] Towards Predictive Water Quality: Synergies Between Machine Learning and Internet of Things
    Zrouri, Amira
    El Farissi, Ilhame
    ADVANCES IN SMART MEDICAL, IOT & ARTIFICIAL INTELLIGENCE, VOL 1, ICSMAI 2024, 2024, 11: 152-159
  • [25] Machine learning-enabled forward prediction and inverse design of 4D-printed active plates
    Sun, Xiaohao
    Yue, Liang
    Yu, Luxia
    Forte, Connor T.
    Armstrong, Connor D.
    Zhou, Kun
    Demoly, Frederic
    Zhao, Ruike Renee
    Qi, H. Jerry
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [26] Machine learning-enabled discovery and design of membrane-active peptides
    Lee, Ernest Y.
    Wong, Gerard C. L.
    Ferguson, Andrew L.
    BIOORGANIC & MEDICINAL CHEMISTRY, 2018, 26 (10): 2708-2718
  • [27] Machine Learning-Enabled Tactile Sensor Design for Dynamic Touch Decoding
    Lu, Yuyao
    Kong, Depeng
    Yang, Geng
    Wang, Ruohan
    Pang, Gaoyang
    Luo, Huayu
    Yang, Huayong
    Xu, Kaichen
    ADVANCED SCIENCE, 2023, 10 (32)
  • [28] Machine Learning-Enabled Prediction of 3D-Printed Microneedle Features
    Sarabi, Misagh Rezapour
    Alseed, M. Munzer
    Karagoz, Ahmet Agah
    Tasoglu, Savas
    BIOSENSORS-BASEL, 2022, 12 (07)
  • [29] Machine Learning-Enabled Nanoscale Phase Prediction in Engineered Poly(Vinylidene Fluoride)
    Babu, Anand
    Abraham, B. Moses
    Naskar, Sudip
    Ranpariya, Spandan
    Mandal, Dipankar
    SMALL, 2024
  • [30] Boosting Vehicle-to-Cloud Communication by Machine Learning-Enabled Context Prediction
    Sliwa, Benjamin
    Falkenberg, Robert
    Liebig, Thomas
    Piatkowski, Nico
    Wietfeld, Christian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (08): 3497-3512