Two-stage meta-ensembling machine learning model for enhanced water quality forecasting

被引:3
|
作者
Heydari, Sepideh [1 ]
Nikoo, Mohammad Reza [2 ]
Mohammadi, Ali [3 ]
Barzegar, Rahim [4 ]
机构
[1] Univ Tehran, Fac Environm Engn, Dept Environm Engn, Tehran, Iran
[2] Sultan Qaboos Univ, Dept Civil & Architectural Engn, Muscat, Oman
[3] Sharif Univ Technol, Dept Ind Engn, Tehran, Iran
[4] Univ Quebec Abitibi Temiscamingue UQAT, Res Inst Mines & Environm RIME, Groundwater Res Grp GRES, Amos, PQ, Canada
关键词
Water quality forecasting; Machine learning; Multi-objective optimization; Genetic algorithm; Grey Wolf Optimizer; Chlorophyll-a and Dissolved oxygen; PREDICTION; IMPLEMENTATION; DECOMPOSITION; PERFORMANCE; STREAMFLOW; RESOURCES; SYSTEM;
D O I
10.1016/j.jhydrol.2024.131767
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Accurate short-term forecasting of water quality variables (WQVs) such as dissolved oxygen (DO) and chlorophyll-a (Chl-a) is crucial for the effective management of aquatic resources. This study introduces a robust two-stage optimization-ensembling framework that integrates the Grey Wolf Optimizer (GWO) and the Nondominated Sorting Genetic Algorithm II (NSGA-II) to enhance the forecasting capabilities of machine learning (ML) models. Focusing on Small Prespa Lake, Greece, we implemented an array of diverse ML techniques, including eXtreme Gradient Boosting (XGB), Gradient Boosting Regressor (GBR), Light Gradient-Boosting Machine (LightGBM), and Multilayer Perceptron (MLP). These models were fine-tuned using GWO to optimize their performance over critical WQVs predicted six hours in advance. Our methodology employed rigorous data preprocessing techniques, including lag time feature engineering and principal component analysis (PCA), to handle the high dimensionality of the dataset. Optimal lag times ranging from 6 to 24 hour were evaluated, with the 24-hour lag proving to be the most effective in utilizing historical data to enhance forecasting accuracy. The GWO not only facilitated hyperparameter tuning but also demonstrated a notable improvement (7.6%) in the Kling-Gupta Efficiency (KGE) over conventional randomized search methods. Subsequently, the NSGA-II was utilized for multi-objective optimization, constructing powerful model ensembles that outperformed the individual GWO-optimized models by up to a 7% in KGE. In comparision to a standard genetic algorithm-based ensemble, the NSGA-II ensemble demonstrated superior effectiveness in balancing solution quality. This innovative approach not only establishes a new benchmark in water quality forecasting but also contributes substantially to proactive environmental monitoring and management strategies.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Two-stage grey support vector machine prediction model
    Zhou, Huaping
    Yuan, Yue
    Computer Modelling and New Technologies, 2014, 18 (12): : 372 - 378
  • [42] A Two-Stage Approach for Flight Departure Delay Forecasting Using Ensemble Learning
    Guan, Feng
    Hao, Mengyan
    Guo, Zhen
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 209 - 220
  • [43] Two-Stage Hybrid Extreme Learning Machine for Sequential Imbalanced Data
    Mao, Wentao
    Wang, Jinwan
    He, Ling
    Tian, Yangyang
    PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 423 - 433
  • [44] Typhoon flood forecasting using integrated two-stage Support Vector Machine approach
    Lin, Gwo-Fong
    Chou, Yang-Ching
    Wu, Ming-Chang
    JOURNAL OF HYDROLOGY, 2013, 486 : 334 - 342
  • [45] Two-stage extreme learning machine for high-dimensional data
    Peng Liu
    Yihua Huang
    Lei Meng
    Siyuan Gong
    Guopeng Zhang
    International Journal of Machine Learning and Cybernetics, 2016, 7 : 765 - 772
  • [46] Prediction of lattice thermal conductivity with two-stage interpretable machine learning
    Hu, Jinlong
    Zuo, Yuting
    Hao, Yuzhou
    Shu, Guoyu
    Wang, Yang
    Feng, Minxuan
    Li, Xuejie
    Wang, Xiaoying
    Sun, Jun
    Ding, Xiangdong
    Gao, Zhibin
    Zhu, Guimei
    Li, Baowen
    CHINESE PHYSICS B, 2023, 32 (04)
  • [47] Two-Stage Electricity Demand Modeling Using Machine Learning Algorithms
    Gajowniczek, Krzysztof
    Zabkowski, Tomasz
    ENERGIES, 2017, 10 (10)
  • [48] Predicting Stock Price Using Two-Stage Machine Learning Techniques
    Jun Zhang
    Lan Li
    Wei Chen
    Computational Economics, 2021, 57 : 1237 - 1261
  • [49] Machine Learning for K-Adaptability in Two-Stage Robust Optimization
    Julien, Esther
    Postek, Krzysztof
    Birbil, S. llker
    INFORMS JOURNAL ON COMPUTING, 2024,
  • [50] Two-stage pedestrian detection based on multiple features and machine learning
    Chong, Y.-W. (apollobest@126.com), 1600, Science Press (38):