Multi-Model Machine Learning Approach Accurately Predicts Lake Dissolved Oxygen With Multiple Environmental Inputs

被引:1
|
作者
Lin, Shuqi [1 ,2 ,3 ]
Pierson, Donald C. [2 ,3 ]
Ladwig, Robert [4 ]
Kraemer, Benjamin M. [5 ]
Hu, Fenjuan R. S. [6 ]
机构
[1] Canada Ctr Inland Waters, Environm & Climate Change Canada, Burlington, ON, Canada
[2] Uppsala Univ, Erken Lab, Uppsala, Sweden
[3] Uppsala Univ, Limnol Dept, Uppsala, Sweden
[4] Univ Wisconsin Madison, Ctr Limnol, Madison, WI USA
[5] IGB Leibniz Inst Freshwater Ecol & Inland Fisherie, Berlin, Germany
[6] VIA Univ Coll, Res Ctr Bldg Energy Water & Climate, Horsens, Denmark
关键词
LONG-TERM CHANGES; CLIMATE-CHANGE; CENTRAL BASIN; NORTH SHORE; HYPOXIA; PHOSPHORUS; DEPLETION; MODEL; VARIABILITY; DYNAMICS;
D O I
10.1029/2023EA003473
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
As a key water quality parameter, dissolved oxygen (DO) concentration, and particularly changes in bottom water DO is fundamental for understanding the biogeochemical processes in lake ecosystems. Based on two machine learning (ML) models, Gradient Boost Regressor (GBR) and long-short-term-memory (LSTM) network, this study developed three ML model approaches: direct GBR; direct LSTM; and a 2-step mixed ML model workflow combining both GBR and LSTM. They were used to simulate multi-year surface and bottom DO concentrations in five lakes. All approaches were trained with readily available environmental data as predictors. Indices of lake thermal structure and mixing provided by a one-dimensional (1-D) hydrodynamic model were also included as predictors in the ML models. The advantages of each ML approach were not consistent for all the tested lakes, but the best one of them was defined that can estimate DO concentration with coefficient of determination (R2) up to 0.6-0.7 in each lake. All three approaches have normalized mean absolute error (NMAE) under 0.15. In a polymictic lake, the 2-step mixed model workflow showed better representation of bottom DO concentrations, with a highest true positive rate (TPR) of hypolimnetic hypoxia detection of over 90%, while the other workflows resulted in, TPRs are around 50%. In most of the tested lakes, the predicted surface DO concentrations and variables indicating stratified conditions (i.e., Wedderburn number and the temperature difference between surface and bottom water) are essential for simulating bottom DO. The ML approaches showed promising results and could be used to support short- and long-term water management plans. Dissolved oxygen (DO) concentrations is the essential water quality parameter in lake systems. Nowadays, with the development of data-driven machine learning (ML) models, prediction of DO concentrations can be achieved via these models in lakes with long-term DO concentration observations. This study developed three ML model approaches with one mixed two kind of ML models, and test them in five lakes. Readily available environmental data and the derived hydrodynamic data from process-based hydrodynamic model were used as predictors. All three ML approaches showed promising results, and the mixed ML approach show better skill in the lake stratifying and mixing irregularly. To predict hypoxia in the bottom of the lake, the surface DO concentrations and variables indicating water column stratification are important. A 2-step machine learning workflow combining Gradient Boost Regressor and long-short-term-memory was applied to simulate dissolved oxygen A one-dimensional process-based hydrodynamic model provides ML models with indices of lake thermal structure and mixing In a polymictic lake, the 2-step mixed machine learning workflow showed over 90% true positive rate of hypolimnetic hypoxia detection
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Multi-model ensemble predictions of precipitation and temperature using machine learning algorithms
    Ahmed, Kamal
    Sachindra, D. A.
    Shahid, Shamsuddin
    Iqbal, Zafar
    Nawaz, Nadeem
    Khan, Najeebullah
    ATMOSPHERIC RESEARCH, 2020, 236
  • [42] A Method of Multi-model Machine Learning for Electrical Energy Prediction Accuracy Improvement
    Im J.H.
    Seong Y.R.
    Oh H.-R.
    Transactions of the Korean Institute of Electrical Engineers, 2022, 71 (06): : 876 - 883
  • [43] The Geometry of Flow: Advancing Predictions of River Geometry With Multi-Model Machine Learning
    Chang, Shuyu Y.
    Ghahremani, Zahra
    Manuel, Laura
    Erfani, Seyed Mohammad Hassan
    Shen, Chaopeng
    Cohen, Sagy
    Van Meter, Kimberly J.
    Pierce, Jennifer L.
    Meselhe, Ehab A.
    Goharian, Erfan
    WATER RESOURCES RESEARCH, 2024, 60 (10)
  • [44] Research on multi-model imaging machine learning for distinguishing early hepatocellular carcinoma
    Ya Ma
    Yue Gong
    QingTao Qiu
    Changsheng Ma
    Shuang Yu
    BMC Cancer, 24
  • [45] A multi-model modeling approach to nonlinear systems based on lazy learning
    Pan Tianhong
    Li Shaoyuan
    Wang Xin
    Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 268 - 273
  • [46] Multi-model deep learning approach for collaborative filtering recommendation system
    Aljunid, Mohammed Fadhel
    Huchaiah, Manjaiah Doddaghatta
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2020, 5 (04) : 268 - 275
  • [47] A Machine Learning Approach Identifies a 30-Gene Model That Predicts Sensitivity to Selinexor in Multiple Myeloma
    Lagana, Alessandro
    Bhalla, Sherry
    Aleman, Adolfo
    Mei, Anna Huo-Chang
    Leshchenko, Violetta
    Landesman, Yosef
    Melnekoff, David
    Beno, Itai
    Madduri, Deepu
    Richter, Joshua R.
    Chari, Ajai
    Cho, Hearn Jay
    Jagannath, Sundar
    Dudley, Joel
    Parekh, Samir
    BLOOD, 2019, 134
  • [48] A Multi-Model Machine Learning Approach for Monitoring Calories Being Burnt During Workouts Using Smart Calorie Tracer
    Challagundla Y.
    Badri Narayanan K.
    Devatha K.S.
    Bharathi V.C.
    Ravindra J.V.R.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [49] A Novel Multi-Model Machine Learning Approach to Real-Time Road Accident Prediction and Driving Behavior Analysis
    Dinesh, Diya
    2021 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROLS (ISCSIC 2021), 2021, : 67 - 72
  • [50] An explainable machine learning approach for hospital emergency department visits forecasting using continuous training and multi-model regression
    Pelaez-Rodriguez, C.
    Torres-Lopez, R.
    Perez-Aracil, J.
    Lopez-Laguna, N.
    Sanchez-Rodriguez, S.
    Salcedo-Sanz, S.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 245