Stream salinity prediction in data-scarce regions: Application of transfer learning and uncertainty quantification

被引:0
|
作者
Khodkar, Kasra [1 ]
Mirchi, Ali [1 ]
Nourani, Vahid [2 ]
Kaghazchi, Afsaneh [1 ]
Sadler, Jeffrey M. [1 ]
Mansaray, Abubakarr [3 ]
Wagner, Kevin [3 ]
Alderman, Phillip D. [4 ]
Taghvaeian, Saleh [5 ]
Bailey, Ryan T. [6 ]
机构
[1] Oklahoma State Univ, Dept Biosyst & Agr Engn, Stillwater, OK 74078 USA
[2] Univ Tabriz, Fac Civil Engn, Ctr Excellence Hydroinformat, Tabriz, Iran
[3] Oklahoma State Univ, Oklahoma Water Resources Ctr, Stillwater, OK 74078 USA
[4] Oklahoma State Univ, Dept Plant & Soil Sci, Stillwater, OK 74078 USA
[5] Univ Nebraska Lincoln, Dept Biol Syst Engn, Lincoln, NE 68583 USA
[6] Colorado State Univ, Dept Civil & Environm Engn, Ft Collins, CO 80523 USA
基金
美国食品与农业研究所;
关键词
Stream salinity; Machine learning; Missing data; Lower Upper Bound Estimation (LUBE); Uncertainty quantification; Upper Red River Basin; QUALITY; WATER; MODELS; DISCHARGE;
D O I
10.1016/j.jconhyd.2024.104418
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Scarcity of stream salinity data poses a challenge to understanding salinity dynamics and its implications for water supply management in water-scarce salt-prone regions around the world. This paper introduces a framework for generating continuous daily stream salinity estimates using instance-based transfer learning (TL) and assessing the reliability of the synthetic salinity data through uncertainty quantification via prediction intervals (PIs). The framework was developed using two temporally distinct specific conductance (SC) datasets from the Upper Red River Basin (URRB) located in southwestern Oklahoma and Texas Panhandle, United States. The instance-based TL approach was implemented by calibrating Feedforward Neural Networks (FFNNs) on a source SC dataset of around 1200 instantaneous grab samples collected by United States Geological Survey (USGS) from 1959 to 1993. The trained FFNNs were subsequently tested on a target dataset (1998-present) of 220 instantaneous grab samples collected by the Oklahoma Water Resources Board (OWRB). The framework's generalizability was assessed in the data-rich Bird Creek watershed in Oklahoma by manipulating continuous SC data to simulate data-scarce conditions for training the models and using the complete Bird Creek dataset for model evaluation. The Lower Upper Bound Estimation (LUBE) method was used with FFNNs to estimate PIs for uncertainty quantification. Autoregressive SC prediction methods via FFNN were found to be reliable with Nash Sutcliffe Efficiency (NSE) values of 0.65 and 0.45 on in-sample and out-of-sample test data, respectively. The same modeling scenario resulted in an NSE of 0.54 for the Bird Creek data using a similar missing data ratio, whereas a higher ratio of observed data increased the accuracy (NSE = 0.84). The relatively narrow estimated PIs for the North Fork Red River in the URRB indicated satisfactory stream salinity predictions, showing an average width equivalent to 25 % of the observed range and a confidence level of 70 %.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Synthesizing labeled data to enhance soft sensor performance in data-scarce regions
    Lyu, Yuting
    Chen, Junghui
    Song, Zhihuan
    CONTROL ENGINEERING PRACTICE, 2021, 115
  • [22] Regional Mapping of Groundwater Resources in Data-Scarce Regions: The Case of Laos
    Viossanges, Mathieu
    Pavelic, Paul
    Rebelo, Lisa-Maria
    Lacombe, Guillaume
    Sotoukee, Touleelor
    HYDROLOGY, 2018, 5 (01)
  • [23] Benchmarking pavement practices in data-scarce regions - case of Saudi Arabia
    Hamdar, Yara S.
    Bakchan, Amal
    Chehab, Ghassan R.
    Al-Qadi, Imad
    Little, Dallas
    INTERNATIONAL JOURNAL OF PAVEMENT ENGINEERING, 2021, 22 (03) : 294 - 306
  • [24] SnowCloudHydroA New Framework for Forecasting Streamflow in Snowy, Data-Scarce Regions
    Sproles, Eric A.
    Crumley, Ryan L.
    Nolin, Anne W.
    Mar, Eugene
    Lopez Moreno, Juan Ignacio
    REMOTE SENSING, 2018, 10 (08)
  • [25] Enhanced landslide susceptibility mapping in data-scarce regions via unsupervised few-shot learning
    Kong, Linghao
    Feng, Wenkai
    Yi, Xiaoyu
    Xue, Zhenghai
    Bai, Luyao
    GONDWANA RESEARCH, 2025, 138 : 31 - 46
  • [26] Geostatistical Framework for Estimation of VS30 in Data-Scarce Regions
    Gilder, Charlotte E. L.
    De Risi, Raffaele
    De Luca, Flavia
    Pokhrel, Rama Mohan
    Vardanega, Paul J.
    BULLETIN OF THE SEISMOLOGICAL SOCIETY OF AMERICA, 2022, 112 (06) : 2981 - 3000
  • [27] Development of Strategy for SWAT Hydrologic Modeling in Data-Scarce Regions of Peru
    Daneshvar, Fariborz
    Frankenberger, Jane R.
    Bowling, Laura C.
    Cherkauer, Keith A.
    Moraes, Andre G. de Lima
    JOURNAL OF HYDROLOGIC ENGINEERING, 2021, 26 (07)
  • [28] Transfer learning framework for streamflow prediction in large-scale transboundary catchments: Sensitivity analysis and applicability in data-scarce basins
    Ma, Kai
    Shen, Chaopeng
    Xu, Ziyue
    He, Daming
    JOURNAL OF GEOGRAPHICAL SCIENCES, 2024, 34 (05) : 963 - 984
  • [29] Integrated hydrodynamic and machine learning models for compound flooding prediction in a data-scarce estuarine delta
    Sampurno, Joko
    Vallaeys, Valentin
    Ardianto, Randy
    Hanert, Emmanuel
    NONLINEAR PROCESSES IN GEOPHYSICS, 2022, 29 (03) : 301 - 315
  • [30] The Potential of Deep Learning for Satellite Rainfall Detection over Data-Scarce Regions, the West African Savanna
    Estebanez-Camarena, Monica
    Taormina, Riccardo
    van de Giesen, Nick
    ten Veldhuis, Marie-Claire
    REMOTE SENSING, 2023, 15 (07)