A novel framework to predict chlorophyll-a concentrations in water bodies through multi-source big data and machine learning algorithms

被引:9
|
作者
Karimian, Hamed [1 ]
Huang, Jinhuang [1 ]
Chen, Youliang [2 ]
Wang, Zhaoru [3 ]
Huang, Jinsong [4 ]
机构
[1] Jiangsu Ocean Univ, Sch Marine Technol & Geomat, Lianyungang 222005, Peoples R China
[2] Jiangxi Univ Sci & Technol, Sch Civil & Surveying Engn, Ganzhou 341000, Peoples R China
[3] Jiangxi Univ Sci & Technol, Sch Resources & Environm Engn, Ganzhou 341000, Peoples R China
[4] Zhejiang Zhipu Engn Technol Co Ltd, Huzhou 313000, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Random forest; Water pollution; Eutrophication; Spatial analysis; CYANOBACTERIAL BLOOMS; MERIS DATA; MODIS; IMAGES; LEVEL; DEEP;
D O I
10.1007/s11356-023-27886-2
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Eutrophication happens when water bodies are enriched by minerals and nutrients. Dense blooms of noxious are the most obvious effect of eutrophication that harms water quality, and by increasing toxic substances damage the water ecosystem. Therefore, it is critical to monitor and investigate the development process of eutrophication. The concentration of chlorophyll-a (chl-a) in water bodies is an essential indicator of eutrophication in them. Previous studies in predicting chlorophyll-a concentrations suffered from low spatial resolution and discrepancies between predicted and observed values. In this paper, we used various remote sensing and ground observation data and proposed a novel machine learning-based framework, a random forest inversion model, to provide the spatial distribution of chl-a in 2 m spatial resolution. The results showed our model outperformed other base models, and the goodness of fit improved by over 36.6% while MSE and MAE decreased by over 15.17% and over 21.26% respectively. Moreover, we compared the feasibility of GF-1 and Sentinel-2 remote sensing data in chl-a concentration prediction. We found that better prediction results can be obtained by using GF-1 data, with the goodness of fit reaching 93.1% and MSE only 3.589. The proposed method and findings of this study can be used in future water management studies and as an aid for decision-makers in this field.
引用
收藏
页码:79402 / 79422
页数:21
相关论文
共 50 条
  • [31] Recent trends of machine learning applied to multi-source data of medicinal plants
    Zhang, Yanying
    Wang, Yuanzhong
    JOURNAL OF PHARMACEUTICAL ANALYSIS, 2023, 13 (12) : 1388 - 1407
  • [32] BRECCIA: A Novel Multi-source Fusion Framework for Dynamic Geospatial Data Analysis
    Sacharny, D.
    Henderson, T. C.
    Simmons, R.
    Mitiche, A.
    Welker, T.
    Fan, X.
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2017, : 390 - 396
  • [33] Understanding the impact of population dynamics on water use utilizing multi-source big data
    Zhou, Guihuan
    Li, Zhanjie
    Wang, Wei
    Wang, Qianyang
    Yu, Jingshan
    JOURNAL OF HYDROINFORMATICS, 2024, 26 (03) : 549 - 566
  • [34] Privacy Aware Non-linear Support Vector Machine for Multi-source Big Data
    Lu, Yunmei
    Phoungphol, Piyaphol
    Zhang, Yanqing
    2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 783 - 789
  • [35] Enhanced SHL Recognition Using Machine Learning and Deep Learning Models with Multi-source Data
    Li, Mengyuan
    Zhu, Jun
    Zhang, Yuanyuan
    Lu, Xiaoling
    ADJUNCT PROCEEDINGS OF THE 2023 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING & THE 2023 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTING, UBICOMP/ISWC 2023 ADJUNCT, 2023, : 505 - 510
  • [36] Predicting China's Maize Yield Using Multi-Source Datasets and Machine Learning Algorithms
    Miao, Lijuan
    Zou, Yangfeng
    Cui, Xuefeng
    Kattel, Giri Raj
    Shang, Yi
    Zhu, Jingwen
    REMOTE SENSING, 2024, 16 (13)
  • [37] A Smart Social Insurance Big Data Analytics Framework Based on Machine Learning Algorithms
    Senousy, Youssef
    Shehab, Abdulaziz
    Hanna, Wael K.
    Riad, Alaa M.
    El-bakry, Hazem A.
    Elkhamisy, Nashaat
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2020, 20 (01) : 95 - 111
  • [38] Towards of multi-source data fusion framework of geo-referenced and non-georeferenced data: prospects for use in surface water bodies
    Villalpando, Fermin
    Tuxpan, Jose
    Ramos-Leal, Jose Alfredo
    Marin, Ana Elizabeth
    GEOCARTO INTERNATIONAL, 2023,
  • [39] Reducing Cost of Process Modeling through Multi-source Data Transfer Learning
    Chan, Lester Lik Teck
    Chen, Junghui
    2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 474 - 479
  • [40] Digital Mapping and Scenario Prediction of Soil Salinity in Coastal Lands Based on Multi-Source Data Combined with Machine Learning Algorithms
    Zhou, Mengge
    Li, Yonghua
    REMOTE SENSING, 2024, 16 (14)