Integrative modeling of heterogeneous soil salinity using sparse ground samples and remote sensing images

被引:23
|
作者
Wang, Lingyue [1 ,2 ,3 ]
Hu, Ping [4 ]
Zheng, Hongwei [1 ,2 ,3 ]
Liu, Ying [1 ,2 ,3 ]
Cao, Xingwen [1 ,2 ,3 ]
Hellwich, Olaf [5 ]
Liu, Tie [1 ,2 ,3 ]
Luo, Geping [1 ,2 ,3 ]
Bao, Anming [1 ,2 ,3 ]
Chen, Xi [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Xinjiang Inst Ecol & Geog, State Key Lab Desert & Oasis Ecol, Urumqi, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Chinese Acad Sci, Res Ctr Ecol & Environm Cent Asia, Urumqi, Peoples R China
[4] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
[5] Tech Univ Berlin, Dept Comp Vis & Remote Sensing, D-10587 Berlin, Germany
基金
中国国家自然科学基金;
关键词
Soil salinity; Sparse samples; Mixup; Light gradient boosting machine (LightGBM); Bayesian optimization; Shapley additive explanations (SHAP); MACHINE; CHINA; OPTIMIZATION; PREDICTION; PARAMETERS; MOISTURE;
D O I
10.1016/j.geoderma.2022.116321
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
Soil salinization is a major environmental risk caused by natural or human activities especially in arid and semi -arid regions. Machine learning for rapidly monitoring large-scale spatial soil salinization becomes possible. However, machine learning often needs large training samples and obtaining extensive soil salinization infor-mation by field investigation is laborious and difficult. In practice, the field soil sampling datasets are often sparse and non-normally distributed. The intricacy of features extracted from remote sensing images increases the model complexity and often leads to degradation in the prediction performance. To solve this problem, an integrative framework is proposed to predict soil salt content (SSC) based on light gradient boosting machine (LGBM). In this model, we first introduce the data augmentation method (Mixup) to improve sample diversity and alleviate model overfitting by the sparsity of samples. To improve the generalization and robustness of the model in different spatial heterogeneity of soil salinization, the Mixup-LGBM model is adaptively and jointly optimized by combining hyperparameters and feature selection in a Bayesian optimization framework. Furthermore, model interpretability is improved using shapley additive explanations (SHAP) value based on the combination of the confidence of the synthetic data through model visualization and feature importance assessment. In addition, different cases are simulated to test the model performance. In Case I, the raw sample-sparsity model using the data augmentation algorithm has higher prediction accuracy than other unused models. In Case II, the extreme sample-sparsity model still achieves satisfactory results while the other models can't learn any effective information after multiple iterations. The experimental results reveal that the proposed model can automatically find representative features in heterogeneous environments and has strong adaptability in different study areas. This finding indicates that digital elevation model (DEM) has a high influence on SSC in both study areas. Besides the DEM, soil salinization in the Manasi River Basin is more sensitive to human ac-tivities, while that in the Werigan-Kuqa River Delta Oasis is more sensitive to natural factors. The Mixup-LGBM model is suitable for predicting SSC in different sample sparsity scenarios while ensuring the high accuracy. The model has considerable potential for dealing with other complex sample sparsity regression tasks.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Monitoring of soil erosion caused by construction projects using remote sensing images
    Qiu, T.
    Lu, F. C.
    Zhang, J. J.
    Wang, R.
    2ND INTERNATIONAL CONFERENCE ON ADVANCES IN CIVIL AND ECOLOGICAL ENGINEERING RESEARCH, 2021, 626
  • [32] Ground, Proximal, and Satellite Remote Sensing of Soil Moisture
    Babaeian, Ebrahim
    Sadeghi, Morteza
    Jones, Scott B.
    Montzka, Carsten
    Vereecken, Harry
    Tuller, Markus
    REVIEWS OF GEOPHYSICS, 2019, 57 (02) : 530 - 616
  • [33] New approach to remote sensing of temperature and salinity in natural water samples
    Artlett, C. P.
    Pask, H. M.
    OPTICS EXPRESS, 2017, 25 (03): : 2840 - 2851
  • [34] Novel Land Cover Change Detection Deep Learning Framework with Very Small Initial Samples Using Heterogeneous Remote Sensing Images
    Zhu, Yangpeng
    Li, Qianyu
    Lv, Zhiyong
    Falco, Nicola
    REMOTE SENSING, 2023, 15 (18)
  • [35] Spatial prediction of soil salinity based on the Google Earth Engine platform with multitemporal synthetic remote sensing images
    Ma, Shilong
    He, Baozhong
    Ge, Xiangyu
    Luo, Xuefeng
    ECOLOGICAL INFORMATICS, 2023, 75
  • [36] A faster SVM classification technique for remote sensing images using reduced training samples
    Chowdhury K.
    Chaudhuri D.
    Pal A.K.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (12) : 16807 - 16827
  • [37] Classification of Remote Sensing Images Through Reweighted Sparse Subspace Representation Using Compressed Data
    Zhu, Jianchen
    Zhao, Shengjie
    Wu, Di
    TRAITEMENT DU SIGNAL, 2021, 38 (01) : 27 - 37
  • [38] Illumination Sensing Using Sparse Modeling
    Gogineni, Sandeep
    Nehorai, Arye
    2011 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP AND IEEE SIGNAL PROCESSING EDUCATION WORKSHOP (DSP/SPE), 2011, : 255 - 258
  • [39] Study on Retrieval Method of Soil Organic Matter in Salinity Soil Using Unmanned Aerial Vehicle Remote Sensing
    Wang W.
    Peng J.
    Zhu W.
    Yang B.
    Liu Z.
    Gong H.
    Wang J.
    Yang T.
    Lou J.
    Sun Z.
    Journal of Geo-Information Science, 2024, 26 (03) : 736 - 752
  • [40] Modeling Forest Stand Parameters Using Remote Sensing Images and topographic information
    Xie, Shiqin
    Wang, Wei
    Wang, Zemin
    Huang, Guosheng
    PROCEEDINGS OF 2020 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP 2020), 2020, : 109 - 114