Approach for generating high accuracy machine learning model for high resolution geochemical map completion using remote sensing data - Case study of Arizona, USA

被引:1
|
作者
Huang, Chenhui [1 ]
Shibuya, Akinobu [2 ]
机构
[1] NEC Corp Data Sci Res Labs, Miyukigaoka 34, Tsukuba, Ibaraki 3058501, Japan
[2] NEC Corp Syst Platform Res Labs, Miyukigaoka 34, Tsukuba, Ibaraki 3058501, Japan
关键词
geochemical distribution; remote sensing; data analysis; machine learning; SPECTROSCOPY; SEDIMENTS; SOILS;
D O I
10.1117/12.2524940
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Complete high resolution geochemical maps are strongly needed for mineral exploration; however, the previously proposed methods for making geochemical maps have low accuracy. In this research, we propose a new algorithm called sample density based mixture interpolation (SADBAMIN) for high resolution geochemical map completion using remote sensing data. In the SADBAMIN algorithm, first, according to the measured copper data density on the map, the map is classified into two parts: the area for training (T area) and the area waiting to be predicted (P area). The two areas are classified by the edge of the data point set's alpha shape. In the T area, a triangle area among three neighbourhood points is interpolated by using the kriging model. Then, remote sensing data, including advanced spaceborne thermal emission and reflection radiometer (ASTER) data, digital elevation model (DEM) data, and geophysics (magnetic) data, and copper geochemical data at all measured and partial randomly selected interpolated points are applied as training data to construct a random forest regression model. By considering the relationship between interpolation reliability and distance, a penalty on data selection probability of going into training data is given. Finally, by inputting the remote sensing data in the P area to the model, the copper data in this area can be obtained, and the completed map comprises these two parts. We use 16,000 measured points, 10-fold cross-validation, and root mean squared error (RMSE) for model evaluation. We achieved an RMSE of 293 ppm, while the RMSE of the previously proposed method is 347 ppm.
引用
收藏
页数:10
相关论文
共 50 条
  • [42] Integrative image segmentation optimization and machine learning approach for high quality land-use and land-cover mapping using multisource remote sensing data
    Gibril, Mohamed Barakat A.
    Idrees, Mohammed Oludare
    Yao, Kouame
    Shafri, Helmi Zulhaidi Mohd
    JOURNAL OF APPLIED REMOTE SENSING, 2018, 12 (01):
  • [43] Oyster Aquaculture Site Selection Using High-Resolution Remote Sensing: A Case Study in the Gulf of Maine, United States
    Jiang, Binbin
    Boss, Emmanuel
    Kiffney, Thomas
    Hesketh, Gabriel
    Bourdin, Guillaume
    Fan, Daidu
    Brady, Damian C.
    FRONTIERS IN MARINE SCIENCE, 2022, 9
  • [44] A machine learning approach to model leaf area index in Eucalyptus plantations using high-resolution satellite imagery and airborne laser scanner data
    Hirigoyen, Andres
    Acosta-Munoz, Cristina
    Ariza Salamanca, Antonio Jesus
    Angeles Varo-Martinez, Maria
    Rachid-Casnati, Cecilia
    Franco, Jorge
    Navara-Cerrillo, Rafael
    ANNALS OF FOREST RESEARCH, 2021, 64 (02) : 165 - 183
  • [45] Morphometric, rheological and compositional analysis of an effusive lunar dome using high resolution remote sensing data sets: A case study from Marius hills region
    Arya, A. S.
    Rajasekhar, R. P.
    Amitabh
    Krishna, B. Gopala
    Ajai
    Kumar, A. S. Kiran
    ADVANCES IN SPACE RESEARCH, 2014, 54 (10) : 2073 - 2086
  • [46] Geoid modeling using a high resolution geopotential model and terrain data: A case study in Canadian Rockies
    Prasanna, Herath Mudiyanselage Indika
    Chen, Wu
    JOURNAL OF APPLIED GEODESY, 2012, 6 (02) : 89 - 101
  • [47] A MACHINE LEARNING APPROACH FOR HIGH RESOLUTION FRACTIONAL VEGETATION COVER ESTIMATION USING PLANET CUBESAT AND RGB DRONE DATA FUSION
    Nesslage, Jacob
    Barreto, Brittany Lopez
    Weingram, Adam
    Hestir, Erin
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 4879 - 4882
  • [48] Improving peak detection in high-resolution LC/MS metabolomics data using preexisting knowledge and machine learning approach
    Yu, Tianwei
    Jones, Dean P.
    BIOINFORMATICS, 2014, 30 (20) : 2941 - 2948
  • [49] Comparative analysis of different machine learning algorithms for urban footprint extraction in diverse urban contexts using high-resolution remote sensing imagery
    Gui, Baoling
    Bhardwaj, Anshuman
    Sam, Lydia
    JOURNAL OF GEOGRAPHICAL SCIENCES, 2025, 35 (03) : 664 - 696
  • [50] Comparative analysis of different machine learning algorithms for urban footprint extraction in diverse urban contexts using high-resolution remote sensing imagery
    GUI Baoling
    Anshuman BHARDWAJ
    Lydia SAM
    Journal of Geographical Sciences, 2025, 35 (03) : 664 - 696