An Integrated Framework for Data-Driven Mineral Prospectivity Mapping Using Bagging-Based Positive-Unlabeled Learning and Bayesian Cost-Sensitive Logistic Regression

被引:14
|
作者
Zhang, Zhiqiang [1 ]
Wang, Gongwen [2 ,3 ,4 ]
Carranza, Emmanuel John M. [5 ]
Fan, Junjie [6 ]
Liu, Xinxing [1 ,7 ]
Zhang, Xiang [6 ]
Dong, Yulong [6 ]
Chang, XiaoPeng [6 ]
Sha, Deming [8 ]
机构
[1] Hebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
[2] China Univ Geosci, Sch Earth Sci & Resources, Beijing 100083, Peoples R China
[3] China Univ Geosci, MNR Key Lab Explorat Theory & Technol Crit Minera, Beijing 100083, Peoples R China
[4] Beijing Key Lab Land & Resources Informat Res & D, Beijing 100083, Peoples R China
[5] Univ Free State, Fac Nat & Agr Sci, Dept Geol, Bloemfontein, South Africa
[6] China Geol Survey, Geophys Survey Ctr, Langfang 065000, Peoples R China
[7] Hebei GEO Univ, Sch Earth Sci, Shijiazhuang 050031, Hebei, Peoples R China
[8] China Geol Survey, Shenyang 110000, Peoples R China
关键词
Positive-unlabeled learning; Bayesian cost-sensitive logistic regression; Uncertainty quantification; Mineral prospectivity mapping; Wulong Au district; RANDOM FOREST; GOLD DEPOSIT; MODELS; UNCERTAINTY; PREDICTION; DISTRICT; CHINA;
D O I
10.1007/s11053-022-10120-0
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Mineral prospectivity mapping (MPM) is a spatial quantitative approach to delineation of exploration targets. The input data and the data integration approaches are sources of critical uncertainties in MPM. Mineralization is a rare event. Hence, the imbalanced geosciences datasets used in data-driven MPM are usually composed of sparse positive samples and abundant unlabeled data. Selecting reliable negative samples is more challenging than selecting positive samples for MPM. Another issue in MPM is cost-sensitive classification because false positive errors if followed-up could result in increasing exploration costs, whereas false negative errors if not recognized could result in missing undiscovered mineral deposits. Few data-driven methods for MPM that simultaneously address systemic uncertainty from imbalanced data and cost-sensitive issues are reported in the literature. This study proposes an integrated framework for data-driven MPM, which is denoted as BPUL-BCSLR, with Bagging-based positive-unlabeled learning (BPUL) and Bayesian cost-sensitive logistic regression (BCSLR) in order to address simultaneously the issues mentioned above. We utilized the BPUL-BCSLR framework for MPM in the Wulong Au district, China. The performance of the BPUL-BCSLR framework was compared with that of logistic regression (LR) and with the BCSLR. The prediction-area plot was utilized to evaluate the performance of the LR, BCSLR, and BPUL-BCSLR predictive models and to outline exploration targets in the study area. Return-risk analysis was carried out to determine low risk-high return targets obtained by the BCSLR and BPUL-BCSLR frameworks. The results demonstrate that the BPUL-BCSLR framework outperformed the LR and BCSLR. The low risk-high return exploration targets obtained through the BPUL-BCSLR framework can benefit further exploration for Au in the Wulong Au district.
引用
收藏
页码:3041 / 3060
页数:20
相关论文
共 12 条
  • [1] An Integrated Framework for Data-Driven Mineral Prospectivity Mapping Using Bagging-Based Positive-Unlabeled Learning and Bayesian Cost-Sensitive Logistic Regression
    Zhiqiang Zhang
    Gongwen Wang
    Emmanuel John M. Carranza
    Junjie Fan
    Xinxing Liu
    Xiang Zhang
    Yulong Dong
    XiaoPeng Chang
    Deming Sha
    Natural Resources Research, 2022, 31 : 3041 - 3060
  • [2] Bagging-based positive-unlabeled learning algorithm with Bayesian hyperparameter optimization for three-dimensional mineral potential mapping
    Zhang, Zhiqiang
    Wang, Gongwen
    Liu, Chong
    Cheng, Lizhen
    Sha, Deming
    COMPUTERS & GEOSCIENCES, 2021, 154
  • [3] Landslide Susceptibility Modeling Using Bagging-Based Positive-Unlabeled Learning
    Wu, Bangyu
    Qiu, Weirong
    Jia, Junxiong
    Liu, Naihao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (05) : 766 - 770
  • [4] A Framework for Data-Driven Mineral Prospectivity Mapping with Interpretable Machine Learning and Modulated Predictive Modeling
    Mou, Nini
    Carranza, Emmanuel John M.
    Wang, Gongwen
    Sun, Xiang
    NATURAL RESOURCES RESEARCH, 2023, 32 (06) : 2439 - 2462
  • [5] A Framework for Data-Driven Mineral Prospectivity Mapping with Interpretable Machine Learning and Modulated Predictive Modeling
    Nini Mou
    Emmanuel John M. Carranza
    Gongwen Wang
    Xiang Sun
    Natural Resources Research, 2023, 32 : 2439 - 2462
  • [6] Data-driven logistic-based weighting of geochemical and geological evidence layers in mineral prospectivity mapping
    Yousefi, Mahyar
    Nykanen, Vesa
    JOURNAL OF GEOCHEMICAL EXPLORATION, 2016, 164 : 94 - 106
  • [7] Data-Driven Mineral Prospectivity Mapping Based on Known Deposits Using Association Rules
    Yu, Xiaotong
    Yu, Pengpeng
    Wang, Kunyi
    Cao, Wei
    Zhou, Yongzhang
    NATURAL RESOURCES RESEARCH, 2024, 33 (03) : 1025 - 1048
  • [8] An interpretable data-driven approach for customer purchase prediction using cost-sensitive learning
    Xiao, Fei
    Chen, Shui-xia
    Chen, Zi-yu
    Wang, Ya-nan
    Wang, Jian-qiang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [9] An Uncertainty-Quantification Machine Learning Framework for Data-Driven Three-Dimensional Mineral Prospectivity Mapping
    Zhang, Zhiqiang
    Wang, Gongwen
    Carranza, Emmanuel John M.
    Du, Jingguo
    Li, Yingjie
    Liu, Xinxing
    Su, Yongjun
    NATURAL RESOURCES RESEARCH, 2024, 33 (04) : 1393 - 1411
  • [10] Bagging-based Positive-Unlabeled Data Learning Algorithm with Base Learners Random Forest and XGBoost for 3D Exploration Targeting in the Kalatongke District, Xinjiang, China
    Gao, Meng
    Wang, Gongwen
    Yang, Wangdong
    Zhang, Zhiqiang
    Cai, Dingzhou
    Xu, Yunchou
    Yang, Shuren
    NATURAL RESOURCES RESEARCH, 2023, 32 (02) : 437 - 459