The Anomaly Detector, Semi-supervised Classifier, and Supervised Classifier Based on K-Nearest Neighbors in Geochemical Anomaly Detection: A Comparative Study

Cited by: 8
Authors
Chen, Yongliang [1]
Lu, Laijun [1]
Affiliations
[1] Jilin Univ, Coll Earth Sci, Changchun 130061, Jilin, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
K-nearest neighbor; Supervised classification; Semi-supervised classification; Geochemical anomaly detection; Polymetallic mineral deposits; RECOGNITION;
DOI
10.1007/s11004-022-10042-w
Chinese Library Classification number
P [Astronomy, Earth Sciences];
Discipline classification code
07;
Abstract
Unsupervised anomaly detection techniques mainly model the population distribution of geochemical exploration data but do not consider the mineral deposit information available for the study area. Supervised classification techniques can make full use of mineral deposit information to distinguish geochemical anomalies from the background. However, these techniques usually cannot properly handle the data imbalance involved: a study area contains only a few known mineralized data points and a large number of data points to be evaluated. Semi-supervised classification techniques are machine learning algorithms developed for classification problems with a small amount of labeled data and a large amount of unlabeled data, which makes them suitable for identifying anomalies in geochemical exploration data. Therefore, in this study, the K-nearest neighbor (KNN) algorithm, which can serve as the basis for anomaly detection, semi-supervised classification, and supervised classification models, was used to construct all three types of model for detecting polymetallic anomalies in a case study of the Baishan area (China). Stream sediment survey data collected from four 1:200,000 geological maps and the 30 polymetallic deposits found in the study area were used to train the KNN-based models. The receiver operating characteristic (ROC) curve and the area under the ROC curve (AUC) were used to compare the performance of the models in detecting polymetallic anomalies.
The results show that the KNN-based semi-supervised and supervised classification models perform similarly in detecting polymetallic anomalies, and that both are superior to the KNN-based anomaly detection model. Therefore, as long as the training dataset is defined according to the known deposits in the study area, the KNN-based semi-supervised and supervised classification models are potentially effective methods for detecting mineralization-related geochemical anomalies.
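The abstract gives no implementation details, so the following is only a minimal sketch of how a KNN-based anomaly score and a KNN-based supervised score could each be evaluated with AUC. Everything in it is an assumption made for illustration: the synthetic data stand in for real geochemical samples, `k=5` is arbitrary, the anomaly score is taken as the mean distance to the k nearest neighbors, and the supervised score as the fraction of positive labels among the k nearest training samples. The semi-supervised variant is not shown, and none of this is claimed to be the authors' method.

```python
import numpy as np

def pairwise_dist(a, b):
    """Euclidean distances between rows of a (n, d) and rows of b (m, d)."""
    diff = a[:, None, :] - b[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2))

def knn_anomaly_score(x, k=5):
    """Unsupervised score: mean distance from each sample to its k nearest neighbors."""
    d = pairwise_dist(x, x)
    np.fill_diagonal(d, np.inf)  # a sample is not its own neighbor
    return np.sort(d, axis=1)[:, :k].mean(axis=1)

def knn_class_score(x_train, y_train, x_test, k=5):
    """Supervised score: fraction of the k nearest training samples labeled 1."""
    d = pairwise_dist(x_test, x_train)
    idx = np.argsort(d, axis=1)[:, :k]
    return y_train[idx].mean(axis=1)

def auc(scores, labels):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (greater + 0.5 * ties) / (len(pos) * len(neg))

# Hypothetical, imbalanced synthetic data: 300 "background" and 30 "mineralized" samples.
rng = np.random.default_rng(0)
background = rng.normal(0.0, 1.0, size=(300, 4))
mineralized = rng.normal(3.0, 1.0, size=(30, 4))
x = np.vstack([background, mineralized])
y = np.concatenate([np.zeros(300, dtype=int), np.ones(30, dtype=int)])

# Deterministic split: even rows for training, odd rows for testing.
train = np.arange(0, len(x), 2)
test = np.arange(1, len(x), 2)

auc_unsup = auc(knn_anomaly_score(x[test], k=5), y[test])
auc_sup = auc(knn_class_score(x[train], y[train], x[test], k=5), y[test])
print(f"AUC, KNN anomaly score:    {auc_unsup:.3f}")
print(f"AUC, KNN supervised score: {auc_sup:.3f}")
```

The pairwise AUC is used here because it needs no threshold sweep and equals the area under the ROC curve; its quadratic memory cost is acceptable only for small sample counts such as these.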
Pages: 1011-1033
Page count: 23