A fuzzy K-nearest neighbor classifier to deal with imperfect data

被引:17
|
作者
Cadenas, Jose M. [1 ]
Carmen Garrido, M. [1 ]
Martinez, Raquel [2 ]
Munoz, Enrique [3 ]
Bonissone, Piero P. [4 ]
机构
[1] Univ Murcia, Dept Informat & Commun Engn, Murcia, Spain
[2] Catholic Univ Murcia, Dept Comp Engn, Murcia, Spain
[3] Univ Milan, Dept Comp Sci, Crema, Italy
[4] Piero P Bonissone Analyt LLC, San Diego, CA USA
关键词
k-nearest neighbors; Classification; Imperfect data; Distance/dissimilarity measures; Combination methods; PERFORMANCE; RULES; ALGORITHMS;
D O I
10.1007/s00500-017-2567-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k-nearest neighbors method (kNN) is a nonparametric, instance-based method used for regression and classification. To classify a new instance, the kNN method computes its k nearest neighbors and generates a class value from them. Usually, this method requires that the information available in the datasets be precise and accurate, except for the existence of missing values. However, data imperfection is inevitable when dealing with real-world scenarios. In this paper, we present the kNN(imp) classifier, a k-nearest neighbors method to perform classification from datasets with imperfect value. The importance of each neighbor in the output decision is based on relative distance and its degree of imperfection. Furthermore, by using external parameters, the classifier enables us to define the maximum allowed imperfection, and to decide if the final output could be derived solely from the greatest weight class (the best class) or from the best class and a weighted combination of the closest classes to the best one. To test the proposed method, we performed several experiments with both synthetic and real-world datasets with imperfect data. The results, validated through statistical tests, show that the kNN(imp) classifier is robust when working with imperfect data and maintains a good performance when compared with other methods in the literature, applied to datasets with or without imperfection.
引用
收藏
页码:3313 / 3330
页数:18
相关论文
共 50 条
  • [31] Categorical Data Classification based on Fuzzy K-Nearest Neighbor Approach
    Rustamaji, Heru Cahya
    Simanjuntak, Oliver Samuel
    Luhrie, Shalfa Fitriga
    Yuwono, Bambang
    Juwairiah
    2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE ININFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 171 - 175
  • [32] Evaluation of k-Nearest Neighbor classifier performance for direct marketing
    Govindarajan, M.
    Chandrasekaran, R. M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (01) : 253 - 258
  • [33] A Fast k-Nearest Neighbor Classifier Using Unsupervised Clustering
    Vajda, Szilard
    Santosh, K. C.
    RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 185 - 193
  • [34] An Algorithm of Incremental Bayesian Classifier Based on K-Nearest Neighbor
    Wang, Dong
    Xiong, Shi-huan
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1455 - 1459
  • [35] Boosting the distance estimation -: Application to the K-Nearest Neighbor Classifier
    Amores, J
    Sebe, N
    Radeva, P
    PATTERN RECOGNITION LETTERS, 2006, 27 (03) : 201 - 209
  • [36] Fault Diagnosis Based on LTSA and K-Nearest Neighbor Classifier
    Jiang, Jingsheng
    Wang, Huaqing
    Ke, Yanliang
    Xiang, Wei
    Zhendong yu Chongji/Journal of Vibration and Shock, 2017, 36 (11): : 134 - 139
  • [37] A fall detection system using k-nearest neighbor classifier
    Liu, Chien-Liang
    Lee, Chia-Hoang
    Lin, Ping-Min
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) : 7174 - 7181
  • [38] Classification of facial expressions using K-Nearest Neighbor Classifier
    Sohail, Abu Sayeed Md.
    Bhattacharya, Prabir
    COMPUTER VISION/COMPUTER GRAPHICS COLLABORATION TECHNIQUES, 2007, 4418 : 555 - +
  • [39] A Sparse Reconstructive Evidential K-Nearest Neighbor Classifier for High-Dimensional Data
    Gong, Chaoyu
    Su, Zhi-Gang
    Wang, Pei-Hong
    Wang, Qian
    You, Yang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5563 - 5576
  • [40] Resp-kNN: A probabilistic k-nearest neighbor classifier for sparsely labeled data
    Calma, Adrian
    Reitmaier, Tobias
    Sick, Bernhard
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4040 - 4047