Improved kNN Rule for Small Training Sets

Times cited: 2
Authors
Cheamanunkul, Sunsern [1]
Freund, Yoav [1]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
DOI
10.1109/ICMLA.2014.37
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
The traditional k-NN classification rule predicts a label based on the most common label of the k nearest neighbors (the plurality rule). It is known that the plurality rule is optimal when the number of examples tends to infinity. In this paper we show that the plurality rule is suboptimal when the number of labels is large and the number of examples is small. We propose a simple k-NN rule that takes into account the labels of all of the neighbors, rather than just the most common label. We present a number of experiments on both synthetic datasets and real-world datasets, including MNIST and SVHN. We show that our new rule can achieve lower error rates compared to the plurality rule in many cases.
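For reference, the traditional plurality rule named in the abstract can be sketched as follows. This is a minimal illustration and is not taken from the paper: the function name `knn_plurality`, the Euclidean distance, the toy data, and the tie-breaking behaviour (the tied label seen first among the sorted neighbors wins) are all assumptions. The paper's proposed rule is not specified in this record, so it is not shown.

```python
import math
from collections import Counter


def knn_plurality(train, query, k):
    """Predict a label by plurality vote among the k nearest training points.

    `train` is a list of (point, label) pairs; points are equal-length
    tuples of floats. Euclidean distance is an assumption of this sketch.
    """
    # Sort training points by distance to the query and keep the k closest.
    neighbors = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    # Count the neighbor labels and return the most common one.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: three labels in two dimensions.
train = [
    ((0.0, 0.0), "a"),
    ((0.1, 0.2), "a"),
    ((1.0, 1.0), "b"),
    ((0.9, 1.1), "b"),
    ((5.0, 5.0), "c"),
]
print(knn_plurality(train, (0.2, 0.1), k=3))
```

Note that the vote discards everything except the winning label's count, which is the information the abstract says the proposed rule tries to use.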
Pages: 201-206
Page count: 6