Improved kNN Rule for Small Training Sets

Cited by: 2
Authors
Cheamanunkul, Sunsern [1]
Freund, Yoav [1]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
DOI
10.1109/ICMLA.2014.37
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The traditional k-NN classification rule predicts the most common label among the k nearest neighbors (the plurality rule). It is known that the plurality rule is optimal when the number of examples tends to infinity. In this paper we show that the plurality rule is sub-optimal when the number of labels is large and the number of examples is small. We propose a simple k-NN rule that takes into account the labels of all of the neighbors, rather than just the most common label. We present a number of experiments on both synthetic and real-world datasets, including MNIST and SVHN. We show that our new rule can achieve lower error rates than the plurality rule in many cases.
Pages: 201-206
Number of pages: 6
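
The plurality rule described in the abstract can be sketched as follows. This is only a minimal illustration of the baseline rule, assuming Euclidean distance and small in-memory lists; the function name `knn_plurality` and the toy data are hypothetical. The rule proposed in the paper is not reproduced here, because the abstract does not specify it.

```python
from collections import Counter
import math


def knn_plurality(train_points, train_labels, query, k):
    """Return (predicted_label, label_counts) using the plurality rule.

    Assumption: Euclidean distance; the abstract does not name a metric.
    """
    # Rank training indices by distance from the query point.
    order = sorted(
        range(len(train_points)),
        key=lambda i: math.dist(train_points[i], query),
    )
    # Collect the labels of the k nearest neighbors.
    neighbor_labels = [train_labels[i] for i in order[:k]]
    counts = Counter(neighbor_labels)
    # The plurality rule keeps only the most common label and
    # discards the rest of the neighbor label counts.
    predicted = counts.most_common(1)[0][0]
    return predicted, counts


# Toy data (hypothetical): three labels, five training points.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (5.0, 6.0)]
labels = ["a", "a", "b", "c", "c"]

label, counts = knn_plurality(points, labels, (0.2, 0.2), k=3)
print(label, dict(counts))
```

The returned `counts` holds the full tally of neighbor labels, which is the information the plurality rule ignores beyond its top entry.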