On optimum choice of k in nearest neighbor classification

被引:85
|
作者
Ghosh, Anil K. [1 ]
机构
[1] Indian Stat Inst, Theoret Stat & Math Unit, Kolkata 700108, India
关键词
accuracy index; Bayesian strength function; cross-validation; misclassification rate; neighborhood parameter; non-informative prior; optimal Bayes risk; posterior probability;
D O I
10.1016/j.csda.2005.06.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A major issue in k-nearest neighbor classification is how to choose the optimum value of the neighborhood parameter k. Popular cross-validation techniques often fail to guide us well in selecting k mainly due to the presence of multiple minimizers of the estimated misclassification rate. This article investigates a Bayesian method in this connection, which solves the problem of multiple optimizers. The utility of the proposed method is illustrated using some benchmark data sets. (C) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:3113 / 3123
页数:11
相关论文
共 50 条
  • [1] On nearest neighbor classification using adaptive choice of k
    Ghosh, Anil K.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2007, 16 (02) : 482 - 502
  • [2] CHOICE OF NEIGHBOR ORDER IN NEAREST-NEIGHBOR CLASSIFICATION
    Hall, Peter
    Park, Byeong U.
    Samworth, Richard J.
    ANNALS OF STATISTICS, 2008, 36 (05): : 2135 - 2152
  • [3] CHOICE OF THE SMOOTHING PARAMETER AND EFFICIENCY OF K-NEAREST NEIGHBOR CLASSIFICATION
    ENAS, GG
    CHOI, SC
    COMPUTERS & MATHEMATICS WITH APPLICATIONS-PART A, 1986, 12 (02): : 235 - 244
  • [4] Improved k-nearest neighbor classification
    Wu, YQ
    Ianakiev, K
    Govindaraju, V
    PATTERN RECOGNITION, 2002, 35 (10) : 2311 - 2318
  • [5] Analysis of the k-nearest neighbor classification
    Li, Jing
    Cheng, Ming
    INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1911 - 1917
  • [6] Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification
    Okfalisa
    Mustakim
    Gazalba, Ikbal
    Reza, Nurul Gayatri Indah
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 294 - 298
  • [7] Joint Evidential K-Nearest Neighbor Classification
    Gong, Chaoyu
    Li, Yongbin
    Liu, Yong
    Wang, Pei-hong
    You, Yang
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2113 - 2126
  • [8] A Classification Algorithm in Li-K Nearest Neighbor
    Wang, Bangjun
    Zhang, Li
    Wang, Xiaoqian
    2013 FOURTH GLOBAL CONGRESS ON INTELLIGENT SYSTEMS (GCIS), 2013, : 185 - 189
  • [9] Deep Metric Learning for K Nearest Neighbor Classification
    Liao, Tingting
    Lei, Zhen
    Zhu, Tianqing
    Zeng, Shan
    Li, Yaqin
    Yuan, Cao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 264 - 275
  • [10] k -Nearest Neighbor Curves in Imaging Data Classification
    Cabon, Yann
    Suehs, Carey
    Bommart, Sebastien
    Vachier, Isabelle
    Marin, Gregory
    Bourdin, Arnaud
    Molinari, Nicolas
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2019, 5