Accelerated k-nearest neighbors algorithm based on principal component analysis for text categorization

被引:0
|
作者
Min DU
Xing-shu CHEN
机构
[1] SchoolofComputerScience,SichuanUniversity
关键词
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
摘要
Text categorization is a significant technique to manage the surging text data on the Internet.The k-nearest neighbors(kNN) algorithm is an effective,but not efficient,classification model for text categorization.In this paper,we propose an effective strategy to accelerate the standard kNN,based on a simple principle:usually,near points in space are also near when they are projected into a direction,which means that distant points in the projection direction are also distant in the original space.Using the proposed strategy,most of the irrelevant points can be removed when searching for the k-nearest neighbors of a query point,which greatly decreases the computation cost.Experimental results show that the proposed strategy greatly improves the time performance of the standard kNN,with little degradation in accuracy.Specifically,it is superior in applications that have large and high-dimensional datasets.
引用
收藏
页码:407 / 416
页数:10
相关论文
共 50 条
  • [21] An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio
    Raneem Qaddoura
    Hossam Faris
    Ibrahim Aljarah
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 675 - 714
  • [22] An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio
    Qaddoura, Raneem
    Faris, Hossam
    Aljarah, Ibrahim
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (03) : 675 - 714
  • [23] A quantum k-nearest neighbors algorithm based on the Euclidean distance estimation
    Zardini, Enrico
    Blanzieri, Enrico
    Pastorello, Davide
    QUANTUM MACHINE INTELLIGENCE, 2024, 6 (01)
  • [24] EDITING FOR THE K-NEAREST NEIGHBORS RULE BY A GENETIC ALGORITHM
    KUNCHEVA, LI
    PATTERN RECOGNITION LETTERS, 1995, 16 (08) : 809 - 814
  • [25] A UNIMODAL CLUSTERING-ALGORITHM BASED ON THE K-NEAREST NEIGHBORS METHOD
    KOVALENKO, AP
    AUTOMATION AND REMOTE CONTROL, 1993, 54 (05) : 794 - 798
  • [26] Weighted K-nearest neighbors classification based on Whale optimization algorithm
    Anvari, S.
    Azgomi, M. Abdollahi
    Dishabi, M. R. Ebrahimi
    Maheri, M.
    IRANIAN JOURNAL OF FUZZY SYSTEMS, 2023, 20 (03): : 61 - 74
  • [27] Quantum K-nearest neighbors classification algorithm based on Mahalanobis distance
    Gao, Li-Zhen
    Lu, Chun-Yue
    Guo, Gong-De
    Zhang, Xin
    Lin, Song
    FRONTIERS IN PHYSICS, 2022, 10
  • [28] BRANCH AND BOUND ALGORITHM FOR COMPUTING K-NEAREST NEIGHBORS
    FUKUNAGA, K
    NARENDRA, PM
    IEEE TRANSACTIONS ON COMPUTERS, 1975, C 24 (07) : 750 - 753
  • [29] PERFORMANCE OF K-NEAREST NEIGHBORS ALGORITHM IN OPINION CLASSIFICATION
    Jedrzejewski, Krzysztof
    Zamorski, Maurycy
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2013, 38 (02) : 97 - 110
  • [30] K-Nearest Neighbors Hashing
    He, Xiangyu
    Wang, Peisong
    Cheng, Jian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2834 - 2843