Array-index:: a plug&search K nearest neighbors method for high-dimensional data

被引:20
|
作者
Al Aghbari, Z [1 ]
机构
[1] Univ Sharjah, Dept Comp Sci, Sharjah, U Arab Emirates
关键词
indexing methods; image databases; KNN image search; array-index; plug&search method;
D O I
10.1016/j.datak.2004.06.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous algorithms of data partitioning methods (DPMs) to find the exact K-nearest neighbors (KNN) at high dimensions are outperformed by a linear scan method [J.M. Kleinberg, Two algorithms for nearest neighbor search in high dimensions, 29th ACM Symposium on Theory of computing, 1997; R. Weber, H.-J. Schek, S. Blott. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. in: Proc. of the 24th VLDB, USA, 1998]. In this paper, we present a "plug& search" method to greatly speed up the exact KNN search of existing DPMs. The idea is to linearize the data partitions produced by a DPM, rather than the points themselves, into a one-dimensional array-index, that is simple, compact and fast. Unlike most DPMs that support KNN search, which require storage space linear, or exponential [J.M. Kleinberg, Two algorithms for nearest neighbor search in high dimensions, 29th ACM Symposium on Theory of computing, 1997; M. Hagedoom, Nearest neighbors can be found efficiently if the dimension is small relative to the input size, ICDT 2003], in dimensions, the array-index requires a storage space that is linear in the number of mapped partitions. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:333 / 352
页数:20
相关论文
共 50 条
  • [1] Hubs in space: Popular nearest neighbors in high-dimensional data
    Radovanović, Miloš
    Nanopoulos, Alexandros
    Ivanović, Mirjana
    Journal of Machine Learning Research, 2010, 11 : 2487 - 2531
  • [2] Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data
    Radovanovic, Milos
    Nanopoulos, Alexandros
    Ivanovic, Mirjana
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 2487 - 2531
  • [3] Hybrid (CPU/GPU) Exact Nearest Neighbors Search in High-Dimensional Spaces
    Muhr, David
    Affenzeller, Michael
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART II, 2022, 647 : 112 - 123
  • [4] Secure k Nearest Neighbors Query for High-Dimensional Vectors in Outsourced Environments
    Xue, Wenzhuo
    Li, Hui
    Peng, Yanguo
    Cui, Jiangtao
    Shi, Yu
    IEEE TRANSACTIONS ON BIG DATA, 2018, 4 (04) : 586 - 599
  • [5] Supporting K nearest neighbors query on high-dimensional data in P2P systems
    Li M.
    Lee W.-C.
    Sivasubramaniam A.
    Zhao J.
    Front. Comput. Sci. China, 2008, 3 (234-247): : 234 - 247
  • [6] Weighted k-nearest neighbors feature selection for high-dimensional multi-class data
    Bugata, Peter
    Drotar, Peter
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3066 - 3073
  • [7] Nearest neighbor search on vertically partitioned high-dimensional data
    Dellis, E
    Seeger, B
    Vlachou, A
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2005, 3589 : 243 - 253
  • [8] An efficient nearest neighbor search in high-dimensional data spaces
    Lee, DH
    Kim, HJ
    INFORMATION PROCESSING LETTERS, 2002, 81 (05) : 239 - 246
  • [9] High-Dimensional Nearest Neighbor Search with Remote Data Centers
    Wang, Changzhou
    Wang, Xiaoyang Sean
    Knowledge and Information Systems, 2002, 4 (04) : 440 - 465
  • [10] GPU-Accelerated Reverse K-Nearest Neighbor Search for High-Dimensional Data
    Tsuihiji, Kyohei
    Amagasa, Toshiyuki
    ADVANCES IN NETWORK-BASED INFORMATION SYSTEMS, NBIS-2022, 2022, 526 : 279 - 288