Array-index:: a plug&search K nearest neighbors method for high-dimensional data

被引:20
|
作者
Al Aghbari, Z [1 ]
机构
[1] Univ Sharjah, Dept Comp Sci, Sharjah, U Arab Emirates
关键词
indexing methods; image databases; KNN image search; array-index; plug&search method;
D O I
10.1016/j.datak.2004.06.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous algorithms of data partitioning methods (DPMs) to find the exact K-nearest neighbors (KNN) at high dimensions are outperformed by a linear scan method [J.M. Kleinberg, Two algorithms for nearest neighbor search in high dimensions, 29th ACM Symposium on Theory of computing, 1997; R. Weber, H.-J. Schek, S. Blott. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. in: Proc. of the 24th VLDB, USA, 1998]. In this paper, we present a "plug& search" method to greatly speed up the exact KNN search of existing DPMs. The idea is to linearize the data partitions produced by a DPM, rather than the points themselves, into a one-dimensional array-index, that is simple, compact and fast. Unlike most DPMs that support KNN search, which require storage space linear, or exponential [J.M. Kleinberg, Two algorithms for nearest neighbor search in high dimensions, 29th ACM Symposium on Theory of computing, 1997; M. Hagedoom, Nearest neighbors can be found efficiently if the dimension is small relative to the input size, ICDT 2003], in dimensions, the array-index requires a storage space that is linear in the number of mapped partitions. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:333 / 352
页数:20
相关论文
共 50 条
  • [21] A nearest neighbor search algorithm of high-dimensional data based on sequential NPsim matrix
    李文法
    Wang Gongming
    Ma Nan
    Liu Hongzhe
    High Technology Letters, 2016, 22 (03) : 241 - 247
  • [22] Secure Cloud-Aided Approximate Nearest Neighbor Search on High-Dimensional Data
    Liu, Jia
    Wang, Yinchai
    Wei, Fengrui
    Han, Qing
    Tao, Yunting
    Zhao, Liping
    Li, Xinjin
    Sun, Hongbo
    IEEE ACCESS, 2023, 11 : 109027 - 109037
  • [23] Exploiting lower bounds to accelerate approximate nearest neighbor search on high-dimensional data
    Liu, Yingfan
    Wei, Hao
    Cheng, Hong
    INFORMATION SCIENCES, 2018, 465 : 484 - 504
  • [24] An efficient LSH indexing on discriminative short codes for high-dimensional nearest neighbors
    Feng Xiaokang
    Cui Jiangtao
    Li Hui
    Liu Yingfan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 24407 - 24429
  • [25] Imputation methods for high-dimensional mixed-type datasets by nearest neighbors
    Faisal, Shahla
    Tutz, Gerhard
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
  • [26] An efficient LSH indexing on discriminative short codes for high-dimensional nearest neighbors
    Feng Xiaokang
    Cui Jiangtao
    Li Hui
    Liu Yingfan
    Multimedia Tools and Applications, 2019, 78 : 24407 - 24429
  • [27] An Optimal Proximity Method for Nearest Neighbor Search in High Dimensional Data
    Pasunuri, Raghunadh
    Venkaiah, Vadlamudi China
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 479 - 483
  • [28] Exploring the Meaningfulness of Nearest Neighbor Search in High-Dimensional Space
    Chen, Zhonghan
    Zhang, Ruiyuan
    Zhao, Xi
    Cheng, Xiaojun
    Zhou, Xiaofang
    DATABASES THEORY AND APPLICATIONS, ADC 2024, 2025, 15449 : 181 - 194
  • [29] New instability results for high-dimensional nearest neighbor search
    Giannella, Chris
    INFORMATION PROCESSING LETTERS, 2009, 109 (19) : 1109 - 1113
  • [30] A Sparse Reconstructive Evidential K-Nearest Neighbor Classifier for High-Dimensional Data
    Gong, Chaoyu
    Su, Zhi-Gang
    Wang, Pei-Hong
    Wang, Qian
    You, Yang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5563 - 5576