Array-index:: a plug&search K nearest neighbors method for high-dimensional data

被引:20
|
作者
Al Aghbari, Z [1 ]
机构
[1] Univ Sharjah, Dept Comp Sci, Sharjah, U Arab Emirates
关键词
indexing methods; image databases; KNN image search; array-index; plug&search method;
D O I
10.1016/j.datak.2004.06.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous algorithms of data partitioning methods (DPMs) to find the exact K-nearest neighbors (KNN) at high dimensions are outperformed by a linear scan method [J.M. Kleinberg, Two algorithms for nearest neighbor search in high dimensions, 29th ACM Symposium on Theory of computing, 1997; R. Weber, H.-J. Schek, S. Blott. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. in: Proc. of the 24th VLDB, USA, 1998]. In this paper, we present a "plug& search" method to greatly speed up the exact KNN search of existing DPMs. The idea is to linearize the data partitions produced by a DPM, rather than the points themselves, into a one-dimensional array-index, that is simple, compact and fast. Unlike most DPMs that support KNN search, which require storage space linear, or exponential [J.M. Kleinberg, Two algorithms for nearest neighbor search in high dimensions, 29th ACM Symposium on Theory of computing, 1997; M. Hagedoom, Nearest neighbors can be found efficiently if the dimension is small relative to the input size, ICDT 2003], in dimensions, the array-index requires a storage space that is linear in the number of mapped partitions. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:333 / 352
页数:20
相关论文
共 50 条
  • [41] An adaptive index structure for high-dimensional similarity search
    Wu, P
    Manjunath, BS
    Chandrasekaran, S
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 71 - 77
  • [42] On optimizing nearest neighbor queries in high-dimensional data spaces
    Berchtold, S
    Böhm, C
    Keim, D
    Krebs, F
    Kriegel, HP
    DATABASE THEORY - ICDT 2001, PROCEEDINGS, 2001, 1973 : 435 - 449
  • [43] A Fast Exact k-Nearest Neighbors Algorithm for High Dimensional Search Using k-Means Clustering and Triangle Inequality
    Wang, Xueyi
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 1293 - 1299
  • [44] NEAREST NEIGHBORS AND VORONOI VOLUMES IN HIGH-DIMENSIONAL POINT-PROCESSES WITH VARIOUS DISTANCE FUNCTIONS
    NEWMAN, CM
    RINOTT, Y
    ADVANCES IN APPLIED PROBABILITY, 1985, 17 (04) : 794 - 809
  • [45] SR-tree: An index structure for nearest-neighbor searching of high-dimensional point data
    Katayama, Norio
    Satoh, Shin'ichi
    Systems and Computers in Japan, 1998, 29 (06) : 59 - 73
  • [46] Broadcast schedules and query processing for k nearest neighbors search on multi-dimensional index trees in a multi-channel environment
    Shu-Yu Fu
    Chuan-Ming Liu
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 2646 - +
  • [47] Towards Secure Approximate k-Nearest Neighbor Query Over Encrypted High-Dimensional Data
    Peng, Yanguo
    Li, Hui
    Cui, Jiangtao
    Ma, Jianfeng
    Liu, Yingfan
    IEEE ACCESS, 2018, 6 : 23137 - 23151
  • [48] Exploit Every Bit: Effective Caching for High-Dimensional Nearest Neighbor Search
    Tang, Bo
    Yiu, Man Lung
    Hua, Kien A.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (05) : 1175 - 1188
  • [49] k Nearest Neighbor Similarity Join Algorithm on High-Dimensional Data Using Novel Partitioning Strategy
    Ma, Youzhong
    Hua, Qiaozhi
    Wen, Zheng
    Zhang, Ruiling
    Zhang, Yongxin
    Li, Haipeng
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [50] High-Dimensional Nearest Neighbor Search-Based Blocking in Entity Resolution
    Zhang, Kaiyu
    Sun, Chenchen
    Shen, Derong
    Nie, Tiezheng
    Kou, Yue
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 215 - 226