Nearest neighbor search on vertically partitioned high-dimensional data

被引:0
|
作者
Dellis, E [1 ]
Seeger, B [1 ]
Vlachou, A [1 ]
机构
[1] Univ Marburg, Dept Math & Comp Sci, D-35032 Marburg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new approach to indexing multidimensional data that is particularly suitable for the efficient incremental processing of nearest neighbor queries. The basic idea is to use index-striping that vertically splits the data space into multiple low- and medium-dimensional data spaces. The data from each of these lower-dimensional subspaces is organized by using a standard multi-dimensional index structure. In order to perform incremental NN-queries on top of index-striping efficiently, we first develop an algorithm for merging the results received from the underlying indexes. Then, an accurate cost model relying on a power law is presented that determines an appropriate number of indexes. Moreover, we consider the problem of dimension assignment, where each dimension is assigned to a lower-dimensional subspace, such that the cost of nearest neighbor queries is minimized. Our experiments confirm the validity of our cost model and evaluate the performance of our approach.
引用
收藏
页码:243 / 253
页数:11
相关论文
共 50 条
  • [1] An efficient nearest neighbor search in high-dimensional data spaces
    Lee, DH
    Kim, HJ
    INFORMATION PROCESSING LETTERS, 2002, 81 (05) : 239 - 246
  • [2] High-Dimensional Nearest Neighbor Search with Remote Data Centers
    Wang, Changzhou
    Wang, Xiaoyang Sean
    Knowledge and Information Systems, 2002, 4 (04) : 440 - 465
  • [3] Nearest Neighbor Search in High-Dimensional Spaces
    Andoni, Alexandr
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2011, 2011, 6907 : 1 - 1
  • [4] An algorithm for incremental nearest neighbor search in high-dimensional data spaces
    Lee, DH
    Lee, HD
    Choi, IH
    Kim, HJ
    HUMAN SOCIETY AND THE INTERNET, PROCEEDINGS: INTERNET-RELATED SOCIO-ECONOMIC ISSUES, 2001, 2105 : 436 - 453
  • [5] Fast Scalable Approximate Nearest Neighbor Search for High-dimensional Data
    Bashyam, K. G. Renga
    Vadhiyar, Sathish
    2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 294 - 302
  • [6] Fast nearest neighbor search in high-dimensional space
    Berchtold, S
    Ertl, B
    Keim, DA
    Kriegel, HP
    Seidl, T
    14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1998, : 209 - 218
  • [7] Accelerating massive queries of approximate nearest neighbor search on high-dimensional data
    Liu, Yingfan
    Song, Chaowei
    Cheng, Hong
    Xia, Xiaofang
    Cui, Jiangtao
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (10) : 4185 - 4212
  • [8] Accelerating massive queries of approximate nearest neighbor search on high-dimensional data
    Yingfan Liu
    Chaowei Song
    Hong Cheng
    Xiaofang Xia
    Jiangtao Cui
    Knowledge and Information Systems, 2023, 65 : 4185 - 4212
  • [9] Exploring the Meaningfulness of Nearest Neighbor Search in High-Dimensional Space
    Chen, Zhonghan
    Zhang, Ruiyuan
    Zhao, Xi
    Cheng, Xiaojun
    Zhou, Xiaofang
    DATABASES THEORY AND APPLICATIONS, ADC 2024, 2025, 15449 : 181 - 194
  • [10] Fuzzy nearest neighbor clustering of high-dimensional data
    Wang, HB
    Yu, YQ
    Zhou, DR
    Meng, B
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2569 - 2572