Nearest neighbor search on vertically partitioned high-dimensional data

被引:0
|
作者
Dellis, E [1 ]
Seeger, B [1 ]
Vlachou, A [1 ]
机构
[1] Univ Marburg, Dept Math & Comp Sci, D-35032 Marburg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new approach to indexing multidimensional data that is particularly suitable for the efficient incremental processing of nearest neighbor queries. The basic idea is to use index-striping that vertically splits the data space into multiple low- and medium-dimensional data spaces. The data from each of these lower-dimensional subspaces is organized by using a standard multi-dimensional index structure. In order to perform incremental NN-queries on top of index-striping efficiently, we first develop an algorithm for merging the results received from the underlying indexes. Then, an accurate cost model relying on a power law is presented that determines an appropriate number of indexes. Moreover, we consider the problem of dimension assignment, where each dimension is assigned to a lower-dimensional subspace, such that the cost of nearest neighbor queries is minimized. Our experiments confirm the validity of our cost model and evaluate the performance of our approach.
引用
收藏
页码:243 / 253
页数:11
相关论文
共 50 条
  • [31] Indexing the solution space: A new technique for nearest neighbor search in high-dimensional space
    Berchtold, S
    Keim, DA
    Kriegel, HP
    Seidl, T
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2000, 12 (01) : 45 - 57
  • [32] Towards Efficient Index Construction and Approximate Nearest Neighbor Search in High-Dimensional Spaces
    Zhao, Xi
    Tian, Yao
    Huang, Kai
    Zheng, Bolong
    Zhou, Xiaofang
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (08): : 1979 - 1991
  • [33] A depth-based nearest neighbor algorithm for high-dimensional data classification
    Harikumar S.
    Aravindakshan Savithri A.
    Kaimal R.
    Turkish Journal of Electrical Engineering and Computer Sciences, 2019, 27 (06): : 4082 - 4101
  • [34] A depth-based nearest neighbor algorithm for high-dimensional data classification
    Harikumar, Sandhya
    Aravindakshan Savithri, Akhil
    Kaimal, Ramachandra
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (06) : 4082 - 4101
  • [35] A Heterogeneous High-Dimensional Approximate Nearest Neighbor Algorithm
    Dubiner, Moshe
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (10) : 6646 - 6658
  • [36] High-dimensional shared nearest neighbor clustering algorithm
    Yin, J
    Fan, XL
    Chen, YQ
    Ren, JT
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 494 - 502
  • [37] Redefining nearest neighbor classification in high-dimensional settings
    Lopez, Julio
    Maldonado, Sebastian
    PATTERN RECOGNITION LETTERS, 2018, 110 : 36 - 43
  • [38] Approximate Nearest Neighbor Search on High Dimensional Data - Experiments, Analyses, and Improvement
    Li, Wen
    Zhang, Ying
    Sun, Yifang
    Wang, Wei
    Li, Mingjie
    Zhang, Wenjie
    Lin, Xuemin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (08) : 1475 - 1488
  • [39] Exploit Every Bit: Effective Caching for High-Dimensional Nearest Neighbor Search (Extended Abstract)
    Tang, Bo
    Yiu, Man Lung
    Hua, Kien A.
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 45 - 46
  • [40] NEAREST NEIGHBOR CLUSTERING OVER PARTITIONED DATA
    Khedr, Ahmed M.
    COMPUTING AND INFORMATICS, 2011, 30 (05) : 1011 - 1036