Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space

被引:17
|
作者
Zhang, Ming [1 ]
Alhajj, Reda [1 ,2 ]
机构
[1] Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada
[2] Global Univ, Dept Comp Sci, Beirut, Lebanon
关键词
Knn search; High dimensionality; Dimensionality reduction; Indexing; Similarity search; NEAREST-NEIGHBOR; QUERIES; FILE;
D O I
10.1007/s10115-008-0190-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Similarity search (e.g., k-nearest neighbor search) in high-dimensional metric space is the key operation in many applications, such as multimedia databases, image retrieval and object recognition, among others. The high dimensionality and the huge size of the data set require an index structure to facilitate the search. State-of-the-art index structures are built by partitioning the data set based on distances to certain reference point(s). Using the index, search is confined to a small number of partitions. However, these methods either ignore the property of the data distribution (e.g., VP-tree and its variants) or produce non-disjoint partitions (e.g., M-tree and its variants, DBM-tree); these greatly affect the search efficiency. In this paper, we study the effectiveness of a new index structure, called Nested-Approximate-eQuivalence-class tree (NAQ-tree), which overcomes the above disadvantages. NAQ-tree is constructed by recursively dividing the data set into nested approximate equivalence classes. The conducted analysis and the reported comparative test results demonstrate the effectiveness of NAQ-tree in significantly improving the search efficiency.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 50 条
  • [1] Effectiveness of NAQ-tree as index structure for similarity search in high-dimensional metric space
    Ming Zhang
    Reda Alhajj
    Knowledge and Information Systems, 2010, 22 : 1 - 26
  • [2] Effectiveness of NAQ-tree in handling reverse nearest-neighbor queries in high-dimensional metric space
    Ming Zhang
    Reda Alhajj
    Knowledge and Information Systems, 2012, 31 : 307 - 343
  • [3] Effectiveness of NAQ-tree in handling reverse nearest-neighbor queries in high-dimensional metric space
    Zhang, Ming
    Alhajj, Reda
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 31 (02) : 307 - 343
  • [4] An adaptive index structure for high-dimensional similarity search
    Wu, P
    Manjunath, BS
    Chandrasekaran, S
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 71 - 77
  • [5] The GC-tree: A high-dimensional index structure for similarity search in image databases
    Cha, GH
    Chung, CW
    IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (02) : 235 - 247
  • [6] The SS+-tree: An improved index structure for similarity searches in a high-dimensional feature space
    Kurniamati, R
    Jin, JS
    Shepherd, JA
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V, 1997, 3022 : 110 - 120
  • [7] A High-Dimensional Space Index Structure BZ-tree
    Xu, Hongbo
    2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010, : 761 - 764
  • [8] A parallel similarity search in high dimensional metric space using M-tree
    Alpkocak, A
    Danisman, T
    Ulker, T
    ADVANCED ENVIRONMENTS, TOOLS, AND APPLICATIONS FOR CLUSTER COMPUTING, 2002, 2326 : 166 - 171
  • [9] An efficient high-dimensional index structure using cell signatures for similarity search
    Chang, JW
    Song, KT
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2001, 2118 : 26 - 33
  • [10] Surface spatial index structure of high-dimensional space
    An, JY
    Chen, YPP
    Xu, QY
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 272 - 278