Algorithm based on RGH-tree for similarity search queries

Cited by: 0
Authors
Zhang, Zhao-Gong [1, 2]
Li, Jian-Zhong [1, 2]
Affiliations
[1] Coll. of Comp. Sci. and Technol., Harbin Inst. of Technol., Harbin 150001, China
[2] Heilongjiang Univ., Harbin 150080, China
Source
1969 / Vol. Chinese Academy of Sciences / No. 13
Keywords
Algorithms - Computational complexity - Database systems - Information retrieval - Query languages - Theorem proving - Trees (mathematics)
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Similarity search is an important problem in data mining: it retrieves objects in a database that are similar to a query and finds the proximity between objects. It applies to image databases, spatial databases, and time-series databases. In Euclidean space (a special case of metric space), similarity search algorithms based on the R-tree are efficient in low dimensions but degenerate into a linear scan in high dimensions, a phenomenon known as the curse of dimensionality. This paper presents the RGH-tree, a new method for partitioning and indexing a metric space that distributes and partitions objects using their distances to a small number of fixed reference points. It produces a balanced tree with no data overlap. The paper also presents an RGH-tree-based algorithm for similarity search in metric space. The algorithm overcomes shortcomings of existing algorithms, incurring lower I/O cost and fewer distance computations, with average complexity O(n^0.58).
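The abstract does not give the RGH-tree construction itself, so the following is only a minimal sketch of the general idea it builds on: using precomputed distances to a few fixed reference points to prune candidates in a metric-space range query via the triangle inequality. The function names (`build_pivot_table`, `range_query`), the flat table layout, and the choice of reference points are assumptions made for illustration and are not taken from the paper.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance; any metric obeying the triangle inequality would work."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_pivot_table(objects, pivots, dist=euclidean):
    """Precompute the distance from every object to every reference point."""
    return [[dist(obj, p) for p in pivots] for obj in objects]


def range_query(objects, pivots, table, query, radius, dist=euclidean):
    """Return (hits, computed): objects within `radius` of `query`, and the
    number of object distances that actually had to be computed."""
    q_to_pivots = [dist(query, p) for p in pivots]
    hits, computed = [], 0
    for obj, row in zip(objects, table):
        # Triangle inequality: |d(q, p) - d(o, p)| <= d(q, o).
        # If the lower bound already exceeds the radius, skip the object.
        if any(abs(qp - op) > radius for qp, op in zip(q_to_pivots, row)):
            continue
        computed += 1
        if dist(query, obj) <= radius:
            hits.append(obj)
    return hits, computed


# Illustrative usage on random 2-D points (hypothetical data).
random.seed(0)
points = [(random.random(), random.random()) for _ in range(1000)]
pivots = points[:3]  # assumed: simply take the first three points as references
table = build_pivot_table(points, pivots)
hits, computed = range_query(points, pivots, table, (0.5, 0.5), 0.1)
```

This sketch scans a flat table, so it reduces distance computations but not the number of objects visited; a tree index such as the one the abstract describes would additionally partition the objects so that whole groups can be skipped.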
Related papers
50 in total
  • [11] Phonetic Matching and Syntactic Tree Similarity Based QA System for SMS Queries
    Mittal, Ankush
    Bhatt, Pooja
    Kumar, Padam
    2014 INTERNATIONAL CONFERENCE ON GREEN COMPUTING COMMUNICATION AND ELECTRICAL ENGINEERING (ICGCCEE), 2014
  • [12] Exact Trajectory Similarity Search With N-tree: An Efficient Metric Index for kNN and Range Queries
    Gueting, Ralf Hartmut
    Das, Suvam Kumar
    Valdes, Fabio
    Ray, Suprio
    ACM TRANSACTIONS ON SPATIAL ALGORITHMS AND SYSTEMS, 2025, 11 (01)
  • [13] Efficient Bitmap-based Indexing and Retrieval of Similarity Search Image Queries
    Jafari, Omid
    Nagarkar, Parth
    Montano, Jonathan
    2020 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION (SSIAI 2020), 2020, : 58 - 61
  • [14] Cache and Priority Queue Based Approximation Technique for a Stream of Similarity Search Queries
    Nalepa, Filip
    Batko, Michal
    Zezula, Pavel
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2017, 2017, 10609 : 17 - 33
  • [15] Tree-Structured Vector Quantization for Similarity Queries
    Wu, Hanwei
    Wang, Qiwen
    Flierl, Markus
    2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 467 - 467
  • [16] A Sentence Similarity Algorithm Based on Weighted Keyword Tree
    Yu, Tiantian
    2014 2ND INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE AND HEALTH (ICSSH 2014), PT 2, 2014, 56 : 326 - 330
  • [17] Gene cluster algorithm based on most similarity tree
    Lu Xin-guo
    Lin Ya-ping
    Li Xiao-long
    Yi Ye-qing
    Cai Li-jun
    Wang Hai-jun
    Eighth International Conference on High-Performance Computing in Asia-Pacific Region, Proceedings, 2005, : 652 - 656
  • [18] Queries with ordering based on similarity
    Carrasquel Oropeza, Soraya Odalis
    Rodriguez de Tineo, Rosseline Carmen
    Tineo, Leonid
    TELEMATIQUE, 2013, 12 (01): : 24 - 45
  • [19] Pivot Generation Algorithm with a Complete Binary Tree for Efficient Exact Similarity Search
    Yamagishi, Yuki
    Aoyama, Kazuo
    Saito, Kazumi
    Ikeda, Tetsuo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (01): : 142 - 151
  • [20] Indexing large metric spaces for similarity search queries
    Bozkaya, T
    Ozsoyoglu, M
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 1999, 24 (03): : 361 - 404