Algorithm based on RGH-tree for similarity search queries

被引:0
|
作者
Zhang, Zhao-Gong [1 ,2 ]
Li, Jian-Zhong [1 ,2 ]
机构
[1] Coll. of Comp. Sci. and Technol., Harbin Inst. of Technol., Harbin 150001, China
[2] Heilongjiang Univ., Harbin 150080, China
来源
| 1969年 / Chinese Academy of Sciences卷 / 13期
关键词
Algorithms - Computational complexity - Database systems - Information retrieval - Query languages - Theorem proving - Trees (mathematics);
D O I
暂无
中图分类号
学科分类号
摘要
Similarity search is a very important problem in data mining. It retrieves similar objects in database and finds proximity between objects. It can be applied to image/picture databases, spatial databases, and time-series database. For Euclid space (a special metric space), similarity search algorithms based on R-tree are efficient in low-dimensional space, but degenerate into linear scan for high-dimensional space. This phenomenon is called dimensionality curse. This paper presents a new partition and index method of metric space, rgh-tree which distributes and partitions objects by using distance information of objects with few fixed reference. It produces a balance tree with no data overlay. In addition, an algorithm based on rgh-tree, which is suitable for similarity search in metric space, is presented. The algorithm overcomes the shortcomings of the exiting algorithms, which has less I/O cost and times of computing distance, with average complexity o(n0.58).
引用
收藏
相关论文
共 50 条
  • [21] A Hybrid Sparrow Search Algorithm Based on Constructing Similarity
    Liu Jianhua
    Wang Zhiheng
    IEEE ACCESS, 2021, 9 : 117581 - 117595
  • [22] Regular polygon based search algorithm for processing maximum range queries
    Sato, Hideki
    Narita, Ryoichi
    Smart Innovation, Systems and Technologies, 2015, 30 : 99 - 114
  • [23] A Fast Tree-Based Search Algorithm for Cluster Search Engine
    Tsai, Chun-Wei
    Huang, Ko-Wei
    Chiang, Ming-Chao
    Yang, Chu-Sing
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1603 - +
  • [24] Impact of the Initialization in Tree-Based Fast Similarity Search Techniques
    Serrano, Aureo
    Mico, Luisa
    Oncina, Jose
    SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 163 - 176
  • [25] A Tree-Based Indexing Approach for Diverse Textual Similarity Search
    Yu, Minghe
    Chai, Chengliang
    Yu, Ge
    IEEE ACCESS, 2021, 9 : 8866 - 8876
  • [26] Impact of the Initialization in Tree-Based Fast Similarity Search Techniques
    Serrano, Aureo
    Mico, Luisa
    Oncina, Jose
    SIMILARITY-BASED PATTERN RECOGNITION, 2011, 7005 : 163 - 176
  • [27] A Similarity-Guaranteed Clustering Algorithm and Its Search Tree for Handling an Increased Weight
    Lappanitchayakul, Kreadtisak
    Hiransakolwong, Nualsawat
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED INFORMATICS AND COMMUNICATIONS, PTS I AND II: NEW ASPECTS OF APPLIED INFORMATICS AND COMMUNICATIONS, 2008, : 297 - +
  • [28] Efficient Similarity Search with a Pivot-Based Complete Binary Tree
    Yamagishi, Yuki
    Aoyama, Kazuo
    Saito, Kazumi
    Ikeda, Tetsuo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (10): : 2526 - 2536
  • [29] An algorithm based on valuation forecasting for game tree search
    Guangyun Tan
    Peipei Wei
    Yongyi He
    Huahu Xu
    Xinxin Shi
    Ping Yi
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1083 - 1095
  • [30] Tree Based Search Algorithm for Binary Image Compression
    Hooda, Reetu
    Pan, W. David
    2019 IEEE SOUTHEASTCON, 2019,