DB-LSH 2.0: Locality-Sensitive Hashing With Query-Based Dynamic Bucketing

被引:3
|
作者
Tian, Yao [1 ]
Zhao, Xi [1 ]
Zhou, Xiaofang [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
关键词
Costs; Indexes; Search problems; Time complexity; Nearest neighbor methods; Hash functions; Extraterrestrial measurements; Approximate nearest neighbor search; high-dimensional spaces; locality sensitive hashing; APPROXIMATE NEAREST-NEIGHBOR; SEARCH; TREES;
D O I
10.1109/TKDE.2023.3295831
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Locality-sensitive hashing (LSH) is a promising family of methods for the high-dimensional approximate nearest neighbor (ANN) search problem due to its sub-linear query time and strong theoretical guarantee. Existing LSH methods either suffer from large index sizes and hash boundary problems, or incur a linear cost for high-quality candidate identification. This dilemma is addressed in a novel method called DB-LSH proposed in this paper. It organizes the projected spaces with multi-dimensional indexes instead of fixed-width hash buckets, which significantly reduces space costs. High-quality candidates can be generated efficiently by dynamically constructing query-based hypercubic buckets with the required widths through index-based window queries. A novel incremental search strategy called DBI-LSH is also developed to further boost the query performance, which incrementally accesses the next best point for higher accuracy and efficiency. Considering the intermediate query information of each query, DBA-LSH is designed to adaptively tune termination conditions without scarifying the success probability. Our theoretical analysis proves that DB-LSH has a smaller query cost than the existing work while DBA-LSH and DBI-LSH have lower expected query costs than DB-LSH. An extensive range of experiments on real-world data show the superiority of our approaches over the state-of-the-art methods in both efficiency and accuracy.
引用
收藏
页码:1000 / 1015
页数:16
相关论文
共 50 条
  • [1] DB-LSH: Locality-Sensitive Hashing with Query-based Dynamic Bucketing
    Tian, Yao
    Zhao, Xi
    Thou, Xiaofang
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2250 - 2262
  • [2] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Renyang Liu
    Jun Zhao
    Xing Chu
    Yu Liang
    Wei Zhou
    Jing He
    Soft Computing, 2024, 28 : 1041 - 1053
  • [3] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Liu, Renyang
    Zhao, Jun
    Chu, Xing
    Liang, Yu
    Zhou, Wei
    He, Jing
    SOFT COMPUTING, 2024, 28 (02) : 887 - 902
  • [4] BCH-LSH: a new scheme of locality-sensitive hashing
    Ma, Yuena
    Feng, Xiaoyi
    Liu, Yang
    Li, Shuhong
    IET IMAGE PROCESSING, 2018, 12 (06) : 850 - 855
  • [5] DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search
    Wei, Jiuqi
    Peng, Botao
    Lee, Xiaodong
    Palpanas, Themis
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (09): : 2241 - 2254
  • [6] IP.LSH.DBSCAN: Integrated Parallel Density-Based Clustering Through Locality-Sensitive Hashing
    Keramatian, Amir
    Gulisano, Vincenzo
    Papatriantafilou, Marina
    Tsigas, Philippas
    EURO-PAR 2022: PARALLEL PROCESSING, 2022, 13440 : 268 - 284
  • [7] Query-aware locality-sensitive hashing scheme for l p norm
    Huang, Qiang
    Feng, Jianlin
    Fang, Qiong
    Ng, Wilfred
    Wang, Wei
    VLDB JOURNAL, 2017, 26 (05): : 683 - 708
  • [8] Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search
    Huang, Qiang
    Feng, Jianlin
    Zhang, Yikai
    Fang, Qiong
    Ng, Wilfred
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 9 (01): : 1 - 12
  • [9] Locality-Sensitive Hashing Based Multiobjective Memetic Algorithm for Dynamic Pickup and Delivery Problems
    Wang, Fangxiao
    Gao, Yuan
    Zhu, Zexuan
    2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 661 - 666
  • [10] Digital Watermarks for Videos Based on a Locality-Sensitive Hashing Algorithm
    Sun, Yajuan
    Srivastava, Gautam
    MOBILE NETWORKS & APPLICATIONS, 2023, 28 (05): : 1724 - 1737