SLIPP: A Space-Efficient Learned Index for String Keys

Cited by: 0
Authors
Zhou, Weihong [1]
Yang, Shiyu [1]
Affiliations
[1] Guangzhou Univ, Guangzhou, Peoples R China
Source
2024 6TH INTERNATIONAL CONFERENCE ON BIG-DATA SERVICE AND INTELLIGENT COMPUTATION, BDSIC 2024 | 2024
Funding
National Key Research and Development Program of China
Keywords
In-memory; Learned index; String
DOI
10.1145/3686540.3686550
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Efficient indexing structures are crucial for high-performance data access in in-memory data management systems. Traditional indexing methods, while effective in specific scenarios, often struggle with variable-length string keys and range queries. This paper presents the String Learned Index with Precise Positions (SLIPP), an enhancement of the Learned Index with Precise Positions (LIPP) that incorporates trie-based methodologies. By integrating trie characteristics with the predictive capabilities of a simple univariate linear regression model, SLIPP aims to optimize the handling of string keys, significantly reducing memory usage and improving lookup speeds. Our evaluation, utilizing the TLI experimental framework, demonstrates SLIPP's effectiveness in rapid lookups and highlights its adaptability to datasets featuring long common prefixes. Although SLIPP encounters challenges with intricate data distributions, its approach to string key indexing, building on the foundation of LIPP and incorporating trie features, offers a promising avenue for enhancing database systems to manage large datasets more efficiently and with lower space requirements.
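The abstract does not give implementation details, so the following is only a rough sketch of the general idea it names: a trie-like step that skips the prefix shared by all keys in a node, followed by a univariate linear model that maps the next few bytes of a key to an exact slot, with colliding keys pushed into a child node. Everything here is an assumption made for illustration, not the authors' design: the names `Node`, `encode`, and `CHUNK`, the use of min-max linear interpolation in place of a fitted regression, the slot count of twice the number of keys, and the requirement that keys are distinct, non-empty in number, and free of NUL bytes.

```python
CHUNK = 4  # bytes of the key fed to the model per node (illustrative choice)


def encode(key: bytes, depth: int) -> int:
    """Turn CHUNK bytes of `key` starting at `depth` into an integer (zero-padded)."""
    part = key[depth:depth + CHUNK].ljust(CHUNK, b"\0")
    return int.from_bytes(part, "big")


class Node:
    """One node of a hypothetical LIPP-style index over byte-string keys."""

    def __init__(self, keys, depth=0):
        # `keys` must be a sorted, non-empty list of distinct bytes without NULs.
        lo, hi = keys[0], keys[-1]
        # Trie-like step: skip the prefix shared by every key in this node.
        while depth < min(len(lo), len(hi)) and lo[depth] == hi[depth]:
            depth += 1
        self.depth = depth

        xs = [encode(k, depth) for k in keys]
        self.size = 2 * len(keys)   # spare slots reduce collisions
        self.base = xs[0]           # smallest encoded value in this node
        self.span = xs[-1] - xs[0]  # range covered by the linear model

        # Group keys by predicted slot; order within a group stays sorted.
        buckets = {}
        for k, x in zip(keys, xs):
            buckets.setdefault(self._slot(x), []).append(k)

        self.slots = [None] * self.size
        for s, group in buckets.items():
            # A lone key is stored at its predicted slot; colliding keys
            # get a child node with its own, narrower linear model.
            self.slots[s] = group[0] if len(group) == 1 else Node(group, depth)

    def _slot(self, x: int) -> int:
        """Linear model: interpolate x into [0, size - 1], clamped at both ends."""
        if self.span == 0:
            return 0
        pos = (x - self.base) * (self.size - 1) // self.span
        return min(max(pos, 0), self.size - 1)

    def lookup(self, key: bytes) -> bool:
        """Follow model predictions downward; no local search inside a node."""
        node = self
        while True:
            entry = node.slots[node._slot(encode(key, node.depth))]
            if isinstance(entry, Node):
                node = entry
            else:
                # Skipped prefix bytes were never checked, so compare the full key.
                return entry == key


keys = sorted({b"user:al", b"user:alice", b"user:bob", b"user:carla", b"user:carol"})
index = Node(keys)
found_all = all(index.lookup(k) for k in keys)
```

In this sketch the shared prefix `user:` is skipped once at the root rather than stored per key, which is the kind of saving a trie-style layout targets for datasets with long common prefixes. A real implementation would also need inserts, range scans, and a model fitted to the key distribution, none of which are attempted here.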
Pages: 69-77
Page count: 9