Towards Self-Indexing Relational Databases

被引:1
|
作者
Ruiz, Guillermo [1 ]
Chavez, Edgar [3 ]
Tellez, Eric S. [2 ]
机构
[1] Univ Michoacana, Div Estudios Posgrad, Fac Ingn Elect, Morelia, Michoacan, Mexico
[2] Univ Michoacana, Fac Ciencias Fisico-Matematicas, Morelia, Michoacan, Mexico
[3] Univ Nacl Autonoma Mexico, Math Inst, Mexico City 04510, DF, Mexico
关键词
D O I
10.1109/ENC.2013.9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we present methods for compacting and self-indexing the tables of a relational database. This means we replace both the data and the indexes for a unique succinct data structure. This new compact representation uses less space than the plain text of the database. We implemented the necessary machinery to solve search and join operations without decompressing the database. A self-indexed database with this method uses close to 90% of the plain representation while supporting some relational queries and full text searches. When compared to SQlite3+FTS, our representation produces several times smaller databases and faster searches. Extracting the text from the compact representation is slower than SQlite3+FTS, and hence our approach is more useful for selective queries. To cope with this drawback we provide a way to improve the extraction speed at the cost of using more memory. Finally, we provide a large set of experimental results characterizing the memory and time performance of our proposal.
引用
收藏
页码:20 / 27
页数:8
相关论文
共 50 条
  • [21] Gradient-like flows and self-indexing in stratified Morse theory
    Grinberg, M
    TOPOLOGY, 2005, 44 (01) : 175 - 202
  • [22] Emergent relations - Self-indexing mediafor case-based reasoning
    Tan, M
    DIGITAL DESIGN: RESEARCH AND PRACTICE, 2003, : 73 - 81
  • [23] Improved self-indexing inverted files for full-text retrieval
    College of Compute Science, South-Central University for Nationalities, Wuhan 430074, China
    不详
    J. Comput. Inf. Syst., 2009, 2 (1017-1024):
  • [25] Towards semantic integrity in relational databases
    Botha, RA
    SECURITY IN THE INFORMATION SOCIETY: VISIONS AND PERSPECTIVES, 2002, 86 : 287 - 297
  • [26] Indexing Musical Sequences in Large Datasets Using Relational Databases
    Charapko, Aleksey
    Chuan, Ching-Hua
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2015, 6 (02): : 1 - 18
  • [27] Practical High-Order Entropy-Compressed Text Self-Indexing
    Huo, Hongwei
    Long, Peng
    Vitter, Jeffrey Scott
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (03) : 2943 - 2960
  • [28] Towards the reverse engineering of denormalized relational databases
    Petit, JM
    Toumani, F
    Boulicaut, JF
    Kouloumdjian, J
    PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, : 218 - 227
  • [29] Spatio-temporal Indexing in Non-relational Distributed Databases
    Fox, Anthony
    Eichelberger, Chris
    Hughes, James
    Lyon, Skylar
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [30] Towards Preference-aware Relational Databases
    Arvanitis, Anastasios
    Koutrika, Georgia
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 426 - 437