Distribution-Aware Compressed Full-Text Indexes

被引:6
|
作者
Ferragina, Paolo [1 ]
Siren, Jouni [2 ]
Venturini, Rossano [1 ]
机构
[1] Univ Pisa, Dipartimento Informat, I-56127 Pisa, Italy
[2] Univ Helsinki, Dept Comp Sci, SF-00510 Helsinki, Finland
基金
芬兰科学院;
关键词
Full-text indexing; Compressed full-text indexes; Succinct data structures; Dynamic programming; K-LINK PATH; WEIGHT; GRAPHS;
D O I
10.1007/s00453-013-9782-3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we address the problem of building a compressed self-index that, given a distribution for the pattern queries and a bound on the space occupancy, minimizes the expected query time within that index space bound. We solve this problem by exploiting a reduction to the problem of finding a minimum weight K-link path in a properly designed Directed Acyclic Graph. Interestingly enough, our solution can be used with any compressed index based on the Burrows-Wheeler transform. Our experiments compare this optimal strategy with several other known approaches, showing its effectiveness in practice.
引用
收藏
页码:529 / 546
页数:18
相关论文
共 50 条
  • [31] VIDEODISCS FOR FULL-TEXT SEARCHING
    SCHIPMA, PB
    ZIEMER, SM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 183 (MAR): : 26 - CINF
  • [32] General science full-text
    Stoklosa, K
    LIBRARY JOURNAL, 2003, 128 (04) : 129 - 129
  • [33] FULL-TEXT ONLINE RETRIEVAL
    COLBERT, AW
    ONLINE, 1988, 12 (02): : 91 - 91
  • [34] SAGE full-text collections
    不详
    PROGRAM-ELECTRONIC LIBRARY AND INFORMATION SYSTEMS, 2003, 37 (04) : 271 - 271
  • [35] WHERE FULL-TEXT IS VIABLE
    COTTON, PL
    ONLINE REVIEW, 1987, 11 (02): : 87 - 93
  • [36] Full-text linking projects
    Hoffman, DJ
    ONLINE, 2001, 25 (01): : 40 - +
  • [37] FULL-TEXT SOURCES ON COMPUSERV
    MARCUS, J
    DATABASE-THE MAGAZINE OF ELECTRONIC DATABASE REVIEWS, 1995, 18 (03): : 91 - 93
  • [38] RESEARCH INTO FULL-TEXT RETRIEVAL
    OJALA, M
    DATABASE, 1990, 13 (04): : 78 - 80
  • [39] The weaknesses of full-text searching
    Beall, Jeffrey
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 2008, 34 (05): : 438 - 444
  • [40] SEARCHING FULL-TEXT PERIODICALS - HOW FULL IS FULL
    PAGELL, R
    DATABASE, 1987, 10 (05): : 33 - 36