Improved Compressed Indexes for Full-Text Document Retrieval

Cited by: 0
Authors
Belazzougui, Djamal [1, 2]
Navarro, Gonzalo [2]
Affiliations
[1] Univ Paris 07, LIAFA, F-75221 Paris 05, France
[2] Univ Chile, Dept Comp Sci, Santiago, Chile
Keywords
EFFICIENT ALGORITHMS; QUERIES;
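DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract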
We give new space/time tradeoffs for compressed indexes that answer document retrieval queries on general sequences. On a collection of D documents of total length n, current approaches require at least |CSA| + O(n lg D / lg lg D) or 2|CSA| + o(n) bits of space, where CSA is a full-text index. Using monotone minimum perfect hash functions, we give new algorithms for document listing with frequencies and top-k document retrieval using just |CSA| + O(n lg lg lg D) bits. We also improve current solutions that use 2|CSA| + o(n) bits, and consider other problems such as colored range listing, top-k most important documents, and computing arbitrary frequencies.
Pages: 386 / +
Page count: 3
Related papers
50 items in total
  • [41] AN EVALUATION OF THE APPLICABILITY OF RANKING ALGORITHMS TO IMPROVE THE EFFECTIVENESS OF FULL-TEXT RETRIEVAL .2. ON THE EFFECTIVENESS OF RANKING ALGORITHMS ON FULL-TEXT RETRIEVAL
    RO, JS
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1988, 39 (03): 147-160
  • [42] RANKING DOCUMENT OUTPUT BASED ON FULL-TEXT INFORMATION
    KHARIN, NP
    NAUCHNO-TEKHNICHESKAYA INFORMATSIYA SERIYA 2-INFORMATSIONNYE PROTSESSY I SISTEMY, 1991, (02): 11-15
  • [43] FULL-TEXT DOCUMENT DELIVERY ONLINE - IT MAKES SENSE
    BJORNER, SN
    ONLINE, 1990, 14 (05): 109-112
  • [44] The Study on Key Technology of Mongolian Full-Text Retrieval
    Loglo, S.
    Sarula
    ADVANCES IN COMPUTER SCIENCE, ENVIRONMENT, ECOINFORMATICS, AND EDUCATION, PT IV, 2011, 217: 340-345
  • [45] INTEGRATION OF MENU RETRIEVAL AND BOOLEAN RETRIEVAL FROM A FULL-TEXT DATABASE
    WATTERS, CR
    SHEPHERD, MA
    GRUNDKE, EW
    BODORIK, P
    ONLINE REVIEW, 1985, 9 (05): 391-401
  • [46] A full-text information retrieval system for an epidemiological registry
    Cuggia, Marc
    Bayat, Sahar
    Garcelon, Nicolas
    Sanders, Lauren
    Rouget, Florence
    Coursin, Arnaud
    Pladys, Patrick
    MEDINFO 2010, PTS I AND II, 2010, 160: 491-495
  • [47] ADDING COMPRESSION TO A FULL-TEXT RETRIEVAL-SYSTEM
    ZOBEL, J
    MOFFAT, A
    SOFTWARE-PRACTICE & EXPERIENCE, 1995, 25 (08): 891-903
  • [48] Expanded information retrieval using full-text searching
    Kostoff, Ronald N.
    JOURNAL OF INFORMATION SCIENCE, 2010, 36 (01): 104-113
  • [49] ACTS - AN AUTOMATIC CHINESE TEXT SEGMENTATION SYSTEM FOR FULL-TEXT RETRIEVAL
    WU, ZM
    TSENG, G
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1995, 46 (02): 83-96
  • [50] Subject retrieval from full-text databases in the humanities
    East, John W.
    PORTAL-LIBRARIES AND THE ACADEMY, 2007, 7 (02): 227-241