Improved self-indexing inverted files for full-text retrieval

被引:0
|
作者
College of Compute Science, South-Central University for Nationalities, Wuhan 430074, China [1 ]
不详 [2 ]
机构
来源
J. Comput. Inf. Syst. | 2009年 / 2卷 / 1017-1024期
关键词
Indexing (of information) - Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Self-index is a promising way to improve retrieval time-and-space efficiency by compression index files. An improved inverted file self-index called IFSI is proposed for full-text information retrieval. IFSI includes two level indexes: the first level index which contains a subset of the documents that are likely to be returned as top results; and the second level index which includes the surplus documents. IFSI can create a skipped index on each compressed posting list with very little or no storage overhead with efficient coding scheme. IFSI also supports efficient incremental updates with allocating free space efficiently at the tail of post lists based on statistics-based approach. Detailed simulation results and comparison with other schemes prove that the proposed IFSI can not only greatly reduce decompress time, but also simultaneously allow extremely fast query processing. © 2009 Binary Information Press March, 2009.
引用
收藏
相关论文
共 50 条
  • [31] InfoBee/TR - a full-text retrieval system
    NTT Human Interface Labs
    NTT R&D, 10 (1103-1108):
  • [32] The Performance Study of Database Full-Text Retrieval
    Wu, Daiwen
    MODERN INDUSTRIAL IOT, BIG DATA AND SUPPLY CHAIN, IIOTBDSC 2020, 2021, 218 : 239 - 247
  • [33] A public library based on full-text retrieval
    Witten, IH
    Nevill-Manning, C
    McNab, R
    Cunningham, SJ
    COMMUNICATIONS OF THE ACM, 1998, 41 (04) : 71 - 75
  • [34] APPLICATION OF FULL-TEXT RETRIEVAL TO LITIGATION SUPPORT
    RUBIN, JS
    FORUM-AMERICAN BAR ASSOCIATION, 1976, 11 (04): : 1136 - 1141
  • [35] OPTOELECTRONIC FULL-TEXT RETRIEVAL-SYSTEM
    KIM, YW
    BERRA, PB
    OPTICAL ENGINEERING, 1992, 31 (05) : 906 - 914
  • [36] Full-text Retrieval System for Humanities Researches
    Murakawa, Takehiko
    Watagami, Yukiharu
    Utsunomiya, Keigo
    Nakagawa, Masaru
    KNOWLEDGE-BASED SOFTWARE ENGINEERING, 2012, 240 : 118 - +
  • [37] FULL-TEXT COMPUTER RETRIEVAL OF MEDICAL LITERATURE
    LLAURADO, JG
    INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1986, 18 (3-4): : 161 - 163
  • [38] A Method of Full-text Retrieval Based on Lucene
    Chen, Xiangrong
    Sun, Yong
    Ge, Xiaopei
    Wang, Congwei
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 217 - 220
  • [39] Combining Text Compression and String Matching: The Miracle of Self-Indexing
    Navarro, Conzalo
    PROCEEDINGS OF THE PRAGUE STRINGOLOGY CONFERENCE 2009, 2009, : 1 - 1
  • [40] SOFTWARE FOR INFORMATION-STORAGE AND RETRIEVAL TESTED, EVALUATED AND COMPARED .4. INDEXING AND FULL-TEXT RETRIEVAL PROGRAMS
    SIEVERTS, EG
    HOFSTEDE, M
    GROENIGER, BO
    ELECTRONIC LIBRARY, 1992, 10 (04): : 195 - 208