Improved self-indexing inverted files for full-text retrieval

被引:0
|
作者
College of Compute Science, South-Central University for Nationalities, Wuhan 430074, China [1 ]
不详 [2 ]
机构
来源
J. Comput. Inf. Syst. | 2009年 / 2卷 / 1017-1024期
关键词
Indexing (of information) - Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Self-index is a promising way to improve retrieval time-and-space efficiency by compression index files. An improved inverted file self-index called IFSI is proposed for full-text information retrieval. IFSI includes two level indexes: the first level index which contains a subset of the documents that are likely to be returned as top results; and the second level index which includes the surplus documents. IFSI can create a skipped index on each compressed posting list with very little or no storage overhead with efficient coding scheme. IFSI also supports efficient incremental updates with allocating free space efficiently at the tail of post lists based on statistics-based approach. Detailed simulation results and comparison with other schemes prove that the proposed IFSI can not only greatly reduce decompress time, but also simultaneously allow extremely fast query processing. © 2009 Binary Information Press March, 2009.
引用
收藏
相关论文
共 50 条
  • [21] Compression and full-text indexing for digital libraries
    Witten, IH
    Moffat, A
    Bell, TC
    DIGITAL LIBRARIES: CURRENT ISSUES, 1995, 916 : 181 - 201
  • [22] Full-text information retrieval: Introduction
    Sievert, MC
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1996, 47 (04): : 261 - 262
  • [23] SkipBlock: Self-indexing for Block-Based Inverted List
    Campinas, Stephane
    Delbru, Renaud
    Tummarello, Giovanni
    ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 555 - 561
  • [24] HECATE: A FULL-TEXT RETRIEVAL SYSTEM FOR SHORT TEXT
    Wang, Song
    Xiong, Yongping
    PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS AND INFORMATION TECHNOLOGY PROCESSING (AMITP 2016), 2016, 60 : 395 - 405
  • [25] AN EVALUATION OF THE APPLICABILITY OF RANKING ALGORITHMS TO IMPROVE THE EFFECTIVENESS OF FULL-TEXT RETRIEVAL .1. ON THE EFFECTIVENESS OF FULL-TEXT RETRIEVAL
    RO, JS
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1988, 39 (02): : 73 - 78
  • [26] Adjacency matrix based full-text indexing models
    Zhou, SG
    Guan, JH
    Hu, YF
    Hu, JT
    Zhou, AY
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2001, 2118 : 60 - 71
  • [27] Fragmented BWT: An Extended BWT for Full-Text Indexing
    Ito, Masaru
    Inoue, Hiroshi
    Taura, Kenjiro
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2016, 2016, 9954 : 97 - 109
  • [28] Full-text indexing of non-textual resources
    Byers, D
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 141 - 148
  • [29] Adjacency matrix based full-text indexing models
    Zhou, Shui-Geng
    Hu, Yun-Fa
    Guan, Ji-Hong
    Ruan Jian Xue Bao/Journal of Software, 2002, 13 (10): : 1933 - 1942
  • [30] LOCAL FEEDBACK IN FULL-TEXT RETRIEVAL SYSTEMS
    ATTAR, R
    FRAENKEL, AS
    JOURNAL OF THE ACM, 1977, 24 (03) : 397 - 417