Improved self-indexing inverted files for full-text retrieval

被引:0
|
作者
College of Compute Science, South-Central University for Nationalities, Wuhan 430074, China [1 ]
不详 [2 ]
机构
来源
J. Comput. Inf. Syst. | 2009年 / 2卷 / 1017-1024期
关键词
Indexing (of information) - Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Self-index is a promising way to improve retrieval time-and-space efficiency by compression index files. An improved inverted file self-index called IFSI is proposed for full-text information retrieval. IFSI includes two level indexes: the first level index which contains a subset of the documents that are likely to be returned as top results; and the second level index which includes the surplus documents. IFSI can create a skipped index on each compressed posting list with very little or no storage overhead with efficient coding scheme. IFSI also supports efficient incremental updates with allocating free space efficiently at the tail of post lists based on statistics-based approach. Detailed simulation results and comparison with other schemes prove that the proposed IFSI can not only greatly reduce decompress time, but also simultaneously allow extremely fast query processing. © 2009 Binary Information Press March, 2009.
引用
收藏
相关论文
共 50 条
  • [41] COMPLETE INVERTED FILES FOR EFFICIENT TEXT RETRIEVAL AND ANALYSIS
    BLUMER, A
    BLUMER, J
    HAUSSLER, D
    MCCONNELL, R
    EHRENFEUCHT, A
    JOURNAL OF THE ACM, 1987, 34 (03) : 578 - 595
  • [42] THE NEW ERA OF FULL-TEXT FILES - ACS JOURNALS ONLINE
    GARSON, LR
    TERRANT, SW
    COHEN, SM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1984, 187 (APR): : 10 - CINF
  • [43] SEARCHING IAC FULL-TEXT FILES - ITS AWFULLY CONFUSING
    PAGELL, R
    DATABASE, 1987, 10 (05): : 39 - 46
  • [44] A Two-Tier Distributed Full-Text Indexing System
    Zhang, Wei-Zhe
    Chen, Hui-Xiang
    He, Hui
    Chen, Gui
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2014, 8 (01): : 321 - 326
  • [45] Full-text and structural XML indexing on B+-tree
    Shimizu, T
    Yoshikawa, M
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 451 - 460
  • [46] INDEXING AND COMPRESSING FULL-TEXT DATABASES FOR CD-ROM
    WITTEN, IH
    BELL, TC
    NEVILL, CG
    JOURNAL OF INFORMATION SCIENCE, 1991, 17 (05) : 265 - 271
  • [47] Improved Space-Time Tradeoffs for Approximate Full-Text Indexing with One Edit Error
    Djamal Belazzougui
    Algorithmica, 2015, 72 : 791 - 817
  • [48] Improved Space-Time Tradeoffs for Approximate Full-Text Indexing with One Edit Error
    Belazzougui, Djamal
    ALGORITHMICA, 2015, 72 (03) : 791 - 817
  • [49] AN EVALUATION OF THE APPLICABILITY OF RANKING ALGORITHMS TO IMPROVE THE EFFECTIVENESS OF FULL-TEXT RETRIEVAL .2. ON THE EFFECTIVENESS OF RANKING ALGORITHMS ON FULL-TEXT RETRIEVAL
    RO, JS
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1988, 39 (03): : 147 - 160
  • [50] The Study on Key Technology of Mongolian Full-Text Retrieval
    Loglo, S.
    Sarula
    ADVANCES IN COMPUTER SCIENCE, ENVIRONMENT, ECOINFORMATICS, AND EDUCATION, PT IV, 2011, 217 : 340 - 345