A Two-Tier Distributed Full-Text Indexing System

Cited by: 0
Authors
Zhang, Wei-Zhe [1]
Chen, Hui-Xiang [1]
He, Hui [1]
Chen, Gui [1]
Affiliation
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
Source
Funding
National High Technology Research and Development Program of China (863 Program)
Keywords
Distributed indexing; document partitioning; term partitioning; search efficiency; load balance;
DOI
Not available
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
The performance of the indexing system is critical to a search engine. Indexing systems deployed on large-scale clusters usually provide high search efficiency, but they incur high hardware costs. These costs could be greatly reduced if a distributed indexing system ran on small-scale clusters connected over the Internet. The two current inverted file partitioning schemes, document partitioning and term partitioning, each have their own merits. We implement a two-tier distributed full-text indexing system that uses document partitioning among the clusters and term partitioning inside each cluster. Our experiments show that the system performs well in terms of search efficiency, resource consumption, and load balance.
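The two-tier layout described in the abstract can be illustrated with a minimal in-memory sketch. This is not the authors' implementation: the class name `TwoTierIndex`, the helper `stable_hash`, hash-based placement, whitespace tokenization, and conjunctive (AND) query semantics are all assumptions made for illustration. Each document is assigned to exactly one cluster (document partitioning), and inside that cluster each term's posting list lives on exactly one node (term partitioning).

```python
from collections import defaultdict
import zlib


def stable_hash(text: str) -> int:
    """Deterministic hash; Python's built-in hash() for str is salted per process."""
    return zlib.crc32(text.encode("utf-8"))


class TwoTierIndex:
    """Hypothetical sketch: document partitioning across clusters,
    term partitioning across the nodes inside each cluster."""

    def __init__(self, num_clusters: int, nodes_per_cluster: int):
        self.num_clusters = num_clusters
        self.nodes_per_cluster = nodes_per_cluster
        # shards[cluster][node] maps a term to the set of doc ids containing it
        self.shards = [
            [defaultdict(set) for _ in range(nodes_per_cluster)]
            for _ in range(num_clusters)
        ]

    def cluster_for(self, doc_id: str) -> int:
        # Tier 1 (assumed placement rule): a document belongs to one cluster.
        return stable_hash(doc_id) % self.num_clusters

    def node_for(self, term: str) -> int:
        # Tier 2 (assumed placement rule): a term belongs to one node per cluster.
        return stable_hash(term) % self.nodes_per_cluster

    def add(self, doc_id: str, text: str) -> None:
        cluster = self.shards[self.cluster_for(doc_id)]
        for term in set(text.lower().split()):
            cluster[self.node_for(term)][term].add(doc_id)

    def search(self, query: str) -> set:
        """Return ids of documents containing every query term."""
        terms = query.lower().split()
        if not terms:
            return set()
        results = set()
        # Broadcast to every cluster; within a cluster, contact only the
        # nodes that own the query terms, then intersect their postings.
        for cluster in self.shards:
            postings = [cluster[self.node_for(t)].get(t, set()) for t in terms]
            results |= set.intersection(*postings)
        return results
```

Because a document is stored in a single cluster, intersecting posting lists per cluster and then taking the union across clusters gives the same answer as a single global index would. A query touches every cluster but at most one node per query term inside each of them.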
Pages: 321-326
Page count: 6