A Two-Tier Distributed Full-Text Indexing System

Authors
Zhang, Wei-Zhe [1]
Chen, Hui-Xiang [1]
He, Hui [1]
Chen, Gui [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
Source
Funding
National High Technology Research and Development Program of China (863 Program)
Keywords
Distributed indexing; document partitioning; term partitioning; search efficiency; load balance
DOI
Not available
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
The performance of the indexing system is critical for a search engine. Indexing systems on large-scale clusters usually provide high search efficiency, but they incur high hardware costs. These costs would be greatly reduced if a distributed indexing system ran on small-scale clusters connected by the Internet. The two current inverted file partitioning schemes, document partitioning and term partitioning, each have their own merits. We implement a two-tier distributed full-text indexing system that uses document partitioning among the clusters and term partitioning inside each cluster. Our experiments show that the system performs well in search efficiency, resource consumption, and load balance.
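The two-tier layout described in the abstract can be illustrated with a minimal in-memory sketch. This is not the authors' implementation: the class names (`Cluster`, `TwoTierIndex`), the hash-modulo assignment of documents and terms, the whitespace tokenizer, and the AND query semantics are all assumptions made for illustration, since the abstract does not specify them. The sketch only shows the routing idea: each document lives in exactly one cluster (document partitioning), and inside a cluster each term's posting list lives on exactly one node (term partitioning).

```python
import zlib
from collections import defaultdict


def stable_hash(key: str) -> int:
    """Deterministic hash; Python's built-in hash() for str is salted per process."""
    return zlib.crc32(key.encode("utf-8"))


class Cluster:
    """Hypothetical cluster: its inverted index is term-partitioned across nodes."""

    def __init__(self, num_nodes: int):
        # Each node holds term -> set of doc ids for its slice of the vocabulary.
        self.nodes = [defaultdict(set) for _ in range(num_nodes)]

    def node_for(self, term: str) -> int:
        # Assumption: terms are assigned to nodes by hash modulo node count.
        return stable_hash(term) % len(self.nodes)

    def add(self, doc_id: str, text: str) -> None:
        for term in set(text.lower().split()):
            self.nodes[self.node_for(term)][term].add(doc_id)

    def search(self, terms: list[str]) -> set[str]:
        # Look up each term only on the node that owns it,
        # then intersect the posting lists (AND query).
        postings = [self.nodes[self.node_for(t)].get(t, set()) for t in terms]
        return set.intersection(*postings) if postings else set()


class TwoTierIndex:
    """Hypothetical top tier: documents are partitioned across clusters."""

    def __init__(self, num_clusters: int, nodes_per_cluster: int):
        self.clusters = [Cluster(nodes_per_cluster) for _ in range(num_clusters)]

    def cluster_for(self, doc_id: str) -> int:
        # Assumption: documents are assigned to clusters by hash modulo cluster count.
        return stable_hash(doc_id) % len(self.clusters)

    def add(self, doc_id: str, text: str) -> None:
        self.clusters[self.cluster_for(doc_id)].add(doc_id, text)

    def search(self, query: str) -> set[str]:
        # Every cluster holds a disjoint set of documents, so the query is
        # sent to all clusters and the partial results are unioned.
        terms = query.lower().split()
        hits: set[str] = set()
        for cluster in self.clusters:
            hits |= cluster.search(terms)
        return hits
```

In this sketch a query crosses the wide-area link once per cluster, while the per-term lookups and posting-list intersections stay inside a cluster, which is one plausible reading of why the abstract places document partitioning at the inter-cluster tier.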
Pages: 321-326
Page count: 6