Load Balancing Scheme on the Basis of Huffman Coding for P2P Information Retrieval

Times cited: 0
Authors
Kurasawa, Hisashi [1]
Takasu, Atsuhiro [2]
Adachi, Jun [2]
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1018430, Japan
[2] Natl Inst Informat, Tokyo 1018430, Japan
Keywords
peer-to-peer; information retrieval; load balancing; Huffman-coding
DOI
10.1587/transinf.E92.D.2064
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Although a distributed index on a distributed hash table (DHT) enables efficient document query processing in Peer-to-Peer information retrieval (P2P IR), the index is costly to construct, and its management load tends to be shared unfairly among peers because the term frequency distribution is unbalanced. We devised a new distributed index, named Huffman-DHT, for P2P IR. The new index uses an algorithm similar to Huffman coding and modifies the DHT structure on the basis of the term distribution. In a Huffman-DHT, a frequent term is assigned a short ID and allocated a large space in the node ID space of the DHT. Through this ID management, the Huffman-DHT balances the index registration accesses among peers and reduces load concentrations. Huffman-DHT is the first approach to adapt concepts of coding theory and term frequency distribution to load balancing. We evaluated this approach in experiments using a document collection and assessed its load balancing capabilities in P2P IR. The experimental results indicated that it is most effective when the P2P system consists of about 30,000 nodes and contains many documents. Moreover, we proved that a Huffman-DHT can be constructed easily by estimating the probability distribution of term occurrence from a small number of sample documents.
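The idea that a frequent term receives a short ID and therefore a large slice of the ID space can be illustrated with a minimal sketch. It uses plain Huffman coding, not the modified algorithm of the paper, and it assumes that a term whose ID is n bits long owns every node ID starting with that prefix, that is, a fraction 2^-n of the ID space. The function names, the sample documents, and this prefix-to-share mapping are hypothetical and chosen only for illustration.

```python
import heapq
from collections import Counter
from fractions import Fraction


def huffman_code_lengths(freqs):
    """Return {term: code length in bits} for a binary Huffman code."""
    if len(freqs) == 1:
        return {term: 1 for term in freqs}
    # Heap entries are (weight, tie_breaker, terms in this subtree).
    # The unique tie_breaker keeps the lists from ever being compared.
    heap = [(f, i, [t]) for i, (t, f) in enumerate(sorted(freqs.items()))]
    heapq.heapify(heap)
    lengths = dict.fromkeys(freqs, 0)
    next_id = len(heap)
    while len(heap) > 1:
        f1, _, terms1 = heapq.heappop(heap)
        f2, _, terms2 = heapq.heappop(heap)
        # Merging two subtrees pushes every term in them one level deeper.
        for term in terms1 + terms2:
            lengths[term] += 1
        heapq.heappush(heap, (f1 + f2, next_id, terms1 + terms2))
        next_id += 1
    return lengths


def id_space_share(lengths):
    """Assumed mapping: an n-bit ID prefix covers 2**-n of the ID space."""
    return {term: Fraction(1, 2 ** n) for term, n in lengths.items()}


# Term frequencies estimated from a few made-up sample documents.
sample_docs = [
    "peer index peer query",
    "peer index load",
    "peer hash",
]
freqs = Counter(word for doc in sample_docs for word in doc.split())

lengths = huffman_code_lengths(freqs)
shares = id_space_share(lengths)

for term in sorted(freqs, key=freqs.get, reverse=True):
    print(term, freqs[term], lengths[term], shares[term])
```

In this sketch the most frequent term ends up with the shortest ID and the largest share, and the shares of all terms add up to the whole ID space, which is the property the abstract relies on for spreading index registration accesses.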
Pages: 2064-2072
Number of pages: 9