Visualizing the structure of Web communities based on data acquired from a search engine

被引:9
|
作者
Murata, T [1 ]
机构
[1] Natl Inst Informat, Tokyo 1018430, Japan
关键词
Jaccard coefficient; visualization; Web community; Web structure mining;
D O I
10.1109/TIE.2003.817486
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discovery of Web communities, groups of Web pages sharing common interests, is important for assisting users' information retrieval from the Web. This paper describes a method for visualizing Web communities and their internal structures. Visualization of Web communities in the form of graphs enables users to access related pages easily, and it often reflects the characteristics of the Web communities: Since related Web pages are often co-referred from the same Web page, the number of co-occurrences of references in a search engine is used for measuring the relation among pages. Two URLs are given to a search engine as keywords, and the value of the number of pages searched from both URLs divided by the number of pages searched from either URL, which is called the Jaccard coefficient, is calculated as the criteria for evaluating the relation between the two URLs. The value is used for determining the length of an edge in a graph so that vertices of related pages will be located close to each other. Our visualization system based on the method succeeds in clarifying,various genres of Web communities; although the system does not interpret the contents of the pages. The method of calculating the Jaccard coefficient is easily processed by computer systems, and it is suitable for visualization using the data acquired from a search engine.
引用
收藏
页码:860 / 866
页数:7
相关论文
共 50 条
  • [1] Web Search Based on Web Communities Feedback Data
    Adda, Mehdi
    Missaoui, Rokia
    Valtchev, Petko
    E-TECHNOLOGIES-INNOVATION IN AN OPEN WORLD, 2009, 26 : 169 - +
  • [2] Concept-based Web communities for Google™ search engine
    Tomiyama, T
    Ohgaya, R
    Shinmura, A
    Kawabata, T
    Takagi, T
    Nikravesh, M
    PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 1122 - 1128
  • [3] A decentralized search engine for dynamic Web communities
    Wang, Daze
    Tse, Quincy Chi Kwan
    Zhou, Ying
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 26 (01) : 105 - 125
  • [4] A decentralized search engine for dynamic Web communities
    Daze Wang
    Quincy Chi Kwan Tse
    Ying Zhou
    Knowledge and Information Systems, 2011, 26 : 105 - 125
  • [5] Personalized Intelligent Search Engine Based on Web Data Mining
    Zhang, Hong
    Ma, Yanhong
    Zhang, Qiuyu
    Xie, Pengshou
    Bao, Zhongxian
    PROCEEDINGS OF 2009 INTERNATIONAL WORKSHOP ON INFORMATION SECURITY AND APPLICATION, 2009, : 584 - 587
  • [6] CIMG-BSDS: Image Clustering Based on Bookshelf Data Structure in Web Search Engine Visualization
    Jayanthi, S. K.
    Prema, S.
    GLOBAL TRENDS IN COMPUTING AND COMMUNICATION SYSTEMS, PT 1, 2012, 269 : 457 - +
  • [7] Research on the Optimization Strategy of Web Search Engine Based on Data Mining
    Chen, Ronghua
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [8] GeNemo: a search engine for web-based functional genomic data
    Zhang, Yongqing
    Cao, Xiaoyi
    Zhong, Sheng
    NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) : W122 - W127
  • [9] Web search engine based on DNS
    Wang Liang
    Guo Yi-Ping
    Fang Ming
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (02) : 466 - 478
  • [10] A Novel Architecture for Search Engine using Domain Based Web Log Data
    Sharma, Prem
    Yadav, Divakar
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (01) : 92 - 101