Visualizing the structure of Web communities based on data acquired from a search engine

Cited by: 9
Author
Murata, T [1]
Affiliation
[1] Natl Inst Informat, Tokyo 1018430, Japan
Keywords
Jaccard coefficient; visualization; Web community; Web structure mining
DOI
10.1109/TIE.2003.817486
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Discovery of Web communities, groups of Web pages sharing common interests, is important for assisting users' information retrieval from the Web. This paper describes a method for visualizing Web communities and their internal structures. Visualization of Web communities in the form of graphs enables users to access related pages easily, and it often reflects the characteristics of the Web communities. Since related Web pages are often co-referred from the same Web page, the number of co-occurrences of references in a search engine is used for measuring the relation among pages. Two URLs are given to a search engine as keywords, and the number of pages found for both URLs divided by the number of pages found for either URL, which is called the Jaccard coefficient, is calculated as the criterion for evaluating the relation between the two URLs. The value is used for determining the length of an edge in a graph so that vertices of related pages will be located close to each other. Our visualization system based on the method succeeds in clarifying various genres of Web communities, although the system does not interpret the contents of the pages. The method of calculating the Jaccard coefficient is easily processed by computer systems, and it is suitable for visualization using the data acquired from a search engine.
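The ratio described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, the hit counts, and the use of inclusion-exclusion to obtain the "either URL" count are assumptions, and the linear mapping from coefficient to edge length is hypothetical, since the abstract only states that related pages should end up close together.

```python
def jaccard(hits_a: int, hits_b: int, hits_both: int) -> float:
    """Jaccard coefficient from search-engine hit counts.

    hits_a, hits_b: pages found for each URL alone.
    hits_both: pages found for both URLs together.
    Assumption: the "either URL" count is derived by inclusion-exclusion
    rather than queried directly.
    """
    hits_either = hits_a + hits_b - hits_both
    if hits_either <= 0:
        return 0.0  # no pages mention either URL: treat as unrelated
    return hits_both / hits_either


def edge_length(coefficient: float, max_length: float = 100.0) -> float:
    """Hypothetical mapping: a stronger relation gives a shorter edge."""
    return max_length * (1.0 - coefficient)


# Made-up hit counts, for illustration only.
j = jaccard(hits_a=120, hits_b=80, hits_both=40)
print(j)               # 40 / (120 + 80 - 40) = 0.25
print(edge_length(j))  # 100 * (1 - 0.25) = 75.0
```

Any monotonically decreasing mapping would serve the same purpose here; the linear form is chosen only because it is the simplest to read.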
Pages: 860-866
Number of pages: 7