Factors affecting web page similarity

被引:0
|
作者
Tombros, A [1 ]
Ali, ZS [1 ]
机构
[1] Queen Mary Univ London, Dept Comp Sci, London E1 4NS, England
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tools that allow effective information organisation, access and navigation are becoming increasingly important on the Web. Similarity between web pages is a concept that is central to such tools. In this paper, we examine the effect that content and layout-related aspects of web pages have on web page similarity. We consider the textual content contained within common HTML tags, the structural layout of pages, and the query terms contained within pages. Our study shows that combinations of factors can yield more promising results than individual factors, and that different aspects of web pages affect similarities between pages in a different manner. We found a number of factors that, when taken into account, can result in effective measures of similarity between web pages. Query information in particular, proved to be important for the effective organisation of web pages.
引用
收藏
页码:487 / 501
页数:15
相关论文
共 50 条
  • [21] Factors affecting semantic similarity among Jukugo neighbors
    Ogawa, Taeko
    Kawakami, Masahiro
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 647 - 647
  • [22] Factors Affecting the Design of Icons in the Web Interface
    Yang, Geng
    PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON INDUSTRIAL DESIGN, VOL 2/2, 2008, : 235 - 242
  • [23] Factors affecting perceptions of Web site quality
    Loiacono, ET
    Taylor, NJ
    ASSOCIATION FOR INFORMATION SYSTEMS - PROCEEDINGS OF THE FIFTH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 1999), 1999, : 529 - 531
  • [24] Factors Affecting Faculty Web Portal Usability
    Bringula, Rex P.
    Basa, Roselle S.
    EDUCATIONAL TECHNOLOGY & SOCIETY, 2011, 14 (04): : 253 - 265
  • [25] Page similarity and the Hausdorff distance
    Robertson, C
    Robinson, JA
    SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND ITS APPLICATIONS, 1999, (465): : 755 - 759
  • [26] Predicting web page performance level based on web page characteristics
    Zhou, Junzan
    Zhang, Yun
    Zhou, Bo
    Li, Shanping
    International Journal of Web Engineering and Technology, 2015, 10 (02) : 152 - 169
  • [27] Web page scoring based on link analysis of web page sets
    Nakakubo, Hitoshi
    Nakajima, Shinsuke
    Hatano, Kenji
    Miyazaki, Jun
    Uemura, Shunsuke
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 269 - +
  • [28] A method for supporting web page design based on impression of web page
    Watanabe, M
    Yoshida, T
    Saiwaki, N
    Nishida, S
    IEEE RO-MAN 2000: 9TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2000, : 13 - 17
  • [29] Web page clustering: A hyperlink-based similarity and matrix-based hierarchical algorithms
    Hou, JY
    Zhang, YC
    Cao, JL
    WEB TECHNOLOGIES AND APPLICATIONS, 2003, 2642 : 201 - 212
  • [30] Cross-Browser Differences Detection Based on an Empirical Metric for Web Page Visual Similarity
    Xu, Zhen
    Miller, James
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2018, 18 (03)