Visual similarity comparison for Web page retrieval

被引:14
|
作者
Takama, Y [1 ]
Mitsuhashi, N [1 ]
机构
[1] Tokyo Metropolitan Univ, Tokyo 158, Japan
关键词
D O I
10.1109/WI.2005.157
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A Comparison method for Web pages in terms of visual similarity is proposed Conventional Web Information retrieval/gathering systems, such as search engines, extract keywords from HTML source files, based on which the similarity between pages is calculated. The extracted keywords are considered as semantic features representing the contents of Web pages. On the other hand, visual feature of Web pages is as important as semantic feature, because HTML is designed for visualizing a Web page in understandable manner for humans. The proposed method compares the layouts of Web pages based on image processing and graph matching. The experimental results show that the accuracy of layout analysis is 91.6% in average, and the visual similarity calculated by the proposed method is closer to the visual judgment by test subjects than color-based comparison method.
引用
收藏
页码:301 / 304
页数:4
相关论文
共 50 条
  • [41] Data Extraction from Web Forums Based on Similarity of Page Layout
    Wang, Yun
    Li, Bicheng
    Lin, Chen
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 340 - 344
  • [42] Sparse Similarity Matrix Learning for Visual Object Retrieval
    Yan, Zhicheng
    Yu, Yizhou
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [43] Learning Semantic and Visual Similarity for Endomicroscopy Video Retrieval
    Andre, Barbara
    Vercauteren, Tom
    Buchner, Anna M.
    Wallace, Michael B.
    Ayache, Nicholas
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (06) : 1276 - 1288
  • [44] That site looks 88.46% familiar: quantifying similarity of Web page design
    Martine, G
    Rugg, G
    EXPERT SYSTEMS, 2005, 22 (03) : 115 - 120
  • [45] Visual similarity at encoding and retrieval in an item recognition task
    Mate, Judit
    Baques, Josep
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2009, 62 (07): : 1277 - 1284
  • [46] AFFECT IN WEB INTERFACES: A STUDY OF THE IMPACTS OF WEB PAGE VISUAL COMPLEXITY AND ORDER
    Deng, Liqiong
    Poole, Marshall Scott
    MIS QUARTERLY, 2010, 34 (04) : 711 - 730
  • [47] Web image retrieval refinement by visual contents
    Gong, Zhiguo
    Liu, Qian
    Zhang, Jingbai
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2006, 4016 : 134 - 145
  • [48] Comparison of Similarity Coefficients for Chemical Database Retrieval
    Syuib, Mukhsin
    Arif, Shereena M.
    Malim, Nurul
    2013 FIRST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION (AIMS 2013), 2013, : 129 - 133
  • [49] Comparison of Similarity Metrics in Microarray Experiment Retrieval
    Acici, Koray
    Ogul, Hasan
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 927 - 930
  • [50] Using Anchor Text Refined by Page Importance to Improve Web Retrieval
    Zhang, Yonggang
    Lei, Kai
    Huang, Lian'en
    PROCEEDINGS OF 2012 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, VOLS I-VI, 2012, : 1200 - 1203