Organizing WWW images based on the analysis of page layout and web link structure

被引:0
|
作者
Cai, D [1 ]
He, XF [1 ]
Ma, WY [1 ]
Wen, JR [1 ]
Zhang, HJ [1 ]
机构
[1] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for effective and efficient method for organizing and retrieving the images available. This paper describes a method for clustering and embedding WWW images. By using a vision-based page segmentation algorithm, a web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image. By extracting the page-to-block, block-to-image, block-to-page relationships through link structure and page layout analysis, we construct an image graph. With the image graph model, we use techniques from spectral graph theory for image clustering and embedding. Some experimental results are given in the paper.
引用
收藏
页码:113 / 116
页数:4
相关论文
共 50 条
  • [31] Document Structure Meets Page Layout: Loopy Random Fields for Web News Content Extraction
    Spengler, Alex
    Gallinari, Patrick
    DOCENG2010: PROCEEDINGS OF THE 2010 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2010, : 151 - 160
  • [32] Layout-Tree-based Approach for Identifying Visually Similar Blocks in a Web Page
    Zeng, Jun
    Flanagan, Brendan
    Hirokawa, Sachio
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 65 - 70
  • [33] Gender and Leadership: A Frame Analysis of University Home Web Page Images
    Hoover K.F.
    O'Neil D.A.
    Poutiatine M.
    Journal of Academic Ethics, 2014, 12 (1) : 15 - 27
  • [34] Chinese web page classification based on self-organizing mapping neural networks
    Liang, JZ
    ICCIMA 2003: FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2003, : 96 - 101
  • [35] The impact of web site structure on link analysis
    Mandl, Thomas
    INTERNET RESEARCH, 2007, 17 (02) : 196 - 206
  • [36] Research on web page classification-based core characteristics and web structure
    Zengmin, Geng
    Jianxia, Du
    International Journal of Wireless and Mobile Computing, 2014, 7 (03) : 253 - 257
  • [37] Analysis of data obtained across of CHAEA questionnaire on line in web page www.estilosdeaprendizaje.es
    Garcia Cue, Jose Luis
    Santizo Rincon, Jose Antonio
    JOURNAL OF LEARNING STYLES, 2008, 1 (02): : 84 - 109
  • [38] Web page analysis: Experiments based on discussion and purchase web patterns
    Kocibova, Jana
    Klos, Karel
    Lehecka, Ondrej
    Kudelka, Milos
    Snasel, Vaclav
    PROCEEDING OF THE 2007 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS, 2007, : 221 - 225
  • [39] The digital classroomCrafting a web-based lesson Part two: Organizing the information and constructing the page
    Laurie A. Quinlan
    TechTrends, 1997, 42 (1) : 6 - 8
  • [40] Link recommendation in web index page based on multi-instance learning techniques
    National Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China
    Jisuanji Yanjiu yu Fazhan, 2007, 3 (406-411): : 406 - 411