A Novel Architecture and Algorithm for Web Page Change Detection

Authors
Varshney, Naveen Kumar [1 ]
Sharma, Dilip Kumar [1 ]
Affiliations
[1] GLA Univ, Dept CEA, Mathura, India
Keywords
Web page change detection; Structural and content changes; Change detection; Crawlers
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
The Internet is used for the exchange of information, and people upload and update web pages on a constant basis. Because web page content changes so rapidly, a system is needed that can detect recurrent changes in minimal browsing time. This paper devises an algorithm for detecting structural changes, defines a text code formula for detecting content changes, and presents an architecture for a web page change detection system. It also analyzes and compares various web page change detection algorithms across several parameters to identify their strengths and weaknesses.
Pages: 782 - 787
Page count: 6
Related papers
50 in total
  • [1] An Enhanced Architecture and Algorithm for Web Page Change Detection
    Varshney, Naveen Kumar
    Sharma, Dilip Kumar
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER NETWORKS (ISCON), 2013, : 151 - 154
  • [2] Parallel crawler architecture and web page change detection
    Computer Science and Information Technology, Jaypee Institute of Information Technology University, Noida, India
    WSEAS Trans. Comput., 2008, 7 : 929 - 940
  • [3] An efficient Web page change detection system based on an optimized Hungarian algorithm
    Khoury, Imad
    El-Mawas, Rami M.
    El-Rawas, Oussama
    Mounayar, Elias F.
    Artail, Hassan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (05) : 599 - 613
  • [4] A NOVEL WEB PAGE DUPLICATION DETECTION FRAMEWORK
    Han, Zhongming
    Duan, Dagao
    Liu, Hongzhi
    Sun, Jianzhi
    2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 374 - 378
  • [5] Architecture and algorithm for web phishing detection
    Cao, Jiuxin
    Wang, Tianfeng
    Shi, Lili
    Mao, Bo
    Journal of Southeast University (English Edition), 2010, 26 (01) : 43 - 47
  • [6] A Novel Heuristic Page Rank Algorithm in Web Search
    He Yan-li
    OPTICAL, ELECTRONIC MATERIALS AND APPLICATIONS, PTS 1-2, 2011, 216 : 747 - 751
  • [7] Topical web crawling using weighted anchor text and web page change detection techniques
    Yadav, Divakar
    Sharma, Ak
    Gupta, J.P.
    WSEAS Transactions on Information Science and Applications, 2009, 6 (02) : 263 - 275
  • [8] A Novel and Efficient Approach For Near Duplicate Page Detection in Web Crawling
    Narayana, V. A.
    Premchand, P.
    Govardhan, A.
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1492 - +
  • [9] An Idea of Formalizing Web page Information Architecture
    Yorinori, Kishimoto
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE (ACS'08): RECENT ADVANCES ON APPLIED COMPUTER SCIENCE, 2008, : 176 - +
  • [10] CaSePer: An efficient model for personalized web page change detection based on segmentation
    Kuppusamy, K. S.
    Aghila, G.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (01) : 19 - 27